首页 > 最新文献

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications最新文献

英文 中文
Voice and audio processing for telephony applications in AT&T AT&T电话应用的语音和音频处理
J.E. Tschirgi
The paper provides an overview of AT&T's collective efforts to develop voice and audio processing (VAP) for network applications. The first section of the paper discusses AT&T's focused efforts to manage technology resources from across the company. Next, key technology challenges within the public network market are discussed. These include required advances in technologies for new and existing markets, and the interaction of VAP technologies with specific applications. Finally, the paper discusses key challenges for managing the technology, including the transition from low-risk to higher-risk markets and the importance of setting expectations for technology performance.<>
本文概述了AT&T为网络应用开发语音和音频处理(VAP)的集体努力。论文的第一部分讨论了AT&T集中精力管理整个公司的技术资源。接下来,讨论了公共网络市场中的关键技术挑战。其中包括新市场和现有市场所需的技术进步,以及VAP技术与特定应用程序的相互作用。最后,本文讨论了管理技术的关键挑战,包括从低风险市场到高风险市场的转变,以及为技术绩效设定预期的重要性。
{"title":"Voice and audio processing for telephony applications in AT&T","authors":"J.E. Tschirgi","doi":"10.1109/IVTTA.1994.341545","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341545","url":null,"abstract":"The paper provides an overview of AT&T's collective efforts to develop voice and audio processing (VAP) for network applications. The first section of the paper discusses AT&T's focused efforts to manage technology resources from across the company. Next, key technology challenges within the public network market are discussed. These include required advances in technologies for new and existing markets, and the interaction of VAP technologies with specific applications. Finally, the paper discusses key challenges for managing the technology, including the transition from low-risk to higher-risk markets and the importance of setting expectations for technology performance.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130647532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech recognition issues for directory assistance applications 目录协助应用程序的语音识别问题
C. Kamm, C. Shamieh, S. Singhal
Telephone companies in the United States handle over 6 billion directory assistance (DA) calls each year. Automation of even a portion of DA calls could significantly reduce the cost of DA services. The paper explores two factors affecting successful automation of DA: a) the effect of directory size on speech recognition performance, and b) the complexity of existing DA call interactions. Speech recognition performance for a set of 200 spoken names was measured for directories ranging from 200 to 1.5 million unique names. Recognition accuracy decreased from 82.5 percent for a 200-name directory to 18.5 percent for a 1.5 million name directory. In part because high recognition accuracy is not easily achievable for these very large, low-context directories, it is likely that initial implementations of DA automation will focus on a small percentage of calls, requiring a smaller vocabulary. To maximize the potential savings, listings that are most frequently requested appear to be the optimal vocabulary. To identify critical issues in automating frequent DA requests, approximately 13,000 DA calls from an office near a major metropolitan area in the United States were studied. In this sample, 245 listings covered 10 percent of the call volume, and 870 listings covered 20 percent of the call volume.<>
美国的电话公司每年要处理超过60亿的目录查询(DA)电话。即使是部分数据处理调用的自动化也可以显著降低数据处理服务的成本。本文探讨了影响自动数据挖掘成功自动化的两个因素:a)目录大小对语音识别性能的影响;b)现有自动数据挖掘调用交互的复杂性。在200到150万个不同名字的目录中,对一组200个名字的语音识别性能进行了测试。识别准确率从200个名称目录的82.5%下降到150万个名称目录的18.5%。部分原因是对于这些非常大的、低上下文的目录不容易实现高识别精度,因此数据处理自动化的初始实现很可能只关注一小部分调用,需要更小的词汇表。为了最大化潜在的节省,最常被请求的清单似乎是最佳的词汇表。为了确定频繁的数据处理请求自动化中的关键问题,研究了来自美国一个主要大都市附近的一个办公室的大约13,000个数据处理呼叫。在这个示例中,245个列表覆盖了10%的呼叫量,870个列表覆盖了20%的呼叫量。
{"title":"Speech recognition issues for directory assistance applications","authors":"C. Kamm, C. Shamieh, S. Singhal","doi":"10.1109/IVTTA.1994.341552","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341552","url":null,"abstract":"Telephone companies in the United States handle over 6 billion directory assistance (DA) calls each year. Automation of even a portion of DA calls could significantly reduce the cost of DA services. The paper explores two factors affecting successful automation of DA: a) the effect of directory size on speech recognition performance, and b) the complexity of existing DA call interactions. Speech recognition performance for a set of 200 spoken names was measured for directories ranging from 200 to 1.5 million unique names. Recognition accuracy decreased from 82.5 percent for a 200-name directory to 18.5 percent for a 1.5 million name directory. In part because high recognition accuracy is not easily achievable for these very large, low-context directories, it is likely that initial implementations of DA automation will focus on a small percentage of calls, requiring a smaller vocabulary. To maximize the potential savings, listings that are most frequently requested appear to be the optimal vocabulary. To identify critical issues in automating frequent DA requests, approximately 13,000 DA calls from an office near a major metropolitan area in the United States were studied. In this sample, 245 listings covered 10 percent of the call volume, and 870 listings covered 20 percent of the call volume.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122714333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Towards non-uniform unit HMMs for speech recognition 面向语音识别的非均匀单位hmm
T. Matsumura, S. Matsunaga
A novel acoustic modeling algorithm that generates non-uniform unit HMMs to effectively cope with spectral variations in fluent speech is proposed. The algorithm is devised for the automatic iterative generation of long-span units for the non-uniform modeling. This generation algorithm is based on an entropy reduction criterion using text data and a maximum likelihood criterion using speech data. The effectiveness of the non-uniform models was confirmed by comparing likelihood values between the long-span unit HMMs and the conventional phoneme-unit HMMs. Preliminary results suggest that non-uniform unit HMMs achieve higher performance than phoneme-unit HMMs.<>
提出了一种新的生成非均匀单元hmm的声学建模算法,以有效应对流利语音中的频谱变化。针对非均匀建模问题,设计了大跨度单元的自动迭代生成算法。该生成算法基于文本数据的熵降准则和语音数据的最大似然准则。通过比较大跨度单位hmm与传统音素单位hmm的似然值,验证了非均匀模型的有效性。初步结果表明,非均匀单位hmm比音素单位hmm具有更高的性能
{"title":"Towards non-uniform unit HMMs for speech recognition","authors":"T. Matsumura, S. Matsunaga","doi":"10.1109/IVTTA.1994.341538","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341538","url":null,"abstract":"A novel acoustic modeling algorithm that generates non-uniform unit HMMs to effectively cope with spectral variations in fluent speech is proposed. The algorithm is devised for the automatic iterative generation of long-span units for the non-uniform modeling. This generation algorithm is based on an entropy reduction criterion using text data and a maximum likelihood criterion using speech data. The effectiveness of the non-uniform models was confirmed by comparing likelihood values between the long-span unit HMMs and the conventional phoneme-unit HMMs. Preliminary results suggest that non-uniform unit HMMs achieve higher performance than phoneme-unit HMMs.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127596486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Utterance promoting methods on speech dialog systems 语音对话系统中的话语提升方法
H. Nishi, M. Kitai
New dialog promoting methods for voice storage dialog systems are discussed. The paper describes two hypotheses that attempt to put at ease those callers who hesitate or are shy by designing an interactive dialog or offering attractive information to the caller. The following two hypotheses are introduced. The first is the less information per utterance required by the system, the more comfortable the user feels. The second is the more attractive the information is that is given after an utterance by the system, the more the caller will want to have a message. The experimental conditions are explained to evaluate the previous hypotheses using an opinion score. The system consists of a PC, telephone line interface board, and control software. For eliminating the effect of recognition accuracy, dialogs are designed without speech recognition. Finally, the experimental results are described which indicate the usefulness of each hypothesis, and in addition these results show an increase of 0.7-0.8 points in a mean opinion score.<>
讨论了语音存储对话系统中新的对话提升方法。这篇论文描述了两种假设,它们试图通过设计一个互动对话或向呼叫者提供有吸引力的信息来让那些犹豫或害羞的呼叫者放松下来。介绍了以下两个假设。首先,系统每句话需要的信息越少,用户感觉就越舒服。第二,系统发出的信息越吸引人,呼叫者就越想要留言。解释实验条件,以评估先前的假设使用意见评分。该系统由PC机、电话线接口板和控制软件组成。为了消除对识别精度的影响,在设计对话框时不进行语音识别。最后,对实验结果进行了描述,说明了每个假设的有效性,并且这些结果表明平均意见得分增加了0.7-0.8分
{"title":"Utterance promoting methods on speech dialog systems","authors":"H. Nishi, M. Kitai","doi":"10.1109/IVTTA.1994.341533","DOIUrl":"https://doi.org/10.1109/IVTTA.1994.341533","url":null,"abstract":"New dialog promoting methods for voice storage dialog systems are discussed. The paper describes two hypotheses that attempt to put at ease those callers who hesitate or are shy by designing an interactive dialog or offering attractive information to the caller. The following two hypotheses are introduced. The first is the less information per utterance required by the system, the more comfortable the user feels. The second is the more attractive the information is that is given after an utterance by the system, the more the caller will want to have a message. The experimental conditions are explained to evaluate the previous hypotheses using an opinion score. The system consists of a PC, telephone line interface board, and control software. For eliminating the effect of recognition accuracy, dialogs are designed without speech recognition. Finally, the experimental results are described which indicate the usefulness of each hypothesis, and in addition these results show an increase of 0.7-0.8 points in a mean opinion score.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132552284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1