Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications最新文献

英文中文

Fast telephone channel adaptation based on vector field smoothing technique 基于矢量场平滑技术的快速电话信道自适应

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341536

J. Takahashi, S. Sagayama

The paper presents a fast telephone channel adaptation method of MAP/VFS with a sequential training function. The concept is based on using maximum a posteriori (MAP) estimation as an intra-class training scheme in combination with vector field smoothing (VFS) technique as an inter-class training scheme. Experimental results of simultaneous adaptation to a telephone channel and a speaker show the proposed method is significantly superior to sequential MAP adaptation. The error reduction rate achieved in sequentially adapting a few words of sample data is about 41% using the proposed method, while that of the sequential MAP adaptation hardly improved even with ten-word adaptation data. MAP/VFS, with its fast and sequential adaptation function, is expected to be very useful in developing telephone applications such as information services proceeded by iterative tree-structured item selection.<>

提出了一种带序列训练函数的MAP/VFS电话信道快速自适应方法。该概念基于最大后验估计(MAP)作为类内训练方案，结合向量场平滑(VFS)技术作为类间训练方案。对电话信道和说话人同时自适应的实验结果表明，该方法明显优于序贯MAP自适应。采用该方法对样本数据进行少量词序自适应的误差率约为41%，而对MAP序列自适应的误差率即使是10词自适应也几乎没有提高。MAP/VFS具有快速和顺序的自适应功能，有望在开发由迭代树状结构项目选择进行的信息服务等电话应用程序中发挥重要作用。

引用次数: 3

Field trial of a speaker verification service for caller identity verification in the telephone network 电话网络中用于呼叫者身份验证的说话人验证服务的现场试验

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341529

J. Naik

A field trial of a network-integrated speaker verification system was performed in the NYNEX public switched telephone network in 1993-94. Speaker verification was performed on all calling-card calls placed by NYNEX customers who took part in this trial. Subsequently, a comprehensive impostor field-trial was performed. A variety of phones, channel conditions and caller/calling environments were represented in this large field-trial. The results show that this system performed very well under these real-world conditions. A valid user rejection rate of 1%, which is operationally very desirable, produced an equally low dedicated impostor acceptance of 3.9%. User surveys showed high user preference of this type of service. The paper discusses the results of the field trial in detail.<>

1993- 1994年在NYNEX公共交换电话网中进行了网络综合说话人核查系统的现场试验。对参加本次试验的NYNEX客户拨打的所有电话卡进行了说话人验证。随后，进行了全面的冒名顶替者现场试验。在这个大型现场试验中，代表了各种电话、信道条件和呼叫者/呼叫环境。结果表明，该系统在这些实际条件下表现良好。有效的用户拒绝率为1%，这在操作上是非常理想的，产生了同样低的专用冒名顶替者接受率为3.9%。用户调查显示，用户对这类服务有很高的偏好。本文对田间试验的结果作了较详细的讨论

引用次数: 4

A voice transaction processing application with PSOLA based text to speech conversion for Spanish 一个基于PSOLA的西班牙语文本到语音转换的语音事务处理应用程序

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341534

I. Hernáez, A. Cuesta

Presents a new synthesis scheme for voice transaction processing application. The system makes use of concatenation of previously recorded messages with synthetic speech segments generated by a text to speech converter. Text to speech conversion is made pitch synchronously overlapping and adding diphones and triphone speech units, and is used only for unpredictable vocabulary e.g. names, addresses, account numbers, etc.<>

提出了一种新的语音事务处理综合方案。该系统利用先前记录的消息与由文本到语音转换器生成的合成语音片段的连接。文本到语音的转换是使音高同步重叠和添加双音阶和三音阶语音单位，并仅用于不可预测的词汇，如姓名，地址，账号等。

引用次数: 0

Dialog design for a speech-interactive automation system 语音交互自动化系统的对话框设计

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341532

B. L. Zeigler, B. Bazor

We discuss our approach to dialog design for telephone service orders, and describe the dialog developed for our service disconnect system. Our approach is based on the characterization of applications in terms of information elements and their attributes. We build the acquisition dialog for each information element by customizing generic dialog prototypes to match its type and attributes. The design of the dialog prototypes is based on dyads of system outcomes and recourse actions. Our approach features design modularity, relative ease of scaling dialogs to new applications, and decoupling the dialog design from the specifics of system and recognition technologies.<>

我们讨论了电话服务订单的对话设计方法，并描述了为我们的服务断开系统开发的对话。我们的方法基于根据信息元素及其属性对应用程序进行表征。我们通过定制通用对话框原型来匹配其类型和属性，为每个信息元素构建获取对话框。对话原型的设计是基于系统结果和追索权操作的组合。我们的方法的特点是设计模块化，相对容易将对话框扩展到新的应用程序，并将对话框设计从系统和识别技术的细节中解耦。

引用次数: 12

A field study of performance improvements in HMM-based speaker verification 基于hmm的说话人验证性能改进的实地研究

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341530

T. Jacobs, A. Setlur

This study reports our findings on speaker verification (SV) performance improvements using random 4-digit utterances collected over a single microphone type. The databases used in this study are the result of an ongoing field trial of SV access to automatic teller machines (ATMs) for secure unattended banking services. The SV system uses continuous density HMM models trained on 18 connected 4-digit utterances and has a baseline equal-error-rate (EER) of between 5.5 and 11% for different sets of data. Because of the limited training data, estimates for the mixture variances are most often poor. By calculating average mixture variances using all of the training data for a given speaker and then setting all of the model variances for that speaker to these speaker dependent values and using cohort normalization, the EER decreases consistently to between 2.5 and 6.5%.<>

本研究报告了我们使用在单一麦克风类型上收集的随机4位数话语来改进说话人验证(SV)性能的发现。本研究中使用的数据库是SV进入自动柜员机(atm)进行安全无人值守银行服务的现场试验的结果。SV系统使用连续密度HMM模型，对18个相连的4位数话语进行训练，对于不同的数据集，其基线等错误率(EER)在5.5%到11%之间。由于训练数据有限，对混合方差的估计通常很差。通过使用给定说话者的所有训练数据计算平均混合方差，然后将该说话者的所有模型方差设置为这些与说话者相关的值，并使用队列归一化，EER始终下降到2.5至6.5%之间。

引用次数: 2

Operational and experimental French telecommunication services using CNET speech recognition and text-to-speech synthesis 操作和实验法语电信服务使用CNET语音识别和文本到语音合成

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341550

C. Sorin, D. Jouvet, C. Gagnoulet, D. Dubois, D. Sadek, M. Toularhoat

The paper presents a brief overview of current uses for CNET speech technology (speech recognition and text-to-speech systems) in interactive voice response services (IVR). Several services are described, and the latest evaluation of one ASR-based service is also outlined. Finally, the paper summarizes developments in the CNET ASR and TTS technology.<>

本文简要介绍了CNET语音技术(语音识别和文本转语音系统)在交互式语音应答服务(IVR)中的应用现状。介绍了几种服务，并概述了一种基于asr的服务的最新评估。最后，对CNET ASR和TTS技术的发展进行了总结。

引用次数: 25

Noise suppression in cellular communications 蜂窝通信中的噪声抑制

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341539

H. Hermansky, E. Wan, C. Avendaño

FIR Wiener-like filters are applied to time trajectories of cubic-root compressed short-term power spectrum of noisy speech recorded over cellular communications. Informal listenings indicate that the technique brings a noticeable improvement in quality of noisy speech in the overlap-add analysis-synthesis system while not causing any significant degradation on clean speech.<>

将FIR类维纳滤波器应用于蜂窝通信中记录的噪声语音的三根压缩短期功率谱的时间轨迹。非正式聆听表明，该技术在重叠-添加分析-合成系统中显著改善了噪声语音的质量，同时不会对干净语音造成任何明显的退化。

引用次数: 13

A multimodal consumer information server with IVR menu 一个多模式的消费者信息服务器与IVR菜单

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341542

M. Damhuis, M. Peeters, L. Boves

The paper describes the development of a fully automatic multimodal information system for the consumer market. The system will be able to provide information on a large number of topics via a single telephone number. The eventual system will integrate interactive voice response, speech recognition, speaker verification, direct dial in, calling line identification, facsimile and electronic mail. The present version is limited to DTMF input and voice and facsimile output. The architecture of the system described in the paper allows successive addition of other technologies.<>

本文介绍了面向消费市场的全自动多式联运信息系统的开发。该系统将能够通过一个电话号码提供大量主题的信息。最终的系统将集成交互式语音应答、语音识别、说话人核查、直接拨号、通话线路识别、传真和电子邮件。目前的版本仅限于DTMF输入和语音和传真输出。本文所描述的系统架构允许陆续添加其他技术。

引用次数: 2

Assessment of the VoiceMap spoken language system VoiceMap口语系统的评估

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341528

A. Bayya, M. Ďurian, L. Meiskey, R. Root, R. Sparks

As spoken language systems expand to new tasks, there will be a need for empirical research on how to optimize the usability of human-computer spoken natural language dialogues, including research on methods for chunking information. For allowing users to control provision of that information, and for providing feedback on the system's processing and current context. The paper describes the results of usability study performed to evaluate the performance and usability as well as acceptability of a system that provides street map directions.<>

随着口语系统扩展到新的任务，将需要对如何优化人机口语自然语言对话的可用性进行实证研究，包括对信息分组方法的研究。允许用户控制该信息的提供，并提供关于系统处理和当前上下文的反馈。本文描述了可用性研究的结果，以评估一个提供街道地图方向的系统的性能、可用性和可接受性。

引用次数: 0

Experience with the Philips automatic train timetable information system 有使用飞利浦自动列车时刻表信息系统的经验

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

Pub Date : 1994-09-26 DOI: 10.1109/IVTTA.1994.341543

H. Aust, M. Oerder, F. Seide, V. Steinbiss

Introduces an automatic system for train timetable information over the telephone that provides accurate connections between 1200 German cities. The caller can talk to it in unrestricted, natural, and fluent speech, very much like he or she would communicate with a human operator, and is not given any instructions in advance. In an ongoing field trial, this system has been made available to the general public, both to gather speech data and to evaluate its performance. This field test was organized as a bootstrapping process: initially, the system was trained with just the developers' voices, then the telephone number was passed around within the department, the company, and finally, the outside world. After each step, the newly collected material was used for retraining and general improvements. The observations and results from this test are reported.<>

介绍了一种通过电话提供列车时刻表信息的自动系统，该系统可以在1200个德国城市之间提供准确的连接。呼叫者可以用不受限制的、自然的、流利的语言与它交谈，就像他或她与人类操作员交流一样，而且事先不需要任何指示。在一项正在进行的现场试验中，该系统已向公众开放，用于收集语音数据和评估其性能。这个现场测试被组织为一个引导过程:最初，系统只使用开发人员的声音进行训练，然后电话号码在部门、公司内部传递，最后传递给外部世界。在每一步之后，新收集的材料用于再培训和一般改进。报告了这次试验的观察结果和结果。

引用次数: 36

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀