MAT - A Project to Collect Mandarin Speech Data Through Telephone Net works in Taiwan

Int. J. Comput. Linguistics Chin. Lang. Process. Pub Date : 1997-02-01 DOI:10.30019/IJCLCLP.199702.0003

Hsiao-Chuan Wang

引用次数: 48

Abstract

A cooperative project, called ”Polyphone”, was initiated by the Coordinating Committee on Speech Databases and Speech I/O Systems Assessment (COCOSDA) in 1992. Accordingly, a project to collect Mandarin speech data across Taiwan (MAT) was conducted by a group of researchers from several universities and research organizations in Taiwan. The purpose was to generate a speech corpus for the development of Mandarin-based speech technology and products. The speech data were collected at eight recording stations through telephone networks. The speakers were chosen so as to reflect the population of the gender, the dialect, the educational level, and the residence .in Taiwan. A preliminary Mandarin speech database of 800 speakers has been produced. The final goal is to generate a speech database of at. least 5000 speakers.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用台湾电话网搜集普通话语音资料的计画

1992年，语音数据库和语音I/O系统评估协调委员会(COCOSDA)发起了一个名为“Polyphone”的合作项目。因此，来自台湾几所大学和研究机构的一组研究人员开展了一项收集全台湾普通话语音数据的项目。目的是为基于普通话的语音技术和产品的开发生成一个语音语料库。语音数据是通过电话网络在8个录音站收集的。演讲者的选择是为了反映台湾人口的性别、方言、教育水平和居住地。初步建立了800人的普通话语音数据库。最终目标是生成一个at的语音数据库。至少5000人。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Int. J. Comput. Linguistics Chin. Lang. Process.

自引率

0.00%

发文量