首页 > 最新文献

2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)最新文献

英文 中文
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System 优化越南语文本转语音系统自然度的数据处理
V. Phung, Phan Huy Kinh, Anh-Tuan Dinh, Quoc Bao Nguyen
End-to-end text-to-speech (TTS) systems has proved its great success in the presence of a large amount of high-quality training data recorded in an anechoic room with high-quality microphones. Another approach is to use available source of found data like radio broadcast news. We aim to optimize the naturalness of TTS system on the found data using a novel data processing method. The data processing method includes 1) utterance selection and 2) prosodic punctuation insertion to prepare training data which can optimize the naturalness of TTS systems. We showed that using the processing data method, an end-to-end TTS achieved a mean opinion score (MOS) of 4.1 compared to 4.3 of natural speech. We showed that the punctuation insertion contributed the most to the result. To facilitate the research and development of TTS systems, we distributed the processed data, which is known as Zalo-TTS database at https://forms.gle/6Hk5YkqgDxAaC2BU6; It consists of 18-hours of speech at a sampling rate of 44.1 kHz of one speaker with Hanoi dialect.
端到端文本到语音(TTS)系统在具有高质量麦克风的消声室中记录了大量高质量的训练数据,证明了其巨大的成功。另一种方法是使用现有的数据来源,如无线电广播新闻。我们的目标是利用一种新的数据处理方法来优化TTS系统对发现数据的自然度。数据处理方法包括1)话语选择和2)韵律标点插入,以制备训练数据,优化TTS系统的自然度。我们发现,使用处理数据的方法,端到端TTS的平均意见得分(MOS)为4.1,而自然语音的平均意见得分为4.3。我们发现标点符号的插入对结果的贡献最大。为了方便TTS系统的研究和开发,我们将处理后的数据分发到https://forms.gle/6Hk5YkqgDxAaC2BU6,称为Zalo-TTS数据库;它由一个河内方言的讲话者以44.1 kHz的采样率进行的18小时的讲话组成。
{"title":"Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System","authors":"V. Phung, Phan Huy Kinh, Anh-Tuan Dinh, Quoc Bao Nguyen","doi":"10.1109/O-COCOSDA50338.2020.9295025","DOIUrl":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295025","url":null,"abstract":"End-to-end text-to-speech (TTS) systems has proved its great success in the presence of a large amount of high-quality training data recorded in an anechoic room with high-quality microphones. Another approach is to use available source of found data like radio broadcast news. We aim to optimize the naturalness of TTS system on the found data using a novel data processing method. The data processing method includes 1) utterance selection and 2) prosodic punctuation insertion to prepare training data which can optimize the naturalness of TTS systems. We showed that using the processing data method, an end-to-end TTS achieved a mean opinion score (MOS) of 4.1 compared to 4.3 of natural speech. We showed that the punctuation insertion contributed the most to the result. To facilitate the research and development of TTS systems, we distributed the processed data, which is known as Zalo-TTS database at https://forms.gle/6Hk5YkqgDxAaC2BU6; It consists of 18-hours of speech at a sampling rate of 44.1 kHz of one speaker with Hanoi dialect.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114714843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Country Report - OCOCOSDA Activities in China 国别报告-中国的OCOCOSDA活动
Ai-jun Li, Dong Wang
This article consists only of a collection of slides from the author's conference presentation.
本文仅由作者在会议上发表的一些幻灯片组成。
{"title":"A Country Report - OCOCOSDA Activities in China","authors":"Ai-jun Li, Dong Wang","doi":"10.1109/O-COCOSDA50338.2020.9294999","DOIUrl":"https://doi.org/10.1109/O-COCOSDA50338.2020.9294999","url":null,"abstract":"This article consists only of a collection of slides from the author's conference presentation.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"641 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121085879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques: Country Report - India 语音数据库和评估技术协调和标准化国际委员会:国别报告-印度
S. Agrawal
This article consists only of a collection of slides from the author's conference presentation.
本文仅由作者在会议上发表的一些幻灯片组成。
{"title":"International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques: Country Report - India","authors":"S. Agrawal","doi":"10.1109/O-COCOSDA50338.2020.9295033","DOIUrl":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295033","url":null,"abstract":"This article consists only of a collection of slides from the author's conference presentation.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"379 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115907181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
The message of the O-COCOSDA Convenor O-COCOSDA召集人的话
{"title":"The message of the O-COCOSDA Convenor","authors":"","doi":"10.1109/o-cocosda46868.2019.9050340","DOIUrl":"https://doi.org/10.1109/o-cocosda46868.2019.9050340","url":null,"abstract":"","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116088518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
O-COCOSDA Country Report of Pakistan O-COCOSDA巴基斯坦国别报告
T. Habib, S. Hussain
Existing Urdu TTS system has been integrated with Microsoft Speech API (SAPI. The system is now being freely distributed to visually impaired community in Pakistan.
现有的乌尔都语TTS系统已经集成了微软语音API (SAPI)。该系统目前正免费分发给巴基斯坦的视障社区。
{"title":"O-COCOSDA Country Report of Pakistan","authors":"T. Habib, S. Hussain","doi":"10.1109/O-COCOSDA50338.2020.9295013","DOIUrl":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295013","url":null,"abstract":"Existing Urdu TTS system has been integrated with Microsoft Speech API (SAPI. The system is now being freely distributed to visually impaired community in Pakistan.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130614705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Country Report - Singapore 国别报告-新加坡
Haizhou Li
This article consists only of a collection of slides from the author's conference presentation.
本文仅由作者在会议上发表的一些幻灯片组成。
{"title":"Country Report - Singapore","authors":"Haizhou Li","doi":"10.1109/O-COCOSDA50338.2020.9295016","DOIUrl":"https://doi.org/10.1109/O-COCOSDA50338.2020.9295016","url":null,"abstract":"This article consists only of a collection of slides from the author's conference presentation.","PeriodicalId":385266,"journal":{"name":"2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132148227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1