首页 > 最新文献

2012 8th International Symposium on Chinese Spoken Language Processing最新文献

英文 中文
A hybrid fragment / syllable-based system for improved OOV term detection 一种基于片段/音节的混合系统,用于改进OOV术语检测
Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423479
Yong Xu, Wu Guo, Lirong Dai
Spoken term detection (STD) is a task for open vocabulary search in large recordings of speech. Although the term detection performance for in-vocabulary (INV) terms has achieved a great improvement, the detection performance for out of vocabulary (OOV) terms is still disappointing. In this paper, we propose to combine fragment-based with syllable-based search into a hybrid STD system for OOV terms. Syllable is a kind of knowledge-based subword while fragment is data-driven. We initially compare their different modeling ability for OOVs. Considering the potential complementarities between them, we explore two methods of fusion: index fusion (combining the triphone indexes of a fragment-based and a syllable-based system) and result fusion (merging search results of the two systems). After the result fusion, we achieve a 9.4% relative improvement on NIST STD06 English conversational telephone speech (CTS) EvalSet in actual term weighted value (ATWV).
口语词汇检测(STD)是一项在大量语音录音中进行开放词汇搜索的任务。尽管词汇表内(INV)术语的检测性能有了很大的提高,但词汇表外(OOV)术语的检测性能仍然令人失望。在本文中,我们提出将基于片段的搜索和基于音节的搜索结合成一个混合STD系统来搜索OOV术语。音节是一种基于知识的子词,片段是数据驱动的。我们首先比较了它们对oov的不同建模能力。考虑到它们之间潜在的互补性,我们探索了两种融合方法:索引融合(结合基于片段和基于音节的系统的三重索引)和结果融合(合并两个系统的搜索结果)。结果融合后,我们在NIST STD06英语会话电话语音(CTS) EvalSet的实际词加权值(ATWV)上实现了9.4%的相对改进。
{"title":"A hybrid fragment / syllable-based system for improved OOV term detection","authors":"Yong Xu, Wu Guo, Lirong Dai","doi":"10.1109/ISCSLP.2012.6423479","DOIUrl":"https://doi.org/10.1109/ISCSLP.2012.6423479","url":null,"abstract":"Spoken term detection (STD) is a task for open vocabulary search in large recordings of speech. Although the term detection performance for in-vocabulary (INV) terms has achieved a great improvement, the detection performance for out of vocabulary (OOV) terms is still disappointing. In this paper, we propose to combine fragment-based with syllable-based search into a hybrid STD system for OOV terms. Syllable is a kind of knowledge-based subword while fragment is data-driven. We initially compare their different modeling ability for OOVs. Considering the potential complementarities between them, we explore two methods of fusion: index fusion (combining the triphone indexes of a fragment-based and a syllable-based system) and result fusion (merging search results of the two systems). After the result fusion, we achieve a 9.4% relative improvement on NIST STD06 English conversational telephone speech (CTS) EvalSet in actual term weighted value (ATWV).","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127917560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unified denoising and dereverberation method used in restoration of MTF-based power envelope 基于mtf的功率包络恢复中的统一去噪去噪方法
Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423499
M. Unoki, Xugang Lu
Recent methods of speech enhancement have been proposed to suppress the effects of background noise and reverberation. The effect of background noise in these methods is regarded as additive and that of reverberation is convolutive. Therefore, methods of reducing noise and dereverberation have been applied separately in tandem. We previously unified the effects of noise and reverberation in the modulation transfer function (MTF) concept and we proposed an approach to simultaneously removing the effects of noise and reverberation. This paper further verifies our proposed approach, mathematically and quantitatively. In addition, we generalized the model to four types of real application scenarios, which corresponded to far and near field conditions under noisy reverberant conditions, based on mathematical analysis. We carried out 12,000 simulations for each according to the four application scenarios for denoising and dereverberating noisy reverberant speech, and objectively evaluated the proposed approach. The experimental results revealed that the proposed approach worked well in simultaneously denoising and dereverberating noisy reverberant speech.
最近的语音增强方法被提出来抑制背景噪声和混响的影响。在这些方法中,背景噪声的影响被认为是加性的,混响的影响是卷积的。因此,降噪和去噪的方法被分别串联应用。我们之前在调制传递函数(MTF)概念中统一了噪声和混响的影响,并提出了一种同时去除噪声和混响影响的方法。本文从数学和定量上进一步验证了我们提出的方法。此外,在数学分析的基础上,将模型推广到四种实际应用场景,分别对应于噪声混响条件下的远场和近场情况。针对噪声混响语音去噪去噪的四种应用场景,分别进行了1.2万次仿真,客观评价了所提出的方法。实验结果表明,该方法能较好地实现混响噪声的去噪和去噪。
{"title":"Unified denoising and dereverberation method used in restoration of MTF-based power envelope","authors":"M. Unoki, Xugang Lu","doi":"10.1109/ISCSLP.2012.6423499","DOIUrl":"https://doi.org/10.1109/ISCSLP.2012.6423499","url":null,"abstract":"Recent methods of speech enhancement have been proposed to suppress the effects of background noise and reverberation. The effect of background noise in these methods is regarded as additive and that of reverberation is convolutive. Therefore, methods of reducing noise and dereverberation have been applied separately in tandem. We previously unified the effects of noise and reverberation in the modulation transfer function (MTF) concept and we proposed an approach to simultaneously removing the effects of noise and reverberation. This paper further verifies our proposed approach, mathematically and quantitatively. In addition, we generalized the model to four types of real application scenarios, which corresponded to far and near field conditions under noisy reverberant conditions, based on mathematical analysis. We carried out 12,000 simulations for each according to the four application scenarios for denoising and dereverberating noisy reverberant speech, and objectively evaluated the proposed approach. The experimental results revealed that the proposed approach worked well in simultaneously denoising and dereverberating noisy reverberant speech.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129285831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A study on the coarticulation of bi-syllabic words in Chinese 汉语双音节词的协同发音研究
Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423453
Maolin Wang, Shengnan Xiong, Jiayun Li, Ziyu Xiong
In this study, trans-consonantal vowel-to-vowel coarticulation in Chinese is analyzed. The target words are in the form of `bV1.ba', which are designed to occur on the sentence initial, medial and final positions, and the subjects are eight native speakers of standard Chinese. Vowel formants are examined at the onset, middle and offset points of the target vowel. Results show that trans-segmental coarticulation exists in Chinese, especially at the onset point of the target vowel. Coarticulation is more likely to occur on F2, and its effect is comparatively great on the sentence initial position.
本文分析了汉语跨辅音元音与元音的协同发音。目标单词的形式是' bV1 '。“八”分别出现在句子的首、中、末三个位置,被测者为8名母语为标准汉语的人。在目标元音的起始点、中间点和偏移点检查元音共振峰。结果表明,汉语中存在跨音段协同发音,尤其是在目标元音起始点。在F2上更容易出现协同发音,其对句子起始位置的影响比较大。
{"title":"A study on the coarticulation of bi-syllabic words in Chinese","authors":"Maolin Wang, Shengnan Xiong, Jiayun Li, Ziyu Xiong","doi":"10.1109/ISCSLP.2012.6423453","DOIUrl":"https://doi.org/10.1109/ISCSLP.2012.6423453","url":null,"abstract":"In this study, trans-consonantal vowel-to-vowel coarticulation in Chinese is analyzed. The target words are in the form of `bV1.ba', which are designed to occur on the sentence initial, medial and final positions, and the subjects are eight native speakers of standard Chinese. Vowel formants are examined at the onset, middle and offset points of the target vowel. Results show that trans-segmental coarticulation exists in Chinese, especially at the onset point of the target vowel. Coarticulation is more likely to occur on F2, and its effect is comparatively great on the sentence initial position.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115137628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nesting hierarchical phrase-based model for speech-to-speech translation 基于嵌套层次短语的语音翻译模型
Pub Date : 2012-12-01 DOI: 10.1109/ISCSLP.2012.6423497
Xiaoyin Fu, Wei Wei, Lichun Fan, Shixiang Lu, Bo Xu
Hierarchical phrase-based (HPB) translation has been introduced to speech-to-speech (S2S) translation system on mobile terminals, such as smartphones. However, it suffers from the explosive growth in the number of rules along with the increment in decoding time for S2S translation system when the memory and decoding speed is restricted. In this paper, we propose a nesting HPB model to capture the topological structure of hierarchical rules on the source language side, which will not only filter out the redundant rules in HPB model but also speed up the decoder. Experiments on the HPB translation system show that our approach can greatly reduce the rule table size by 75% with a faster decoder, and yield the same translation quality (measured by using BLEU) as the state-of-art HPB model.
基于分层短语(HPB)的翻译已被引入到智能手机等移动终端的语音到语音(S2S)翻译系统中。但是,在内存和译码速度受限的情况下,S2S译码系统的规则数量随着译码时间的增加而呈现爆发式增长。本文提出了一种嵌套HPB模型,在源语言端捕获层次规则的拓扑结构,既可以过滤掉HPB模型中的冗余规则,又可以提高解码器的速度。在HPB翻译系统上的实验表明,我们的方法可以使用更快的解码器将规则表的大小大大减少75%,并且产生与最先进的HPB模型相同的翻译质量(使用BLEU测量)。
{"title":"Nesting hierarchical phrase-based model for speech-to-speech translation","authors":"Xiaoyin Fu, Wei Wei, Lichun Fan, Shixiang Lu, Bo Xu","doi":"10.1109/ISCSLP.2012.6423497","DOIUrl":"https://doi.org/10.1109/ISCSLP.2012.6423497","url":null,"abstract":"Hierarchical phrase-based (HPB) translation has been introduced to speech-to-speech (S2S) translation system on mobile terminals, such as smartphones. However, it suffers from the explosive growth in the number of rules along with the increment in decoding time for S2S translation system when the memory and decoding speed is restricted. In this paper, we propose a nesting HPB model to capture the topological structure of hierarchical rules on the source language side, which will not only filter out the redundant rules in HPB model but also speed up the decoder. Experiments on the HPB translation system show that our approach can greatly reduce the rule table size by 75% with a faster decoder, and yield the same translation quality (measured by using BLEU) as the state-of-art HPB model.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129876747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2012 8th International Symposium on Chinese Spoken Language Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1