{"title":"基于声学语音分割的口语词检测","authors":"Nadia Benati, Halima Bahi","doi":"10.1109/SETIT.2016.7939878","DOIUrl":null,"url":null,"abstract":"This paper presents a system for spoken term detection in a continuous speech stream. Spoken terms are predefined through a set of acoustic examples provided by the users. Spoken term detection proceeds in two steps: speech segmentation and term verification. We suggest the use of an acoustic-based algorithm for the segmentation which exploits acoustic particularities of the speech stream to detect word frontiers. From the segmentation stage a collection of utterances are delimited, and they are aligned with the spoken term acoustic representation. Tests were conducted on an Arabic corpus using spectral features of the signal. A correct detection rate of about 100% was reached with a false alarm of 20%.","PeriodicalId":426951,"journal":{"name":"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Spoken term detection based on acoustic speech segmentation\",\"authors\":\"Nadia Benati, Halima Bahi\",\"doi\":\"10.1109/SETIT.2016.7939878\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a system for spoken term detection in a continuous speech stream. Spoken terms are predefined through a set of acoustic examples provided by the users. Spoken term detection proceeds in two steps: speech segmentation and term verification. We suggest the use of an acoustic-based algorithm for the segmentation which exploits acoustic particularities of the speech stream to detect word frontiers. From the segmentation stage a collection of utterances are delimited, and they are aligned with the spoken term acoustic representation. Tests were conducted on an Arabic corpus using spectral features of the signal. A correct detection rate of about 100% was reached with a false alarm of 20%.\",\"PeriodicalId\":426951,\"journal\":{\"name\":\"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SETIT.2016.7939878\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SETIT.2016.7939878","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Spoken term detection based on acoustic speech segmentation
This paper presents a system for spoken term detection in a continuous speech stream. Spoken terms are predefined through a set of acoustic examples provided by the users. Spoken term detection proceeds in two steps: speech segmentation and term verification. We suggest the use of an acoustic-based algorithm for the segmentation which exploits acoustic particularities of the speech stream to detect word frontiers. From the segmentation stage a collection of utterances are delimited, and they are aligned with the spoken term acoustic representation. Tests were conducted on an Arabic corpus using spectral features of the signal. A correct detection rate of about 100% was reached with a false alarm of 20%.