{"title":"Spoken term detection based on acoustic speech segmentation","authors":"Nadia Benati, Halima Bahi","doi":"10.1109/SETIT.2016.7939878","DOIUrl":null,"url":null,"abstract":"This paper presents a system for spoken term detection in a continuous speech stream. Spoken terms are predefined through a set of acoustic examples provided by the users. Spoken term detection proceeds in two steps: speech segmentation and term verification. We suggest the use of an acoustic-based algorithm for the segmentation which exploits acoustic particularities of the speech stream to detect word frontiers. From the segmentation stage a collection of utterances are delimited, and they are aligned with the spoken term acoustic representation. Tests were conducted on an Arabic corpus using spectral features of the signal. A correct detection rate of about 100% was reached with a false alarm of 20%.","PeriodicalId":426951,"journal":{"name":"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SETIT.2016.7939878","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper presents a system for spoken term detection in a continuous speech stream. Spoken terms are predefined through a set of acoustic examples provided by the users. Spoken term detection proceeds in two steps: speech segmentation and term verification. We suggest the use of an acoustic-based algorithm for the segmentation which exploits acoustic particularities of the speech stream to detect word frontiers. From the segmentation stage a collection of utterances are delimited, and they are aligned with the spoken term acoustic representation. Tests were conducted on an Arabic corpus using spectral features of the signal. A correct detection rate of about 100% was reached with a false alarm of 20%.