{"title":"3 classes segmentation for analysis of football audio sequences","authors":"S. Lefèvre, Benjamin Maillard, N. Vincent","doi":"10.1109/ICDSP.2002.1028253","DOIUrl":null,"url":null,"abstract":"We are dealing with segmentation of audio data in order to analyse football audio/video sequences. Audio data is divided into short sequences (typically with duration of one or half a second) which is classified into several classes (speaker, crowd and referee whistle). Every sequence can then be further analysed depending on the class it belongs to. In order to segment audio data, several methods are presented. First simple techniques are reviewed for segmentation in two classes. From the limitations of these approaches, a method based on cepstral analysis is detailed. Next we present two more complex methods dealing with 3 classes segmentation. The first one is based on hidden Markov models whereas the second one is a combination of a C-mean classifier and multidimensional hidden Markov models.","PeriodicalId":351073,"journal":{"name":"2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2002.1028253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16
Abstract
We are dealing with segmentation of audio data in order to analyse football audio/video sequences. Audio data is divided into short sequences (typically with duration of one or half a second) which is classified into several classes (speaker, crowd and referee whistle). Every sequence can then be further analysed depending on the class it belongs to. In order to segment audio data, several methods are presented. First simple techniques are reviewed for segmentation in two classes. From the limitations of these approaches, a method based on cepstral analysis is detailed. Next we present two more complex methods dealing with 3 classes segmentation. The first one is based on hidden Markov models whereas the second one is a combination of a C-mean classifier and multidimensional hidden Markov models.