M. D. Santo, G. Percannella, Carlo Sansone, M. Vento
{"title":"Classifying audio of movies by a multi-expert system","authors":"M. D. Santo, G. Percannella, Carlo Sansone, M. Vento","doi":"10.1109/ICIAP.2001.957040","DOIUrl":null,"url":null,"abstract":"The paper presents a system for the automatic MPEG format. In contrast to the approaches proposed up to now, it employs a multi-expert classification system arranged according to a multi-stage architecture. The system is able to recognize not only four pure classes (music, speech, silence and noise) but also confused audio signals, such as the ones resulting from the overlap of pure audio components (for example, speech overlapped with music or noise, etc.). An extensive experimental analysis has been carried on a large audio database extracted from about 30 moving pictures recorded on low-quality magnetic media. Results confirm the effectiveness of the approach, with an average improvement of about 45% with respect to single classifier solutions.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 11th International Conference on Image Analysis and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAP.2001.957040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
The paper presents a system for the automatic MPEG format. In contrast to the approaches proposed up to now, it employs a multi-expert classification system arranged according to a multi-stage architecture. The system is able to recognize not only four pure classes (music, speech, silence and noise) but also confused audio signals, such as the ones resulting from the overlap of pure audio components (for example, speech overlapped with music or noise, etc.). An extensive experimental analysis has been carried on a large audio database extracted from about 30 moving pictures recorded on low-quality magnetic media. Results confirm the effectiveness of the approach, with an average improvement of about 45% with respect to single classifier solutions.