M. D. Santo, G. Percannella, Carlo Sansone, M. Vento
{"title":"基于多专家系统的电影音频分类","authors":"M. D. Santo, G. Percannella, Carlo Sansone, M. Vento","doi":"10.1109/ICIAP.2001.957040","DOIUrl":null,"url":null,"abstract":"The paper presents a system for the automatic MPEG format. In contrast to the approaches proposed up to now, it employs a multi-expert classification system arranged according to a multi-stage architecture. The system is able to recognize not only four pure classes (music, speech, silence and noise) but also confused audio signals, such as the ones resulting from the overlap of pure audio components (for example, speech overlapped with music or noise, etc.). An extensive experimental analysis has been carried on a large audio database extracted from about 30 moving pictures recorded on low-quality magnetic media. Results confirm the effectiveness of the approach, with an average improvement of about 45% with respect to single classifier solutions.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Classifying audio of movies by a multi-expert system\",\"authors\":\"M. D. Santo, G. Percannella, Carlo Sansone, M. Vento\",\"doi\":\"10.1109/ICIAP.2001.957040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper presents a system for the automatic MPEG format. In contrast to the approaches proposed up to now, it employs a multi-expert classification system arranged according to a multi-stage architecture. The system is able to recognize not only four pure classes (music, speech, silence and noise) but also confused audio signals, such as the ones resulting from the overlap of pure audio components (for example, speech overlapped with music or noise, etc.). An extensive experimental analysis has been carried on a large audio database extracted from about 30 moving pictures recorded on low-quality magnetic media. Results confirm the effectiveness of the approach, with an average improvement of about 45% with respect to single classifier solutions.\",\"PeriodicalId\":365627,\"journal\":{\"name\":\"Proceedings 11th International Conference on Image Analysis and Processing\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 11th International Conference on Image Analysis and Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIAP.2001.957040\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 11th International Conference on Image Analysis and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAP.2001.957040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classifying audio of movies by a multi-expert system
The paper presents a system for the automatic MPEG format. In contrast to the approaches proposed up to now, it employs a multi-expert classification system arranged according to a multi-stage architecture. The system is able to recognize not only four pure classes (music, speech, silence and noise) but also confused audio signals, such as the ones resulting from the overlap of pure audio components (for example, speech overlapped with music or noise, etc.). An extensive experimental analysis has been carried on a large audio database extracted from about 30 moving pictures recorded on low-quality magnetic media. Results confirm the effectiveness of the approach, with an average improvement of about 45% with respect to single classifier solutions.