{"title":"Game Theory-based Ensemble of Deep Neural Networks for Large Scale Audio Tagging","authors":"H. Ykhlef, F. Ykhlef, Bouchra Amirouche","doi":"10.1109/ICRAMI52622.2021.9585943","DOIUrl":null,"url":null,"abstract":"Audio tagging is concerned with the development of systems that are able to recognize sound events. With the growing interest geared towards audio tagging for various applications, it has become of paramount importance to design systems that distinguish among events of different natures. To mend with this, ensembling many tagging system has become a successful strategy that lives-up to these emerging challenges. In this paper, we introduce a tagging system composed of an ensemble of deep learners. We propose to formulate the fusion strategy as a coalitional game. Our approach weighs these individual learners, while considering two crucial notions that affect the performance of an ensemble: accuracy and diversity. To demonstrate the efficiency of our approach, we have carried out experimental comparisons on a huge dataset made of sound recordings with annotations of varying reliability. The experimental results indicate that the proposed system provides a reliable ranking and outperforms some major state-of-the art ensemble learning approaches.","PeriodicalId":440750,"journal":{"name":"2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Recent Advances in Mathematics and Informatics (ICRAMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAMI52622.2021.9585943","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Audio tagging is concerned with the development of systems that are able to recognize sound events. With the growing interest geared towards audio tagging for various applications, it has become of paramount importance to design systems that distinguish among events of different natures. To mend with this, ensembling many tagging system has become a successful strategy that lives-up to these emerging challenges. In this paper, we introduce a tagging system composed of an ensemble of deep learners. We propose to formulate the fusion strategy as a coalitional game. Our approach weighs these individual learners, while considering two crucial notions that affect the performance of an ensemble: accuracy and diversity. To demonstrate the efficiency of our approach, we have carried out experimental comparisons on a huge dataset made of sound recordings with annotations of varying reliability. The experimental results indicate that the proposed system provides a reliable ranking and outperforms some major state-of-the art ensemble learning approaches.