Music retrieval by singing and humming using information fusion
John N. Milner, D. Hsu
2013 IEEE 12th International Conference on Cognitive Informatics and Cognitive Computing
Published: 2013-07-16
DOI: 10.1109/ICCI-CC.2013.6622263 (https://doi.org/10.1109/ICCI-CC.2013.6622263)
Citation count: 1
Abstract
We show that combinatorial fusion analysis (CFA) can improve results in a music information retrieval (MIR) task, specifically querying a database of recorded music by singing, humming, or whistling. Our experiment considers 10 scoring systems, 55 queries, and a database of 310 original artists' recordings. Using spectral subtraction, we exploit the recording industry's tradition of placing the lead vocal and other prominent melodic features in the center of a stereo mix. We employ the rank/score function defined in earlier studies of CFA to analyze the behavior of scoring systems, and we use the rank/score variation to quantify the diversity of any two scoring systems. We then observe that successful 2-combinations, i.e., cases where the performance of a combination meets or exceeds the performance of its constituent scoring systems, tend to occur when each system performs relatively well and the systems are diverse.
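The abstract does not give formulas, so the following is only a minimal sketch of how a rank/score function and a diversity measure between two scoring systems might be computed. It assumes a formulation often used in the CFA literature: the rank/score function maps rank i to the normalized score of the item at that rank, and diversity is the root-mean-square difference between two such functions. The function names (`normalize`, `rank_score_function`, `diversity`, `ranks`, `score_combination`, `rank_combination`), the min-max normalization, and the simple averaging used for combination are illustrative assumptions, not the authors' implementation.

```python
from math import sqrt


def normalize(scores):
    """Min-max normalize scores to [0, 1] (assumed normalization)."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def rank_score_function(scores):
    """Return f where f[i - 1] is the normalized score at rank i (rank 1 = best)."""
    return sorted(normalize(scores), reverse=True)


def diversity(scores_a, scores_b):
    """RMS difference between the rank/score functions of two systems (assumed form)."""
    fa = rank_score_function(scores_a)
    fb = rank_score_function(scores_b)
    return sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)) / len(fa))


def ranks(scores):
    """Rank of each item, 1 = highest score; ties are broken by item index."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    result = [0] * len(scores)
    for position, item in enumerate(order, start=1):
        result[item] = position
    return result


def score_combination(scores_a, scores_b):
    """Average of normalized scores; higher is better."""
    na, nb = normalize(scores_a), normalize(scores_b)
    return [(x + y) / 2 for x, y in zip(na, nb)]


def rank_combination(scores_a, scores_b):
    """Average of ranks; lower is better."""
    ra, rb = ranks(scores_a), ranks(scores_b)
    return [(x + y) / 2 for x, y in zip(ra, rb)]


if __name__ == "__main__":
    # Hypothetical similarity scores of one query against four database items.
    system_a = [0.90, 0.80, 0.10, 0.05]
    system_b = [0.60, 0.20, 0.50, 0.40]
    print("diversity:", diversity(system_a, system_b))
    print("score combination:", score_combination(system_a, system_b))
    print("rank combination:", rank_combination(system_a, system_b))
```

In this sketch, a 2-combination would count as successful when the retrieval performance of the combined ranking meets or exceeds that of both inputs; how performance is measured is not stated in the abstract and is left out here.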