{"title":"MulKINet:用于准确和快速识别翻唱歌曲的多阶段键不变卷积神经网络","authors":"Chengdi Cao, Weiqiang Zhang","doi":"10.1109/ISSPIT51521.2020.9408993","DOIUrl":null,"url":null,"abstract":"Cover song identification (CSI) is a challenging task in the music information retrieval (MIR) community. The employment of convolutional neural networks (CNN) have significantly improved the performance of CSI systems, especially CNN designed to be invariant against key transpositions. In this paper, we propose MulKINet, a multi-stage CNN architecture that preserve the property of key invariance while its representational ability is substantially enhanced. Combined with three options for building blocks, channel and temporal attention mechanism, we present an accurate and fast CSI system.","PeriodicalId":111385,"journal":{"name":"2020 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"MulKINet: Multi-Stage Key-Invariant Convolutional Neural Networks for Accurate and Fast Cover Song Identification\",\"authors\":\"Chengdi Cao, Weiqiang Zhang\",\"doi\":\"10.1109/ISSPIT51521.2020.9408993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cover song identification (CSI) is a challenging task in the music information retrieval (MIR) community. The employment of convolutional neural networks (CNN) have significantly improved the performance of CSI systems, especially CNN designed to be invariant against key transpositions. In this paper, we propose MulKINet, a multi-stage CNN architecture that preserve the property of key invariance while its representational ability is substantially enhanced. Combined with three options for building blocks, channel and temporal attention mechanism, we present an accurate and fast CSI system.\",\"PeriodicalId\":111385,\"journal\":{\"name\":\"2020 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSPIT51521.2020.9408993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPIT51521.2020.9408993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MulKINet: Multi-Stage Key-Invariant Convolutional Neural Networks for Accurate and Fast Cover Song Identification
Cover song identification (CSI) is a challenging task in the music information retrieval (MIR) community. The employment of convolutional neural networks (CNN) have significantly improved the performance of CSI systems, especially CNN designed to be invariant against key transpositions. In this paper, we propose MulKINet, a multi-stage CNN architecture that preserve the property of key invariance while its representational ability is substantially enhanced. Combined with three options for building blocks, channel and temporal attention mechanism, we present an accurate and fast CSI system.