{"title":"基于深度学习的两两交叉相似矩阵翻唱歌曲识别","authors":"Manan Mehta, Anmol Sajnani, Radhika Chapaneri","doi":"10.1109/IBSSC47189.2019.8973064","DOIUrl":null,"url":null,"abstract":"A cover song, by definition, is a rendition of a previously released song and mapping these cover songs to their original song is defined as ”Cover Song Identification.” In this paper, we propose multiple cover song identification methods using Convolutional Neural Network (CNN) models as well as transfer learning to extract features which can be trained on statistical models for binary classification. We develop two CNN models that are trained on a cross-similarity matrix which is generated from a pair of songs as input. Firstly we designed a simple CNN architecture that was trained on two labels 1. cover pair relationship; 2. non-cover pair relationship. Our second approach uses a CNN model known as the Inception Model. We train the model by generating cross-similarity matrices for both the labels and then converting them into images. At later stage, we use a ranking method that sorts the probabilities of the cover relation in descending order and the song with the highest probability is chosen as a match. Based on the evaluation, Inception model performs the best, scoring the accuracy of 93.4%.","PeriodicalId":148941,"journal":{"name":"2019 IEEE Bombay Section Signature Conference (IBSSC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Cover Song Identification with Pairwise Cross-Similarity Matrix using Deep Learning\",\"authors\":\"Manan Mehta, Anmol Sajnani, Radhika Chapaneri\",\"doi\":\"10.1109/IBSSC47189.2019.8973064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A cover song, by definition, is a rendition of a previously released song and mapping these cover songs to their original song is defined as ”Cover Song Identification.” In this paper, we propose multiple cover song identification methods using Convolutional Neural Network (CNN) models as well as transfer learning to extract features which can be trained on statistical models for binary classification. We develop two CNN models that are trained on a cross-similarity matrix which is generated from a pair of songs as input. Firstly we designed a simple CNN architecture that was trained on two labels 1. cover pair relationship; 2. non-cover pair relationship. Our second approach uses a CNN model known as the Inception Model. We train the model by generating cross-similarity matrices for both the labels and then converting them into images. At later stage, we use a ranking method that sorts the probabilities of the cover relation in descending order and the song with the highest probability is chosen as a match. Based on the evaluation, Inception model performs the best, scoring the accuracy of 93.4%.\",\"PeriodicalId\":148941,\"journal\":{\"name\":\"2019 IEEE Bombay Section Signature Conference (IBSSC)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Bombay Section Signature Conference (IBSSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IBSSC47189.2019.8973064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC47189.2019.8973064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cover Song Identification with Pairwise Cross-Similarity Matrix using Deep Learning
A cover song, by definition, is a rendition of a previously released song and mapping these cover songs to their original song is defined as ”Cover Song Identification.” In this paper, we propose multiple cover song identification methods using Convolutional Neural Network (CNN) models as well as transfer learning to extract features which can be trained on statistical models for binary classification. We develop two CNN models that are trained on a cross-similarity matrix which is generated from a pair of songs as input. Firstly we designed a simple CNN architecture that was trained on two labels 1. cover pair relationship; 2. non-cover pair relationship. Our second approach uses a CNN model known as the Inception Model. We train the model by generating cross-similarity matrices for both the labels and then converting them into images. At later stage, we use a ranking method that sorts the probabilities of the cover relation in descending order and the song with the highest probability is chosen as a match. Based on the evaluation, Inception model performs the best, scoring the accuracy of 93.4%.