Comparison of music genre classification using Nearest Centroid Classifier and k-Nearest Neighbours

Elizabeth Nurmiyati Tamatjita, Aditya W. Mahastama
{"title":"Comparison of music genre classification using Nearest Centroid Classifier and k-Nearest Neighbours","authors":"Elizabeth Nurmiyati Tamatjita, Aditya W. Mahastama","doi":"10.1109/ICIMTECH.2016.7930314","DOIUrl":null,"url":null,"abstract":"Music genre is getting complex from time to time. As the size of digital media grows along with amount of data, manual search of digital audio files according to its genre is considered impractical and inefficient; therefore a classification mechanism is needed to improve searching. Zero Crossing Rate (ZCR), Average Energy (E) and Silent Ratio (SR) are a few of features that can be extracted from digital audio files to classify its genre. This research is conducted to classify music from digital audio (songs) into 12 genres: Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop, Electronic, Reggae and Rock using above mentioned features, extracted from WAV audio files. Classification is performed several times using selected 3, 6, 9 and 12 genres respectively. The result shows that classification of 3 music genres (Ballad, Blues, Classic) has the highest accuracy (96.67%), followed by 6 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz) with 70%, and 9 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop) with 53.33% accuracy. Classification of all 12 music genres yields the lowest accuracy of 33.33%. The test results with the k-Nearest Neighbours algorithm to 120 songs for k = 3 accuracy reaches 22.5%, k = 5 accuracy reaches 22.5%, k = 7 accuracy reaching 26.7% and k = 9 accuracy reaches 26.7 %. Results showed that genre classification by matching the shortest distance through the centre of the class, yields better results than using the k-NN algorithm.","PeriodicalId":327070,"journal":{"name":"2016 International Conference on Information Management and Technology (ICIMTech)","volume":"109 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Information Management and Technology (ICIMTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIMTECH.2016.7930314","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Music genre is getting complex from time to time. As the size of digital media grows along with amount of data, manual search of digital audio files according to its genre is considered impractical and inefficient; therefore a classification mechanism is needed to improve searching. Zero Crossing Rate (ZCR), Average Energy (E) and Silent Ratio (SR) are a few of features that can be extracted from digital audio files to classify its genre. This research is conducted to classify music from digital audio (songs) into 12 genres: Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop, Electronic, Reggae and Rock using above mentioned features, extracted from WAV audio files. Classification is performed several times using selected 3, 6, 9 and 12 genres respectively. The result shows that classification of 3 music genres (Ballad, Blues, Classic) has the highest accuracy (96.67%), followed by 6 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz) with 70%, and 9 genres (Ballad, Blues, Classic, Harmony, Hip Hop, Jazz, Keroncong, Latin, Pop) with 53.33% accuracy. Classification of all 12 music genres yields the lowest accuracy of 33.33%. The test results with the k-Nearest Neighbours algorithm to 120 songs for k = 3 accuracy reaches 22.5%, k = 5 accuracy reaches 22.5%, k = 7 accuracy reaching 26.7% and k = 9 accuracy reaches 26.7 %. Results showed that genre classification by matching the shortest distance through the centre of the class, yields better results than using the k-NN algorithm.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
最近质心分类器与k近邻音乐类型分类的比较
音乐类型越来越复杂。随着数字媒体的规模和数据量的增长,按类型手动搜索数字音频文件被认为是不切实际和低效的;因此,需要一种分类机制来提高搜索效率。过零率(Zero Crossing Rate, ZCR)、平均能量(Average Energy, E)和无声比(Silent Ratio, SR)是可以从数字音频文件中提取的几个特征,可以用来对其类型进行分类。本研究利用上述特征,从WAV音频文件中提取音乐,将数字音频(歌曲)中的音乐分为12种类型:民谣、布鲁斯、古典、和声、嘻哈、爵士、Keroncong、拉丁、流行、电子、雷鬼和摇滚。分别使用选定的3、6、9和12个类型进行多次分类。结果表明,对3种音乐类型(Ballad、Blues、Classic)的分类准确率最高(96.67%),其次是6种类型(Ballad、Blues、Classic、Harmony、Hip Hop、Jazz),准确率为70%,9种类型(Ballad、Blues、Classic、Harmony、Hip Hop、Jazz、Keroncong、Latin、Pop),准确率为53.33%。对所有12种音乐类型进行分类的准确率最低,为33.33%。k =最近邻算法对120首歌曲的测试结果,k = 3准确率达到22.5%,k = 5准确率达到22.5%,k = 7准确率达到26.7%,k = 9准确率达到26.7%。结果表明,通过类中心匹配最短距离的类型分类比使用k-NN算法的分类效果更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Bingar: A computer-assisted speaking exam using integrated multimedia ChinaLatex: Web-based assessment system for ubiquitous Chinese education Understanding the influence of poor scope management affecting the successful of an IT Project The implementation of ERP systems using ASAP methodology: (Case study: OpenERP 9.0 application on sales & distribution module at PT. XYZ) Information system model of succession planning and career path
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1