西方记谱法与卡纳蒂克记谱法图像分类模型的比较研究——从西方音乐记谱法到卡纳蒂克音乐记谱法的转换

V. K. Prathyushaa, P. Chandrasekar, R. Anuradha
{"title":"西方记谱法与卡纳蒂克记谱法图像分类模型的比较研究——从西方音乐记谱法到卡纳蒂克音乐记谱法的转换","authors":"V. K. Prathyushaa, P. Chandrasekar, R. Anuradha","doi":"10.1109/I-SMAC52330.2021.9641052","DOIUrl":null,"url":null,"abstract":"Western music notation converter is essential in the field of music for the conversion of music notes which are mostly in Western staff notation to its Carnatic music note equivalent. It is difficult for the Carnatic musicians and singers to have the music notes in the form of the corresponding Swaras in Carnatic music to comprehend them. Over the past few decades, researchers have built models that recognise handwritten musical notations called Optical Music Recognition (OMR) [9]. But, when researchers who are from a non-musical background work with digital representations, the task becomes tedious and a need for processing the images arises. Therefore, instead of relying on humans for conversion of notations, image processing models are used with the help of transfer learning and classification is done using 4 models, of which 3 are pre-trained, i.e., ResNet50, VGG19, InceptionV3 and one is a simple CNN model. The models provide competitive results when compared to human experts labelling of datasets.","PeriodicalId":178783,"journal":{"name":"2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)","volume":"187 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Comparative Study of Image Classification Models for Western Notation to Carnatic Notation : Conversion of Western Music Notation to Carnatic Music Notation\",\"authors\":\"V. K. Prathyushaa, P. Chandrasekar, R. Anuradha\",\"doi\":\"10.1109/I-SMAC52330.2021.9641052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Western music notation converter is essential in the field of music for the conversion of music notes which are mostly in Western staff notation to its Carnatic music note equivalent. It is difficult for the Carnatic musicians and singers to have the music notes in the form of the corresponding Swaras in Carnatic music to comprehend them. Over the past few decades, researchers have built models that recognise handwritten musical notations called Optical Music Recognition (OMR) [9]. But, when researchers who are from a non-musical background work with digital representations, the task becomes tedious and a need for processing the images arises. Therefore, instead of relying on humans for conversion of notations, image processing models are used with the help of transfer learning and classification is done using 4 models, of which 3 are pre-trained, i.e., ResNet50, VGG19, InceptionV3 and one is a simple CNN model. The models provide competitive results when compared to human experts labelling of datasets.\",\"PeriodicalId\":178783,\"journal\":{\"name\":\"2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)\",\"volume\":\"187 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I-SMAC52330.2021.9641052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I-SMAC52330.2021.9641052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

西方音乐符号转换器在音乐领域中是必不可少的,主要用于将西方五线谱中的音符转换为卡纳蒂克音符。卡纳蒂克的音乐家和歌手很难理解卡纳蒂克音乐中相应的斯瓦拉形式的音符。在过去的几十年里,研究人员已经建立了识别手写乐谱的模型,称为光学音乐识别(OMR)[9]。但是,当来自非音乐背景的研究人员处理数字表示时,任务变得乏味,并且需要处理图像。因此,我们不再依赖人类进行符号转换,而是借助迁移学习的图像处理模型,使用4个模型进行分类,其中3个是预训练的,分别是ResNet50、VGG19、InceptionV3,还有一个是简单的CNN模型。与人类专家标记数据集相比,这些模型提供了有竞争力的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Comparative Study of Image Classification Models for Western Notation to Carnatic Notation : Conversion of Western Music Notation to Carnatic Music Notation
Western music notation converter is essential in the field of music for the conversion of music notes which are mostly in Western staff notation to its Carnatic music note equivalent. It is difficult for the Carnatic musicians and singers to have the music notes in the form of the corresponding Swaras in Carnatic music to comprehend them. Over the past few decades, researchers have built models that recognise handwritten musical notations called Optical Music Recognition (OMR) [9]. But, when researchers who are from a non-musical background work with digital representations, the task becomes tedious and a need for processing the images arises. Therefore, instead of relying on humans for conversion of notations, image processing models are used with the help of transfer learning and classification is done using 4 models, of which 3 are pre-trained, i.e., ResNet50, VGG19, InceptionV3 and one is a simple CNN model. The models provide competitive results when compared to human experts labelling of datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on the Modeling of Fast Face Recognition Against Age Disturbance under Deep Learning Design of IoT Network using Deep Learning-based Model for Anomaly Detection Analysis of the Impact of Blockchain and Net Technology on the Financial Governance of Internet Enterprises Affective Music Player for Multiple Emotion Recognition Using Facial Expressions with SVM A Deep Learning technology based covid-19 prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1