基于音频信号的节奏分类场景深度学习

Muljono Muljono, Pulung Nurtantio Andono, Sari Ayu Wulandari, Harun Al Azies, Muhammad Naufal
{"title":"基于音频信号的节奏分类场景深度学习","authors":"Muljono Muljono, Pulung Nurtantio Andono, Sari Ayu Wulandari, Harun Al Azies, Muhammad Naufal","doi":"10.11591/ijai.v13.i2.pp1687-1701","DOIUrl":null,"url":null,"abstract":"This article explains how to determine the tempo of the kendhang, an Indonesian traditional melodic instrument. This research presents novelty as technological research related to gamelan instruments, which has rarely been achieved thus far, through the introduction of kendhang tempo types through the sounds produced, with the hope of creating an automatic system that can recognize the kendhang tempo during a gamelan performance. The testing in this work will categorize the tempo of kendhang into three categories: slow, medium, and fast, utilizing one of the two scenario models proposed, mel frequency cepstral coefficients (MFCC) and convolutional neural network (CNN) in the first scenario, and mel spectrogram and CNN in the second. Kendhang's original audio data, which was captured in real time and later enhanced, makes up the data set. The model 1 scenario, which entails feature extraction using MFCC and classification using the CNN classification approach, is the best scenario in this research, based on the experimental results. When compared to the other suggested modeling scenarios, model 1 has a level of 97%, an average accuracy, and a gain value of 96.67%, making it a solid assistant in terms of kendhang's good tempo recognition accuracy.","PeriodicalId":507934,"journal":{"name":"IAES International Journal of Artificial Intelligence (IJ-AI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning for audio signal-based tempo classification scenarios\",\"authors\":\"Muljono Muljono, Pulung Nurtantio Andono, Sari Ayu Wulandari, Harun Al Azies, Muhammad Naufal\",\"doi\":\"10.11591/ijai.v13.i2.pp1687-1701\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article explains how to determine the tempo of the kendhang, an Indonesian traditional melodic instrument. This research presents novelty as technological research related to gamelan instruments, which has rarely been achieved thus far, through the introduction of kendhang tempo types through the sounds produced, with the hope of creating an automatic system that can recognize the kendhang tempo during a gamelan performance. The testing in this work will categorize the tempo of kendhang into three categories: slow, medium, and fast, utilizing one of the two scenario models proposed, mel frequency cepstral coefficients (MFCC) and convolutional neural network (CNN) in the first scenario, and mel spectrogram and CNN in the second. Kendhang's original audio data, which was captured in real time and later enhanced, makes up the data set. The model 1 scenario, which entails feature extraction using MFCC and classification using the CNN classification approach, is the best scenario in this research, based on the experimental results. When compared to the other suggested modeling scenarios, model 1 has a level of 97%, an average accuracy, and a gain value of 96.67%, making it a solid assistant in terms of kendhang's good tempo recognition accuracy.\",\"PeriodicalId\":507934,\"journal\":{\"name\":\"IAES International Journal of Artificial Intelligence (IJ-AI)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IAES International Journal of Artificial Intelligence (IJ-AI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11591/ijai.v13.i2.pp1687-1701\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IAES International Journal of Artificial Intelligence (IJ-AI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11591/ijai.v13.i2.pp1687-1701","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文阐述了如何确定印尼传统旋律乐器肯德汉琴的节奏。这项研究通过声音来介绍肯德汉琴的节奏类型,希望创建一个能在加麦兰演奏中识别肯德汉琴节奏的自动系统,从而展示了与加麦兰乐器相关的技术研究的新颖性,迄今为止还很少有人能做到这一点。本作品中的测试将把肯德杭的节奏分为慢、中、快三类,并利用所提出的两种情景模式之一:第一种情景模式是梅尔频率倒频谱系数(MFCC)和卷积神经网络(CNN),第二种情景模式是梅尔频谱图和 CNN。数据集由 Kendhang 的原始音频数据组成,这些数据是实时采集的,随后进行了增强。根据实验结果,模型 1(使用 MFCC 提取特征并使用 CNN 分类方法进行分类)是本研究的最佳方案。与其他建议的建模方案相比,模型 1 的水平为 97%,平均准确率为 96.67%,增益值为 96.67%,是 kendhang 良好节奏识别准确率的可靠助手。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Deep learning for audio signal-based tempo classification scenarios
This article explains how to determine the tempo of the kendhang, an Indonesian traditional melodic instrument. This research presents novelty as technological research related to gamelan instruments, which has rarely been achieved thus far, through the introduction of kendhang tempo types through the sounds produced, with the hope of creating an automatic system that can recognize the kendhang tempo during a gamelan performance. The testing in this work will categorize the tempo of kendhang into three categories: slow, medium, and fast, utilizing one of the two scenario models proposed, mel frequency cepstral coefficients (MFCC) and convolutional neural network (CNN) in the first scenario, and mel spectrogram and CNN in the second. Kendhang's original audio data, which was captured in real time and later enhanced, makes up the data set. The model 1 scenario, which entails feature extraction using MFCC and classification using the CNN classification approach, is the best scenario in this research, based on the experimental results. When compared to the other suggested modeling scenarios, model 1 has a level of 97%, an average accuracy, and a gain value of 96.67%, making it a solid assistant in terms of kendhang's good tempo recognition accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FinTech forecasting using an evolving connectionist system for lenders and borrowers: ecosystem behavior Dealing imbalance dataset problem in sentiment analysis of recession in Indonesia A survey on planet leaf disease identification and classification by various machine-learning technique Effect of dataset distribution on automatic road extraction in very high-resolution orthophoto using DeepLab V3+ Feature selection techniques for microarray dataset: a review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1