Deteksi Onset Gamelan Bebasis DWPT dan BLSTM

Hisyam Mustofa, A. E. Putra
{"title":"Deteksi Onset Gamelan Bebasis DWPT dan BLSTM","authors":"Hisyam Mustofa, A. E. Putra","doi":"10.22146/ijeis.79534","DOIUrl":null,"url":null,"abstract":"Gamelan consists of various kinds of instruments that have different characteristics. Each has characteristics in terms of the basic frequency, amplitude, signal envelope, and different ways of playing it, resulting in differences in the sustain power of the signal. These characteristics cause the problem of vanishing gradient in the Elman Network model which was used in previous studies in studying the onset detection in the Saron instrument signal which has an average interval of more than 0.6 seconds. This study uses BLSTM (Bidirectional Long Short Term Memory) as a model for training and Wavelet Packet Transformation to design a psychoacoustic critical bandwidth as a model for feature extraction. For the peak picking method, this study uses a fixed threshold method with a value of 0.25. The use of the BLSTM model supported by the Wavelet Packet Transform is expected to overcome the vanishing gradient that exists in a simple RNN architecture. The model was tested based on 3 evaluation parameters, namely precision, recall and F-Measure. Based on the test scenario carried out, the model can overcome the vanishing gradient problem on the Saron instrument which has an average interval between onset of 600 ms. Out of a total of 428 onsets on the Saron instrument, the model successfully detected 426 correctly, with 4 incorrectly detected onsets and 2 undetected onsets. A thorough evaluation for each of the precision, recall, and F1-Measure algorithms obtained 0.975, 0.945 and 0.960.","PeriodicalId":31590,"journal":{"name":"IJEIS Indonesian Journal of Electronics and Instrumentation Systems","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJEIS Indonesian Journal of Electronics and Instrumentation Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22146/ijeis.79534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Gamelan consists of various kinds of instruments that have different characteristics. Each has characteristics in terms of the basic frequency, amplitude, signal envelope, and different ways of playing it, resulting in differences in the sustain power of the signal. These characteristics cause the problem of vanishing gradient in the Elman Network model which was used in previous studies in studying the onset detection in the Saron instrument signal which has an average interval of more than 0.6 seconds. This study uses BLSTM (Bidirectional Long Short Term Memory) as a model for training and Wavelet Packet Transformation to design a psychoacoustic critical bandwidth as a model for feature extraction. For the peak picking method, this study uses a fixed threshold method with a value of 0.25. The use of the BLSTM model supported by the Wavelet Packet Transform is expected to overcome the vanishing gradient that exists in a simple RNN architecture. The model was tested based on 3 evaluation parameters, namely precision, recall and F-Measure. Based on the test scenario carried out, the model can overcome the vanishing gradient problem on the Saron instrument which has an average interval between onset of 600 ms. Out of a total of 428 onsets on the Saron instrument, the model successfully detected 426 correctly, with 4 incorrectly detected onsets and 2 undetected onsets. A thorough evaluation for each of the precision, recall, and F1-Measure algorithms obtained 0.975, 0.945 and 0.960.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
检测DWPT和BLSTM的Onset自由游戏系统
Gamelan由具有不同特征的各种乐器组成。每种信号在基频、振幅、信号包络和不同的播放方式方面都有特点,导致信号的维持功率不同。这些特征导致Elman网络模型中的消失梯度问题,该模型在先前的研究中用于研究平均间隔超过0.6秒的Saron仪器信号中的发作检测。本研究使用BLSTM(双向长短期记忆)作为训练模型,并使用小波包变换设计心理声学临界带宽作为特征提取模型。对于峰值拾取方法,本研究使用值为0.25的固定阈值方法。使用小波包变换支持的BLSTM模型有望克服简单RNN架构中存在的消失梯度。该模型基于精度、召回率和F-Measure三个评价参数进行了测试。基于所执行的测试场景,该模型可以克服Saron仪器上的消失梯度问题,该问题的平均发作间隔为600ms。在Saron仪器总共428次发作中,该模型成功检测到426次正确发作,其中4次检测不正确,2次未检测到。对精度、召回率和F1 Measure算法的每种算法进行彻底评估,分别获得0.975、0.945和0.960。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
12 weeks
期刊最新文献
Utilization of Sensor technology as a Sport Technology Innovation in Athlete Performance Measurement: Research Trends Pemilah Jenis Daun Mangga Melalui Deteksi RGB Menggunakan Sistem Pengolahan Citra Sistem Pengering Daun Kelor Berbasis Internet of Things dan Artificial Intteligence Penghematan Daya Pada Sensor Node Sistem Monitoring Kualitas Udara Design and Build Hydroponic Installations and Applications Using IoT-Based Multisensors with Solar Panel Electrical Energy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1