Application development for recognizing type of infant's cry sound

Welly Setiawan Limantoro, C. Fatichah, Umi Laili Yuhana
{"title":"Application development for recognizing type of infant's cry sound","authors":"Welly Setiawan Limantoro, C. Fatichah, Umi Laili Yuhana","doi":"10.1109/ICTS.2016.7910291","DOIUrl":null,"url":null,"abstract":"Crying infant is a sign of baby who has a problem. But, some people are not able to recognize the meaning of infant's cry. Several researches to recognize infant's cry sound had been done by some researchers, but there is still no research that develop an application which able to recognize type of infant's cry sound based on web. In this research, an application is developed to help users identify the sound of crying infant based on Dunstan Baby Language. The method applied in this application are Mel-Frequency Cepstral Coefficient (MFCC) feature extraction for infant's cry sound, normalization of feature extraction result, and K-nearest neighbor classification. From the various tests performed, it can be concluded that highest average accuracy of 75.95 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data and 15 percent of test data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor with k equal to 1 classification. While testing application by using all data, average accuracy of 96.57 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor k equal to 1 classification. From that test, it can be concluded that the application has been running well when classifying all types of infant's cry sound data.","PeriodicalId":177275,"journal":{"name":"2016 International Conference on Information & Communication Technology and Systems (ICTS)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Information & Communication Technology and Systems (ICTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTS.2016.7910291","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Crying infant is a sign of baby who has a problem. But, some people are not able to recognize the meaning of infant's cry. Several researches to recognize infant's cry sound had been done by some researchers, but there is still no research that develop an application which able to recognize type of infant's cry sound based on web. In this research, an application is developed to help users identify the sound of crying infant based on Dunstan Baby Language. The method applied in this application are Mel-Frequency Cepstral Coefficient (MFCC) feature extraction for infant's cry sound, normalization of feature extraction result, and K-nearest neighbor classification. From the various tests performed, it can be concluded that highest average accuracy of 75.95 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data and 15 percent of test data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor with k equal to 1 classification. While testing application by using all data, average accuracy of 96.57 percent can be obtained by using parameters consist of 0.08 seconds wintime in MFCC feature extraction, 85 percent of training data from any type of infant's cry sound, feature extraction normalization by Standard Deviation Normalization, and K-nearest Neighbor k equal to 1 classification. From that test, it can be concluded that the application has been running well when classifying all types of infant's cry sound data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
婴儿哭声类型识别的应用开发
婴儿哭是婴儿有问题的迹象。但是,有些人不能认识到婴儿哭泣的意义。一些研究者已经对婴儿哭声类型的识别进行了一些研究,但目前还没有研究开发出一种基于web的婴儿哭声类型识别应用程序。在本研究中,开发了一个基于邓斯坦婴儿语言的应用程序来帮助用户识别婴儿哭泣的声音。本应用中采用的方法是婴儿哭声的Mel-Frequency Cepstral Coefficient (MFCC)特征提取、特征提取结果的归一化、k近邻分类。从所进行的各种测试中可以得出结论,使用MFCC特征提取的0.08秒wintime、85%的训练数据和15%的任何类型婴儿哭声声音的测试数据、通过标准偏差归一化对特征提取进行归一化以及k = 1分类的k近邻,可以获得最高的平均准确率75.95%。在使用所有数据对应用程序进行测试时,使用MFCC特征提取的wintime为0.08秒,85%的训练数据来自任何类型的婴儿哭声,特征提取归一化采用标准差归一化,k近邻k等于1分类的参数,平均准确率为96.57%。从测试中可以得出结论,该应用程序在对所有类型的婴儿哭声数据进行分类时运行良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Detecting source code plagiarism on introductory programming course assignments using a bytecode approach Indonesia scholarship selection framework using fuzzy inferences system approach: Case study: “Bidik Misi” scholarship selection Face recognition based on Extended Symmetric Local Graph Structure Early detection study of Kidney Organ Complication caused by Diabetes Mellitus using iris image color constancy Heart murmurs extraction using the complete Ensemble Empirical Mode Decomposition and the Pearson distance metric
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1