婴儿哭声分类的音频特征和DTW算法研究

Xilin Yu, Laishuan Wang, Xian Zhao, Chunmei Lu, X. Long, Wei Chen
{"title":"婴儿哭声分类的音频特征和DTW算法研究","authors":"Xilin Yu, Laishuan Wang, Xian Zhao, Chunmei Lu, X. Long, Wei Chen","doi":"10.1145/3375923.3375929","DOIUrl":null,"url":null,"abstract":"Cry is the most common phenomenon among infants, and it has been reported that babies cry for multiple reasons. Infant cry signals are thought to convey much useful information about the physiological and pathological state of the baby. Hence, in this work we analyzed these audio signals in order to classify different reasons of cries. Cry signals were especially collected for this study including three causes, namely hunger, pain and uncertainty. Modified MFCC features besides basic acoustic features were extracted from each recording. After intergroup variance examination, nine features were selected and subjected to a novel matching process based on Dynamic Time Warping (DTW) for separating infant cries. Experiment results show that nine selected features are effective to recognize cries caused by hunger, pain and other uncertain reasons. The proposed approach for infant cry analysis will provide useful information for designing towards an automatic system for detecting physiological and pathological state of the baby","PeriodicalId":20457,"journal":{"name":"Proceedings of the 2019 6th International Conference on Biomedical and Bioinformatics Engineering","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Investigation into Audio Features and DTW Algorithms for Infant Cry Classification\",\"authors\":\"Xilin Yu, Laishuan Wang, Xian Zhao, Chunmei Lu, X. Long, Wei Chen\",\"doi\":\"10.1145/3375923.3375929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cry is the most common phenomenon among infants, and it has been reported that babies cry for multiple reasons. Infant cry signals are thought to convey much useful information about the physiological and pathological state of the baby. Hence, in this work we analyzed these audio signals in order to classify different reasons of cries. Cry signals were especially collected for this study including three causes, namely hunger, pain and uncertainty. Modified MFCC features besides basic acoustic features were extracted from each recording. After intergroup variance examination, nine features were selected and subjected to a novel matching process based on Dynamic Time Warping (DTW) for separating infant cries. Experiment results show that nine selected features are effective to recognize cries caused by hunger, pain and other uncertain reasons. The proposed approach for infant cry analysis will provide useful information for designing towards an automatic system for detecting physiological and pathological state of the baby\",\"PeriodicalId\":20457,\"journal\":{\"name\":\"Proceedings of the 2019 6th International Conference on Biomedical and Bioinformatics Engineering\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 6th International Conference on Biomedical and Bioinformatics Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3375923.3375929\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 6th International Conference on Biomedical and Bioinformatics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3375923.3375929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

哭泣是婴儿中最常见的现象,据报道,婴儿哭泣有多种原因。婴儿的哭声信号被认为传达了关于婴儿生理和病理状态的许多有用信息。因此,在这项工作中,我们分析了这些音频信号,以分类不同的哭泣原因。该研究特别收集了哭泣信号,包括三个原因,即饥饿,疼痛和不确定。从每段录音中提取除基本声学特征外的修正MFCC特征。通过组间方差检验,选取9个特征进行基于动态时间翘曲(DTW)的匹配处理,对婴儿哭声进行分类。实验结果表明,所选择的9个特征可以有效识别由饥饿、疼痛和其他不确定原因引起的哭声。提出的婴儿哭声分析方法将为设计婴儿生理和病理状态自动检测系统提供有用的信息
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An Investigation into Audio Features and DTW Algorithms for Infant Cry Classification
Cry is the most common phenomenon among infants, and it has been reported that babies cry for multiple reasons. Infant cry signals are thought to convey much useful information about the physiological and pathological state of the baby. Hence, in this work we analyzed these audio signals in order to classify different reasons of cries. Cry signals were especially collected for this study including three causes, namely hunger, pain and uncertainty. Modified MFCC features besides basic acoustic features were extracted from each recording. After intergroup variance examination, nine features were selected and subjected to a novel matching process based on Dynamic Time Warping (DTW) for separating infant cries. Experiment results show that nine selected features are effective to recognize cries caused by hunger, pain and other uncertain reasons. The proposed approach for infant cry analysis will provide useful information for designing towards an automatic system for detecting physiological and pathological state of the baby
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
DMBA Induction Increases H-Ras Gene Expression and Decreases CD8 Count in Sprague Dawley Rats Predicting the Types of Striking and Thrusting Motions by using Deep Learning A World Camera for Recording the Game Tactics in Martial Arts using Bamboo Swords In Vitro Safety Assessment and Permeation Study of Topical Lidocaine Solution for Ocular Administration An Investigation into Audio Features and DTW Algorithms for Infant Cry Classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1