{"title":"使用支持向量机和随机森林分类器分析婴儿哭声的实用方法","authors":"Jagadeesh Basavaiah, Audre Arlene Anthony","doi":"10.1007/s11277-024-11491-8","DOIUrl":null,"url":null,"abstract":"<p>A baby’s first spoken communication comes through crying. Before learning to convey their psychological/physiological needs or feelings using language, babies typically express how they feel by crying. Crying is a reaction to an inducement like pain, discomfort or hunger. Nevertheless, it is difficult sometimes to understand why a baby is crying. This will be annoying for a parents/guardian/caretaker, and therefore in this work, we are proposing an infant cry analysis and classification system to classify the kinds of crying of babies to assist parents/guardian/caretaker and attend to the needs of the babies. Presently, five distinct kinds of infant cries are identified: hunger (Neh), belly pain (Eairh), tiredness (Owh), discomfort (Heh) and burping (Eh). The database of this study consists of 456 audio recordings of 7 s each of 0–22-week-old babies. Feature extraction from each crying frame is carried out using Mel-frequency cepstral coefficients and the sequential forward floating selection is later used to choose highly discriminative features. Support Vector Machine and Random Forest classifiers are used for classification of infant crying. The results of the experiments has shown the performance of the proposed system with a accuracy of classification of 78% and 90.8% for Support Vector Machine and Random forest classifiers respectively.</p>","PeriodicalId":23827,"journal":{"name":"Wireless Personal Communications","volume":"76 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Pragmatic Approach for Infant Cry Analysis Using Support Vector Machine and Random Forest Classifiers\",\"authors\":\"Jagadeesh Basavaiah, Audre Arlene Anthony\",\"doi\":\"10.1007/s11277-024-11491-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>A baby’s first spoken communication comes through crying. Before learning to convey their psychological/physiological needs or feelings using language, babies typically express how they feel by crying. Crying is a reaction to an inducement like pain, discomfort or hunger. Nevertheless, it is difficult sometimes to understand why a baby is crying. This will be annoying for a parents/guardian/caretaker, and therefore in this work, we are proposing an infant cry analysis and classification system to classify the kinds of crying of babies to assist parents/guardian/caretaker and attend to the needs of the babies. Presently, five distinct kinds of infant cries are identified: hunger (Neh), belly pain (Eairh), tiredness (Owh), discomfort (Heh) and burping (Eh). The database of this study consists of 456 audio recordings of 7 s each of 0–22-week-old babies. Feature extraction from each crying frame is carried out using Mel-frequency cepstral coefficients and the sequential forward floating selection is later used to choose highly discriminative features. Support Vector Machine and Random Forest classifiers are used for classification of infant crying. The results of the experiments has shown the performance of the proposed system with a accuracy of classification of 78% and 90.8% for Support Vector Machine and Random forest classifiers respectively.</p>\",\"PeriodicalId\":23827,\"journal\":{\"name\":\"Wireless Personal Communications\",\"volume\":\"76 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wireless Personal Communications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s11277-024-11491-8\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wireless Personal Communications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11277-024-11491-8","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
A Pragmatic Approach for Infant Cry Analysis Using Support Vector Machine and Random Forest Classifiers
A baby’s first spoken communication comes through crying. Before learning to convey their psychological/physiological needs or feelings using language, babies typically express how they feel by crying. Crying is a reaction to an inducement like pain, discomfort or hunger. Nevertheless, it is difficult sometimes to understand why a baby is crying. This will be annoying for a parents/guardian/caretaker, and therefore in this work, we are proposing an infant cry analysis and classification system to classify the kinds of crying of babies to assist parents/guardian/caretaker and attend to the needs of the babies. Presently, five distinct kinds of infant cries are identified: hunger (Neh), belly pain (Eairh), tiredness (Owh), discomfort (Heh) and burping (Eh). The database of this study consists of 456 audio recordings of 7 s each of 0–22-week-old babies. Feature extraction from each crying frame is carried out using Mel-frequency cepstral coefficients and the sequential forward floating selection is later used to choose highly discriminative features. Support Vector Machine and Random Forest classifiers are used for classification of infant crying. The results of the experiments has shown the performance of the proposed system with a accuracy of classification of 78% and 90.8% for Support Vector Machine and Random forest classifiers respectively.
期刊介绍:
The Journal on Mobile Communication and Computing ...
Publishes tutorial, survey, and original research papers addressing mobile communications and computing;
Investigates theoretical, engineering, and experimental aspects of radio communications, voice, data, images, and multimedia;
Explores propagation, system models, speech and image coding, multiple access techniques, protocols, performance evaluation, radio local area networks, and networking and architectures, etc.;
98% of authors who answered a survey reported that they would definitely publish or probably publish in the journal again.
Wireless Personal Communications is an archival, peer reviewed, scientific and technical journal addressing mobile communications and computing. It investigates theoretical, engineering, and experimental aspects of radio communications, voice, data, images, and multimedia. A partial list of topics included in the journal is: propagation, system models, speech and image coding, multiple access techniques, protocols performance evaluation, radio local area networks, and networking and architectures.
In addition to the above mentioned areas, the journal also accepts papers that deal with interdisciplinary aspects of wireless communications along with: big data and analytics, business and economy, society, and the environment.
The journal features five principal types of papers: full technical papers, short papers, technical aspects of policy and standardization, letters offering new research thoughts and experimental ideas, and invited papers on important and emerging topics authored by renowned experts.