通过大象放牧优化卷积门控递归神经网络进行早产儿哭声分类

IF 1.8 3区工程技术 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Circuits, Systems and Signal Processing Pub Date : 2024-07-04 DOI:10.1007/s00034-024-02764-5

V. Vaishnavi, M. Braveen, N. Muthukumaran, P. Poonkodi

{"title":"通过大象放牧优化卷积门控递归神经网络进行早产儿哭声分类","authors":"V. Vaishnavi, M. Braveen, N. Muthukumaran, P. Poonkodi","doi":"10.1007/s00034-024-02764-5","DOIUrl":null,"url":null,"abstract":"<p>Premature babies scream to make contact with their mothers or other people. Infants communicate via their screams in different ways based on the motivation behind their cries. A considerable amount of work and focus is required these days to preprocess, extract features, and classify audio signals. This research aims to propose a novel Elephant Herding Optimized Deep Convolutional Gated Recurrent Neural Network (EHO-DCGR net) for classifying cry signals from premature babies. Cry signals are first preprocessed to remove distortion caused by short sample times. MFCC (Mel-frequency cepstral coefficient), Power Normalized Cepstral Coefficients (PNCC), BFCC (Bark-frequency cepstral coefficient), and LPCC (Linear Prediction cepstral coefficient) are used to identify abnormal weeping through their prosodic aspects. The Elephant Herding optimization (EHO) algorithm is utilized for choosing the best features from the extracted set to form a fused feature matrix. These characteristics are then used to categorize premature baby cry sounds using the DCGR net. The proposed EHO-DCGR net effectiveness is measured by precision, specificity, recall, and F1-score, accuracy. According to experimental fallouts, the proposed EHO-DCGR net detects baby cry signals with an astounding 98.45% classification accuracy. From the experimental analysis, the EHO-DCGR Net increases the overall accuracy by 12.64%, 3.18%, 9.71% and 3.50% better than MFCC-SVM, DFFNN, SVM-RBF and SGDM respectively.</p>","PeriodicalId":10227,"journal":{"name":"Circuits, Systems and Signal Processing","volume":"40 1","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Premature Infant Cry Classification via Elephant Herding Optimized Convolutional Gated Recurrent Neural Network\",\"authors\":\"V. Vaishnavi, M. Braveen, N. Muthukumaran, P. Poonkodi\",\"doi\":\"10.1007/s00034-024-02764-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Premature babies scream to make contact with their mothers or other people. Infants communicate via their screams in different ways based on the motivation behind their cries. A considerable amount of work and focus is required these days to preprocess, extract features, and classify audio signals. This research aims to propose a novel Elephant Herding Optimized Deep Convolutional Gated Recurrent Neural Network (EHO-DCGR net) for classifying cry signals from premature babies. Cry signals are first preprocessed to remove distortion caused by short sample times. MFCC (Mel-frequency cepstral coefficient), Power Normalized Cepstral Coefficients (PNCC), BFCC (Bark-frequency cepstral coefficient), and LPCC (Linear Prediction cepstral coefficient) are used to identify abnormal weeping through their prosodic aspects. The Elephant Herding optimization (EHO) algorithm is utilized for choosing the best features from the extracted set to form a fused feature matrix. These characteristics are then used to categorize premature baby cry sounds using the DCGR net. The proposed EHO-DCGR net effectiveness is measured by precision, specificity, recall, and F1-score, accuracy. According to experimental fallouts, the proposed EHO-DCGR net detects baby cry signals with an astounding 98.45% classification accuracy. From the experimental analysis, the EHO-DCGR Net increases the overall accuracy by 12.64%, 3.18%, 9.71% and 3.50% better than MFCC-SVM, DFFNN, SVM-RBF and SGDM respectively.</p>\",\"PeriodicalId\":10227,\"journal\":{\"name\":\"Circuits, Systems and Signal Processing\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Circuits, Systems and Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s00034-024-02764-5\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circuits, Systems and Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s00034-024-02764-5","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

早产儿尖叫是为了与母亲或其他人取得联系。根据哭声背后的动机，婴儿通过尖叫以不同的方式进行交流。如今，对音频信号进行预处理、提取特征和分类需要大量的工作和关注。本研究旨在提出一种新颖的大象放牧优化深度卷积门控递归神经网络（EHO-DCGR net），用于对早产儿的哭声信号进行分类。首先对哭声信号进行预处理，以消除因采样时间短而造成的失真。采用 MFCC（梅尔频率前谱系数）、功率归一化前谱系数（PNCC）、BFCC（吠声频率前谱系数）和 LPCC（线性预测前谱系数），通过其前音方面来识别异常哭声。利用大象放牧优化（EHO）算法从提取的特征集中选择最佳特征，形成融合特征矩阵。然后使用 DCGR 网将这些特征用于早产婴儿哭声的分类。拟议的 EHO-DCGR 网的有效性通过精确度、特异性、召回率和 F1 分数、准确度来衡量。实验结果表明，拟议的 EHO-DCGR 网检测婴儿哭声信号的分类准确率高达 98.45%。从实验分析来看，EHO-DCGR 网络比 MFCC-SVM、DFFNN、SVM-RBF 和 SGDM 分别提高了 12.64%、3.18%、9.71% 和 3.50%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Premature Infant Cry Classification via Elephant Herding Optimized Convolutional Gated Recurrent Neural Network

Premature babies scream to make contact with their mothers or other people. Infants communicate via their screams in different ways based on the motivation behind their cries. A considerable amount of work and focus is required these days to preprocess, extract features, and classify audio signals. This research aims to propose a novel Elephant Herding Optimized Deep Convolutional Gated Recurrent Neural Network (EHO-DCGR net) for classifying cry signals from premature babies. Cry signals are first preprocessed to remove distortion caused by short sample times. MFCC (Mel-frequency cepstral coefficient), Power Normalized Cepstral Coefficients (PNCC), BFCC (Bark-frequency cepstral coefficient), and LPCC (Linear Prediction cepstral coefficient) are used to identify abnormal weeping through their prosodic aspects. The Elephant Herding optimization (EHO) algorithm is utilized for choosing the best features from the extracted set to form a fused feature matrix. These characteristics are then used to categorize premature baby cry sounds using the DCGR net. The proposed EHO-DCGR net effectiveness is measured by precision, specificity, recall, and F1-score, accuracy. According to experimental fallouts, the proposed EHO-DCGR net detects baby cry signals with an astounding 98.45% classification accuracy. From the experimental analysis, the EHO-DCGR Net increases the overall accuracy by 12.64%, 3.18%, 9.71% and 3.50% better than MFCC-SVM, DFFNN, SVM-RBF and SGDM respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Circuits, Systems and Signal Processing 工程技术-工程：电子与电气

CiteScore

4.80

自引率

13.00%

发文量

321

审稿时长

4.6 months

期刊介绍： Rapid developments in the analog and digital processing of signals for communication, control, and computer systems have made the theory of electrical circuits and signal processing a burgeoning area of research and design. The aim of Circuits, Systems, and Signal Processing (CSSP) is to help meet the needs of outlets for significant research papers and state-of-the-art review articles in the area. The scope of the journal is broad, ranging from mathematical foundations to practical engineering design. It encompasses, but is not limited to, such topics as linear and nonlinear networks, distributed circuits and systems, multi-dimensional signals and systems, analog filters and signal processing, digital filters and signal processing, statistical signal processing, multimedia, computer aided design, graph theory, neural systems, communication circuits and systems, and VLSI signal processing. The Editorial Board is international, and papers are welcome from throughout the world. The journal is devoted primarily to research papers, but survey, expository, and tutorial papers are also published. Circuits, Systems, and Signal Processing (CSSP) is published twelve times annually.