Automatic classification of dog barking using deep learning

IF 1.5 4区 生物学 Q4 BEHAVIORAL SCIENCES Behavioural Processes Pub Date : 2024-04-20 DOI:10.1016/j.beproc.2024.105028
José Ramón Gómez-Armenta , Humberto Pérez-Espinosa , José Alberto Fernández-Zepeda , Verónica Reyes-Meza
{"title":"Automatic classification of dog barking using deep learning","authors":"José Ramón Gómez-Armenta ,&nbsp;Humberto Pérez-Espinosa ,&nbsp;José Alberto Fernández-Zepeda ,&nbsp;Verónica Reyes-Meza","doi":"10.1016/j.beproc.2024.105028","DOIUrl":null,"url":null,"abstract":"<div><p>Barking and other dog vocalizations have acoustic properties related to emotions, physiological reactions, attitudes, or some particular internal states. In the field of intelligent audio analysis, researchers use methods based on signal processing and machine learning to analyze the digitized acoustic signals’ properties and obtain relevant information. The present work describes a method to classify the identity, breed, age, sex, and context associated with each bark. This information can support the decisions of people who regularly interact with animals, such as dog trainers, veterinarians, rescuers, police, people with visual impairment. Our approach uses deep neural networks to generate trained models for each classification task. We worked with 19,643 barks recorded from 113 dogs of different breeds, ages and sexes. Our methodology consists of three stages. First, the pre-processing stage prepares the data and transforms it into the appropriate format for each classification model. Second, the characterization stage evaluates different representation models to identify the most suitable for each task. Third, the classification stage trains each classification model and selects the best hyperparameters. After tuning and training each model, we evaluated its performance. We analyzed the most relevant features extracted from the audio and the most appropriate deep neural network architecture for that feature type. Even if the application of our method is not ready for being used in ethological practice, our evaluation showed an outstanding performance of the proposed method, surpassing previous research results on this topic, providing the basis for further technological development.</p></div>","PeriodicalId":8746,"journal":{"name":"Behavioural Processes","volume":"218 ","pages":"Article 105028"},"PeriodicalIF":1.5000,"publicationDate":"2024-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavioural Processes","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0376635724000433","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Barking and other dog vocalizations have acoustic properties related to emotions, physiological reactions, attitudes, or some particular internal states. In the field of intelligent audio analysis, researchers use methods based on signal processing and machine learning to analyze the digitized acoustic signals’ properties and obtain relevant information. The present work describes a method to classify the identity, breed, age, sex, and context associated with each bark. This information can support the decisions of people who regularly interact with animals, such as dog trainers, veterinarians, rescuers, police, people with visual impairment. Our approach uses deep neural networks to generate trained models for each classification task. We worked with 19,643 barks recorded from 113 dogs of different breeds, ages and sexes. Our methodology consists of three stages. First, the pre-processing stage prepares the data and transforms it into the appropriate format for each classification model. Second, the characterization stage evaluates different representation models to identify the most suitable for each task. Third, the classification stage trains each classification model and selects the best hyperparameters. After tuning and training each model, we evaluated its performance. We analyzed the most relevant features extracted from the audio and the most appropriate deep neural network architecture for that feature type. Even if the application of our method is not ready for being used in ethological practice, our evaluation showed an outstanding performance of the proposed method, surpassing previous research results on this topic, providing the basis for further technological development.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用深度学习对狗叫声进行自动分类
狗叫和其他狗的发声具有与情绪、生理反应、态度或某些特定内部状态相关的声学特性。在智能音频分析领域,研究人员使用基于信号处理和机器学习的方法来分析数字化声学信号的特性并获取相关信息。本作品介绍了一种对每一声吠叫的相关身份、品种、年龄、性别和背景进行分类的方法。这些信息可为经常与动物打交道的人提供决策支持,如驯狗师、兽医、救援人员、警察、视力障碍者等。我们的方法使用深度神经网络为每个分类任务生成训练有素的模型。我们使用了 113 种不同品种、年龄和性别的狗记录的 19643 次吠叫。我们的方法包括三个阶段。首先,预处理阶段准备数据,并将其转换为适合每个分类模型的格式。第二,特征描述阶段对不同的表示模型进行评估,以确定最适合每项任务的模型。第三,分类阶段训练每个分类模型,并选择最佳超参数。在对每个模型进行调整和训练后,我们对其性能进行了评估。我们分析了从音频中提取的最相关特征,以及最适合该特征类型的深度神经网络架构。尽管我们的方法还不能应用于伦理学实践,但我们的评估结果表明,所提出的方法性能卓越,超越了之前在该主题上的研究成果,为进一步的技术开发提供了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Behavioural Processes
Behavioural Processes 生物-动物学
CiteScore
2.70
自引率
7.70%
发文量
144
审稿时长
4-8 weeks
期刊介绍: Behavioural Processes is dedicated to the publication of high-quality original research on animal behaviour from any theoretical perspective. It welcomes contributions that consider animal behaviour from behavioural analytic, cognitive, ethological, ecological and evolutionary points of view. This list is not intended to be exhaustive, and papers that integrate theory and methodology across disciplines are particularly welcome.
期刊最新文献
Low mobility and similar odors result in peaceful interspecific encounters in termites Enhancement of the hedonic value of an expected reinforcer following prior frustration: An animal model for studying binge-like episodes The front legs of Sepsis flies (Diptera: Sepsidae) are stimulation devices. Diurnal patterns of movement and feeding for an unfished population of Actinopyga echinites (Jaeger, 1833) in a protected marine sanctuary on Ningaloo Reef, northwestern Australia Combined effects of reinstatement and spontaneous recovery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1