Language-assisted deep learning for autistic behaviors recognition

Q2 Health Professions Smart Health Pub Date : 2023-12-22 DOI:10.1016/j.smhl.2023.100444
Andong Deng , Taojiannan Yang , Chen Chen , Qian Chen , Leslie Neely , Sakiko Oyama
{"title":"Language-assisted deep learning for autistic behaviors recognition","authors":"Andong Deng ,&nbsp;Taojiannan Yang ,&nbsp;Chen Chen ,&nbsp;Qian Chen ,&nbsp;Leslie Neely ,&nbsp;Sakiko Oyama","doi":"10.1016/j.smhl.2023.100444","DOIUrl":null,"url":null,"abstract":"<div><p>Correctly recognizing the behaviors of children with Autism Spectrum Disorder (ASD) is of vital importance for the diagnosis of Autism and timely early intervention. However, the observation and recording during the treatment from the parents of autistic children may not be accurate and objective. In such cases, automatic recognition systems based on computer vision and machine learning (in particular deep learning) technology can alleviate this issue to a large extent. Existing human action recognition models can now achieve impressive performance on challenging activity datasets, e.g., daily activity, and sports activity. However, problem behaviors in children with ASD are very different from these general activities, and recognizing these problem behaviors via computer vision is less studied. In this paper, we first evaluate a strong baseline for action recognition, i.e., Video Swin Transformer, on two autism behaviors datasets (SSBD and ESBD) and show that it can achieve high accuracy and outperform the previous methods by a large margin, demonstrating the feasibility of vision-based problem behaviors recognition. Moreover, we propose language-assisted training to further enhance the action recognition performance. Specifically, we develop a two-branch multimodal deep learning framework by incorporating the ”freely available” language description for each type of problem behavior. Experimental results demonstrate that incorporating additional language supervision can bring an obvious performance boost for the autism problem behaviors recognition task as compared to using the video information only (i.e., 3.49% improvement on ESBD and 1.46% on SSBD). <em>Our code and model will be publicly available for reproducing the results.</em></p></div>","PeriodicalId":37151,"journal":{"name":"Smart Health","volume":"32 ","pages":"Article 100444"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart Health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352648323000727","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Health Professions","Score":null,"Total":0}
引用次数: 0

Abstract

Correctly recognizing the behaviors of children with Autism Spectrum Disorder (ASD) is of vital importance for the diagnosis of Autism and timely early intervention. However, the observation and recording during the treatment from the parents of autistic children may not be accurate and objective. In such cases, automatic recognition systems based on computer vision and machine learning (in particular deep learning) technology can alleviate this issue to a large extent. Existing human action recognition models can now achieve impressive performance on challenging activity datasets, e.g., daily activity, and sports activity. However, problem behaviors in children with ASD are very different from these general activities, and recognizing these problem behaviors via computer vision is less studied. In this paper, we first evaluate a strong baseline for action recognition, i.e., Video Swin Transformer, on two autism behaviors datasets (SSBD and ESBD) and show that it can achieve high accuracy and outperform the previous methods by a large margin, demonstrating the feasibility of vision-based problem behaviors recognition. Moreover, we propose language-assisted training to further enhance the action recognition performance. Specifically, we develop a two-branch multimodal deep learning framework by incorporating the ”freely available” language description for each type of problem behavior. Experimental results demonstrate that incorporating additional language supervision can bring an obvious performance boost for the autism problem behaviors recognition task as compared to using the video information only (i.e., 3.49% improvement on ESBD and 1.46% on SSBD). Our code and model will be publicly available for reproducing the results.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自闭症行为识别的语言辅助深度学习
正确识别自闭症谱系障碍(ASD)儿童的行为对于诊断自闭症和及时进行早期干预至关重要。然而,自闭症儿童家长在治疗过程中的观察和记录可能并不准确和客观。在这种情况下,基于计算机视觉和机器学习(尤其是深度学习)技术的自动识别系统可以在很大程度上缓解这一问题。目前,现有的人类动作识别模型可以在具有挑战性的活动数据集(如日常活动和体育活动)上实现令人印象深刻的性能。然而,ASD 儿童的问题行为与这些一般活动有很大不同,通过计算机视觉识别这些问题行为的研究较少。在本文中,我们首先在两个自闭症行为数据集(SSBD 和 ESBD)上评估了一个强大的动作识别基线,即视频 Swin Transformer,结果表明它可以达到很高的准确率,并在很大程度上优于之前的方法,证明了基于视觉的问题行为识别的可行性。此外,我们还提出了语言辅助训练,以进一步提高动作识别性能。具体来说,我们开发了一个双分支多模态深度学习框架,将 "可自由获取 "的语言描述纳入每种类型的问题行为中。实验结果表明,与仅使用视频信息相比,在自闭症问题行为识别任务中加入额外的语言监督能带来明显的性能提升(即在 ESBD 上提升 3.49%,在 SSBD 上提升 1.46%)。我们的代码和模型将公开发布,以便重现结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Smart Health
Smart Health Computer Science-Computer Science Applications
CiteScore
6.50
自引率
0.00%
发文量
81
期刊最新文献
Editorial Board Smart health practices: Strategies to improve healthcare efficiency through digital twin technology Human knowledge-based artificial intelligence methods for skin cancer management: Accuracy and interpretability study SAFE: Sound Analysis for Fall Event detection using machine learning Latent Space Representation of Adversarial AutoEncoder for Human Activity Recognition: Application to a low-cost commercial force plate and inertial measurement units
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1