Detecting aggression in clinical treatment videos
Walker S. Arce, Seth G. Walker, Jordan DeBrine, Benjamin S. Riggan, James E. Gehringer
Machine Learning with Applications, Volume 14, Article 100515. Published 2023-11-22.
DOI: 10.1016/j.mlwa.2023.100515
Open-access PDF: https://www.sciencedirect.com/science/article/pii/S2666827023000683
Citations: 0
Abstract
Many clinical spaces are outfitted with centralized video recording systems to monitor patient–client interactions. Given the increasing interest in video-based machine learning methods, the potential of using these clinical recordings to automate observational data collection is apparent. To explore this, seven patients had videos of their functional assessment and treatment sessions annotated by coders trained by our clinical team. Commonly used clinical software has inherent limitations in aligning behavioral and video data, so a custom software tool was employed to address this functionality gap. After developing a Canvas-based coder training course for this tool, a team of six trained coders annotated 82.33 h of data. Two machine learning approaches were considered, both using a convolutional neural network as a video feature extractor. The first approach used a recurrent network as the classifier on the extracted features and the second used a Transformer architecture. Both models produced promising metrics indicating that detecting aggression from clinical videos is feasible and generalizable. Model performance is directly tied to the feature extractor’s performance on ImageNet, where ConvNeXtXL produced the best performing models. This has applications in automating patient incident response to improve patient and clinician safety and could be directly integrated into existing video management systems for real-time analysis.
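The two-stage pipeline the abstract describes (a per-frame CNN feature extractor feeding a sequence classifier) can be sketched as follows. This is not the authors' code: the layer sizes, the GRU head, and the tiny stand-in backbone are illustrative assumptions; the paper's best results used a pretrained ConvNeXt-XL as the extractor, which would replace `FrameFeatureExtractor` here.

```python
# Hedged sketch of a CNN-then-RNN aggression detector over video clips.
# The real system would swap in a pretrained backbone (e.g. ConvNeXt-XL);
# a tiny CNN is used so the example runs quickly without downloads.
import torch
import torch.nn as nn


class FrameFeatureExtractor(nn.Module):
    """Placeholder for a pretrained image backbone such as ConvNeXt-XL."""

    def __init__(self, feat_dim: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, feat_dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, 3, H, W) -> (batch, feat_dim)
        return self.net(x)


class AggressionDetector(nn.Module):
    """CNN features per frame -> GRU over time -> one binary logit per clip."""

    def __init__(self, feat_dim: int = 64, hidden: int = 32):
        super().__init__()
        self.extractor = FrameFeatureExtractor(feat_dim)
        self.rnn = nn.GRU(feat_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        # clip: (batch, time, 3, H, W)
        b, t = clip.shape[:2]
        feats = self.extractor(clip.flatten(0, 1))  # (b*t, feat_dim)
        feats = feats.view(b, t, -1)                # (b, t, feat_dim)
        _, last = self.rnn(feats)                   # last hidden: (1, b, hidden)
        return self.head(last[-1])                  # (b, 1) logit


model = AggressionDetector()
logit = model(torch.randn(2, 8, 3, 64, 64))  # 2 clips of 8 frames each
print(logit.shape)
```

The second approach in the paper replaces the GRU with a Transformer encoder over the same frame features; in this sketch that would mean substituting `nn.TransformerEncoder` for `self.rnn` and pooling its outputs before the classification head.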