LaBelle:一个深度学习应用程序,帮助你学习芭蕾

Sarah Fan, Kevin Guo, Yu Sun
{"title":"LaBelle:一个深度学习应用程序,帮助你学习芭蕾","authors":"Sarah Fan, Kevin Guo, Yu Sun","doi":"10.5121/csit.2022.121714","DOIUrl":null,"url":null,"abstract":"Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.","PeriodicalId":170432,"journal":{"name":"Signal & Image Processing Trends","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LaBelle: A Deep Learning APP that Helps You Learn Ballet\",\"authors\":\"Sarah Fan, Kevin Guo, Yu Sun\",\"doi\":\"10.5121/csit.2022.121714\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.\",\"PeriodicalId\":170432,\"journal\":{\"name\":\"Signal & Image Processing Trends\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal & Image Processing Trends\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5121/csit.2022.121714\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal & Image Processing Trends","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/csit.2022.121714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

人体姿势估计已被证明在改善医疗保健,体育等现实世界应用中的多功能性[1]。正确的站位、姿势和动作是这些活动成功的关键。本文将解释深度学习移动芭蕾应用LaBelle[2]背后的研究过程。LaBelle拍摄了两个短视频:一个是老师的,一个是学生的。该应用程序利用MediaPipe Pose来识别、分析和存储两位舞者的姿势和动作数据,计算不同关节和主要身体部位之间产生的角度。该应用的AI模型使用K-means聚类算法为学生数据集和教师数据集创建一组聚类[3]。使用这两组聚类,LaBelle识别学生视频中的关键帧,并在教师聚类集中搜索匹配的属性和帧集。它会评估配对帧之间的差异,并产生最终分数以及需要改进的姿势的反馈。我们提出了一种无监督的引导学习方法,提高了视频比较的效率,这种方法通常既费时又耗资源。这种有效的模型不仅可以用于舞蹈,也可以用于体育和医学(物理治疗之类的活动),在这些领域,姿势、形式和动作通常很难用肉眼追踪。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
LaBelle: A Deep Learning APP that Helps You Learn Ballet
Human Pose Estimation has proven versatility in improving real-world applications in healthcare, sports, etc. [1]. Proper stance, form and movement is instrumental to succeeding in these activities. This paper will explain the research process behind the deep learning mobile ballet app, LaBelle [2]. LaBelle takes in two short videos: one of a teacher, and one of a student. Utilizing MediaPipe Pose to identify, analyze, and store data about the poses and movements of both dancers, the app calculates the angles created between different joints and major body parts. The app’s AI Model uses a K-means clustering algorithm to create a group of clusters for both the student dataset and the teacher dataset [3]. Using the two sets of clusters, LaBelle identifies the key frames in the student-video and searches the teacher cluster set for a matching set of properties and frames. It evaluates the differences between the paired frames and produces a final score as well as feedback on the poses that need improving. We propose an unsupervised guided-learning approach with improved efficiency in video comparison, which is usually both time and resource consuming. This efficient model can be used not just in dance, but athletics and medicine (physical therapy like activities) as well, where stance, form, and movements are often hard to track with the naked eye.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
ComputerBank: A Community-based Computer Donation Platform using Machine Learning and NFT A Gradient Descent Inspired Approach to Optimization of Physics Question LaBelle: A Deep Learning APP that Helps You Learn Ballet Prediction and Key Characteristics of All-Cause Mortality in Maintenance Hemodialysis Patients Using Singular Value Decomposition in a Convolutional Neural Network to Improve Brain Tumor Segmentation Accuracy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1