{"title":"Looking at People","authors":"D. Forsyth","doi":"10.1109/CRV.2005.52","DOIUrl":null,"url":null,"abstract":"There is a great need for programs that can describe what people are doing from video. This is difficult to do, because it is hard to identify and track people in video sequences, because we have no canonical vocabulary for describing what people are doing, and because the interpretation of what people are doing depends very strongly on what is nearby. Tracking is hard, because it is important to track relatively small structures that can move relatively fast for example, lower arms. I will describe research into kinematic tracking tracking that reports the kinematic configuration of the body that has resulted in a fairly accurate, fully automatic tracker, that can keep track of multiple people. Once one has tracked the body, one must interpret the results. One way to do so is to have a motion synthesis system that takes the track, and produces a motion that is (a) like a human motion and (b) close to the track. Our work has produced a high-quality motion synthesis system that can produce motions that look very much like human activities. I will describe work that couples that system with a tracker to produce a description of the activities, entirely automatically. I will speculate on some of the many open problems. What should one report? How do nearby objects affect one’s interpretation of activities? How can one interpret patterns of behavior?","PeriodicalId":307318,"journal":{"name":"The 2nd Canadian Conference on Computer and Robot Vision (CRV'05)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2nd Canadian Conference on Computer and Robot Vision (CRV'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CRV.2005.52","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

There is a great need for programs that can describe what people are doing from video. This is difficult to do, because it is hard to identify and track people in video sequences, because we have no canonical vocabulary for describing what people are doing, and because the interpretation of what people are doing depends very strongly on what is nearby. Tracking is hard, because it is important to track relatively small structures that can move relatively fast for example, lower arms. I will describe research into kinematic tracking tracking that reports the kinematic configuration of the body that has resulted in a fairly accurate, fully automatic tracker, that can keep track of multiple people. Once one has tracked the body, one must interpret the results. One way to do so is to have a motion synthesis system that takes the track, and produces a motion that is (a) like a human motion and (b) close to the track. Our work has produced a high-quality motion synthesis system that can produce motions that look very much like human activities. I will describe work that couples that system with a tracker to produce a description of the activities, entirely automatically. I will speculate on some of the many open problems. What should one report? How do nearby objects affect one’s interpretation of activities? How can one interpret patterns of behavior?
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
看别人
人们非常需要能够从视频中描述人们正在做什么的程序。这是很难做到的,因为很难在视频序列中识别和跟踪人,因为我们没有规范的词汇来描述人们在做什么,因为对人们正在做什么的解释很大程度上取决于附近的东西。追踪是很困难的,因为追踪相对较小的结构是很重要的,这些结构可以移动得相对较快,比如,下臂。我将描述对运动学跟踪的研究,运动学跟踪报告了身体的运动学配置,这导致了一个相当准确的,全自动的跟踪器,可以跟踪多个人。一旦追踪到尸体,就必须解释结果。一种方法是使用一个运动合成系统,该系统采用轨迹,并产生一个(a)像人类运动和(b)接近轨迹的运动。我们的工作已经产生了一个高质量的运动合成系统,可以产生看起来非常像人类活动的运动。我将描述将该系统与跟踪器结合起来以完全自动地产生活动描述的工作。我将推测一些悬而未决的问题。应该报告什么?附近的物体如何影响一个人对活动的解释?人们如何解释行为模式?
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Head pose estimation of partially occluded faces Minimum Bayes error features for visual recognition by sequential feature selection and extraction Using vanishing points to correct camera rotation in images Dry granular flows need special tools Body tracking in human walk from monocular video sequences
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1