人类自然图像的三维姿态识别。

CogSci ... Annual Conference of the Cognitive Science Society. Cognitive Science Society (U.S.). Conference Pub Date : 2021-07-01 DOI:10.1167/jov.21.9.1878

Hongru Zhu, A. Yuille, D. Kersten

{"title":"人类自然图像的三维姿态识别。","authors":"Hongru Zhu, A. Yuille, D. Kersten","doi":"10.1167/jov.21.9.1878","DOIUrl":null,"url":null,"abstract":"Perceiving 3D structure in natural images is an immense computational challenge for the visual system. While many previous studies focused on the perception of rigid 3D objects, we applied a novel method on a common set of non-rigid objects-static images of the human body in the natural world. We investigated to what extent human ability to interpret 3D poses in natural images depends on the typicality of the underlying 3D pose and the informativeness of the viewpoint. Using a novel 2AFC pose matching task, we measured how well subjects were able to match a target natural pose image with one of two comparison, synthetic body images from a different viewpoint-one was rendered with the same 3D pose parameters as the target while the other was a distractor rendered with added noises on joint angles. We found that performance for typical poses was measurably better than atypical poses; however, we found no significant difference between informative and less informative viewpoints. Further comparisons of 2D and 3D pose matching models on the same task showed that 3D body knowledge is particularly important when interpreting images of atypical poses. These results suggested that human ability to interpret 3D poses depends on pose typicality but not viewpoint informativeness, and that humans probably use prior knowledge of 3D pose structures.","PeriodicalId":72634,"journal":{"name":"CogSci ... Annual Conference of the Cognitive Science Society. Cognitive Science Society (U.S.). Conference","volume":"25 1","pages":"223-229"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Three-dimensional pose discrimination in natural images of humans.\",\"authors\":\"Hongru Zhu, A. Yuille, D. Kersten\",\"doi\":\"10.1167/jov.21.9.1878\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Perceiving 3D structure in natural images is an immense computational challenge for the visual system. While many previous studies focused on the perception of rigid 3D objects, we applied a novel method on a common set of non-rigid objects-static images of the human body in the natural world. We investigated to what extent human ability to interpret 3D poses in natural images depends on the typicality of the underlying 3D pose and the informativeness of the viewpoint. Using a novel 2AFC pose matching task, we measured how well subjects were able to match a target natural pose image with one of two comparison, synthetic body images from a different viewpoint-one was rendered with the same 3D pose parameters as the target while the other was a distractor rendered with added noises on joint angles. We found that performance for typical poses was measurably better than atypical poses; however, we found no significant difference between informative and less informative viewpoints. Further comparisons of 2D and 3D pose matching models on the same task showed that 3D body knowledge is particularly important when interpreting images of atypical poses. These results suggested that human ability to interpret 3D poses depends on pose typicality but not viewpoint informativeness, and that humans probably use prior knowledge of 3D pose structures.\",\"PeriodicalId\":72634,\"journal\":{\"name\":\"CogSci ... Annual Conference of the Cognitive Science Society. Cognitive Science Society (U.S.). Conference\",\"volume\":\"25 1\",\"pages\":\"223-229\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CogSci ... Annual Conference of the Cognitive Science Society. Cognitive Science Society (U.S.). Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1167/jov.21.9.1878\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CogSci ... Annual Conference of the Cognitive Science Society. Cognitive Science Society (U.S.). Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1167/jov.21.9.1878","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

感知自然图像中的三维结构对视觉系统来说是一个巨大的计算挑战。虽然许多先前的研究都集中在刚性3D物体的感知上，但我们将一种新颖的方法应用于一组常见的非刚性物体-自然世界中人体的静态图像。我们研究了人类在多大程度上解释自然图像中的3D姿势取决于底层3D姿势的典型性和视点的信息量。使用一种新颖的2AFC姿态匹配任务，我们测量了受试者能够将目标自然姿态图像与两种比较中的一种进行匹配的程度，从不同的视点合成的身体图像-一种是用与目标相同的3D姿态参数渲染的，而另一种是在关节角度上添加噪声渲染的干扰物。我们发现典型姿势的表现明显优于非典型姿势;然而，我们发现信息丰富和信息较少的观点之间没有显著差异。对同一任务的2D和3D姿势匹配模型的进一步比较表明，在解释非典型姿势图像时，3D身体知识尤为重要。这些结果表明，人类解释3D姿势的能力取决于姿势的典型性，而不是视点信息性，人类可能使用了对3D姿势结构的先验知识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Three-dimensional pose discrimination in natural images of humans.

Perceiving 3D structure in natural images is an immense computational challenge for the visual system. While many previous studies focused on the perception of rigid 3D objects, we applied a novel method on a common set of non-rigid objects-static images of the human body in the natural world. We investigated to what extent human ability to interpret 3D poses in natural images depends on the typicality of the underlying 3D pose and the informativeness of the viewpoint. Using a novel 2AFC pose matching task, we measured how well subjects were able to match a target natural pose image with one of two comparison, synthetic body images from a different viewpoint-one was rendered with the same 3D pose parameters as the target while the other was a distractor rendered with added noises on joint angles. We found that performance for typical poses was measurably better than atypical poses; however, we found no significant difference between informative and less informative viewpoints. Further comparisons of 2D and 3D pose matching models on the same task showed that 3D body knowledge is particularly important when interpreting images of atypical poses. These results suggested that human ability to interpret 3D poses depends on pose typicality but not viewpoint informativeness, and that humans probably use prior knowledge of 3D pose structures.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

CogSci ... Annual Conference of the Cognitive Science Society. Cognitive Science Society (U.S.). Conference

自引率

0.00%

发文量