The mid-level vision toolbox for computing structural properties of real-world images

Frontiers in Computer Science (IF 2.4, Q3, Computer Science, Interdisciplinary Applications) | Pub Date: 2023-09-13 | DOI: 10.3389/fcomp.2023.1140723
Dirk B. Walther, Delaram Farzanfar, Seohee Han, Morteza Rezanejad
Citations: 3

Abstract

Mid-level vision is the intermediate visual processing stage for generating representations of shapes and partial geometries of objects. Our mechanistic understanding of these operations is limited, in part, by a lack of computational tools for analyzing image properties at these levels of representation. We introduce the Mid-Level Vision (MLV) Toolbox, an open-source software package that automatically processes low- and mid-level contour features and perceptual grouping cues from real-world images. The MLV Toolbox takes vectorized line drawings of scenes as input and extracts structural contour properties. We also include tools for contour detection and tracing for the automatic generation of vectorized line drawings from photographs. Various statistical properties of the contours are computed: the distributions of orientations, contour curvature, and contour lengths, as well as counts and types of contour junctions. The toolbox includes an efficient algorithm for computing the medial axis transform of contour drawings and photographs. Based on the medial axis transform, we compute several scores for local mirror symmetry, local parallelism, and local contour separation. All properties are summarized in histograms that can serve as input to statistical models that relate image properties to human behavioral measures, such as esthetic pleasure, memorability, affective processing, and scene categorization. In addition to measuring contour properties, we include functions for manipulating drawings by separating contours according to their statistical properties, randomly shifting contours, or rotating drawings behind a circular aperture. Finally, the MLV Toolbox offers visualization functions for contour orientations, lengths, curvature, junctions, and medial axis properties on computer-generated and artist-generated line drawings. We include artist-generated vectorized drawings of the Toronto Scenes image set, the International Affective Picture System, and the Snodgrass and Vanderwart object images, as well as automatically traced vectorized drawings of a set of architectural scenes and the Open Affective Standardized Image Set (OASIS).
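
To make the contour statistics named in the abstract more concrete, here is a minimal Python sketch of how length, a length-weighted orientation histogram, and a discrete curvature estimate might be computed for a single vectorized contour. It is an illustration under stated assumptions and does not reproduce the MLV Toolbox's code or API: the function name `contour_stats`, the `n_bins` parameter, the polyline representation as a list of (x, y) vertices, and the particular curvature definition (turning angle divided by mean adjacent segment length) are all hypothetical choices made for this example.

```python
import math

def contour_stats(points, n_bins=8):
    """Summarize one polyline contour given as a list of (x, y) vertices.

    Returns (total_length, orientation_histogram, curvatures).
    All names here are illustrative, not the MLV Toolbox API.
    """
    # Build segments, dropping zero-length ones so angles stay defined.
    segs = []
    for (x0, y0), (x1, y1) in zip(points[:-1], points[1:]):
        length = math.hypot(x1 - x0, y1 - y0)
        if length > 0:
            direction = math.degrees(math.atan2(y1 - y0, x1 - x0))
            segs.append((length, direction))

    total_length = sum(length for length, _ in segs)

    # Orientation is axial (a line at 10 deg equals one at 190 deg), so fold
    # directions into [0, 180) and weight each bin by segment length.
    bin_width = 180.0 / n_bins
    hist = [0.0] * n_bins
    for length, direction in segs:
        orientation = direction % 180.0
        hist[int(orientation / bin_width) % n_bins] += length

    # Discrete curvature (assumed definition): absolute turning angle at each
    # interior vertex divided by the mean length of the two adjacent segments.
    curvatures = []
    for (len0, dir0), (len1, dir1) in zip(segs[:-1], segs[1:]):
        turn = (dir1 - dir0 + 180.0) % 360.0 - 180.0
        curvatures.append(abs(turn) / ((len0 + len1) / 2.0))

    return total_length, hist, curvatures


# An L-shaped contour: 10 units to the right, then 10 units along +y.
length, hist, curv = contour_stats([(0, 0), (10, 0), (10, 10)], n_bins=4)
```

Weighting the histogram by segment length, rather than counting segments, keeps the summary independent of how finely a contour happens to be sampled. Per-image histograms of this kind could then be used as predictors in a statistical model, as the abstract describes.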
Source journal: Frontiers in Computer Science (Computer Science, Interdisciplinary Applications)
CiteScore: 4.30
Self-citation rate: 0.00%
Articles published: 152
Review time: 13 weeks