一种用于空间图像处理的中心环绕框架

Retinex at 50 Pub Date : 2016-02-14 DOI:10.2352/ISSN.2470-1173.2016.6.RETINEX-020

Vassilios Vonikakis, Stefan Winkler

{"title":"一种用于空间图像处理的中心环绕框架","authors":"Vassilios Vonikakis, Stefan Winkler","doi":"10.2352/ISSN.2470-1173.2016.6.RETINEX-020","DOIUrl":null,"url":null,"abstract":"This paper presents a computational framework inspired by the center-surround antagonistic receptive fields of the human visual system. It demonstrates that, starting from the actual pixel value (center) and a low-pass estimation of the pixel’s neighborhood (surround) and using a mapping function inspired by the shunting inhibition mechanism, some widely used spatial image processing techniques can be implemented, including adaptive tone-mapping, local contrast enhancement, text binarization and local feature detection. As a result, it highlights the relations of these seemingly different applications with the early stages of the human visual system and draws insights about their characteristics. Introduction Center-surround antagonistic Receptive Fields (RFs) are abundant in the Human Visual System (HVS). They have been found in many areas, such as the retina, the Lateral Geniculate Nucleus, V1 or in higher visual areas. It seems that this is a typical strategy that the HVS employs for local signal comparisons, not only in vision but in other sensory areas as well. The RFs of center-surround cells comprise two separate concentric regions sampling the photoreceptor mosaic (namely the center and the surround) that act antagonistically on the final output of the cell. ON center-surround cells exhibit increased output with higher photoreceptor activity on their center and decreased output with increased activity on their surround. Conversely, for OFF center-surround cells, higher photoreceptor activity on the center has a negative impact on their output, whereas, increased photoreceptor activity on the surround increases their output. The size of the two regions defines the spatial frequency of sampling: smaller RF sizes sample finer details from the photoreceptor mosaic, while larger sizes encode coarser scales of the same signal. Center-surround cells are essentially a biological implementation of spatial filtering. Spatial filtering is a very broad term, encompassing any kind of filtering operations that depend on the local content of the signal and are not globally constant. Almost all existing image processing and computational photography techniques include some kind of spatial image processing. Modern denoising, local contrast enhancement, scale decomposition, exposure fusion, HDR tone mapping are some of them. Most of these methods have some common grounds with the basic computational models of the early stages of the HVS. However these similarities are not always so evident. In this paper, we start from the computational model of the first stages of HVS, developed by Grossberg [24], and we adapt it for image processing operations. Explicitly modeling HVS is out of the scope of this paper. We rather draw inspiration from it in order to address real-world imaging problems. More specifically, we define a framework, inspired by Grossberg’s theory, that describes center-surround signal interactions. We show that such a framework can give rise to existing spatial image processing techniques, as many of them are special cases of it. This gives a more unified view between image processing and biological vision models, highlighting their common ground and showing other potential applications that can be developed. Modeling Center-Surround RFs Traditionally, center-surround RFs have been modeled as Difference of Gaussians (DoG) [13]. This linear operator essentially approximates the Laplacian operator, by subtracting two Gaussians of different sigmas, centered in the same position. DoG is at the heart of many computer vision and image processing algorithms, such as edge detection [12], scale-space construction [1] and local feature detectors [11]. Contrary to the linear response of the DoG operator though, the center-surround cells of the HVS exhibit non-linear response in regards to their inputs. Interestingly, their nonlinear response is thought to contribute to illumination invariance and contrast enhancement [24]. According to the standard retinal model [6, 21], the output Vi j of an ON-center OFF-surround cell at grid position (i, j), obeying the membrane equations of physiology is given by dVi j (t) dt = gleak ( Vrest −Vi j ) +Ci j ( Eex−Vi j ) +Si j ( Einh−Vi j )","PeriodicalId":326060,"journal":{"name":"Retinex at 50","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A center-surround framework for spatial image processing\",\"authors\":\"Vassilios Vonikakis, Stefan Winkler\",\"doi\":\"10.2352/ISSN.2470-1173.2016.6.RETINEX-020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a computational framework inspired by the center-surround antagonistic receptive fields of the human visual system. It demonstrates that, starting from the actual pixel value (center) and a low-pass estimation of the pixel’s neighborhood (surround) and using a mapping function inspired by the shunting inhibition mechanism, some widely used spatial image processing techniques can be implemented, including adaptive tone-mapping, local contrast enhancement, text binarization and local feature detection. As a result, it highlights the relations of these seemingly different applications with the early stages of the human visual system and draws insights about their characteristics. Introduction Center-surround antagonistic Receptive Fields (RFs) are abundant in the Human Visual System (HVS). They have been found in many areas, such as the retina, the Lateral Geniculate Nucleus, V1 or in higher visual areas. It seems that this is a typical strategy that the HVS employs for local signal comparisons, not only in vision but in other sensory areas as well. The RFs of center-surround cells comprise two separate concentric regions sampling the photoreceptor mosaic (namely the center and the surround) that act antagonistically on the final output of the cell. ON center-surround cells exhibit increased output with higher photoreceptor activity on their center and decreased output with increased activity on their surround. Conversely, for OFF center-surround cells, higher photoreceptor activity on the center has a negative impact on their output, whereas, increased photoreceptor activity on the surround increases their output. The size of the two regions defines the spatial frequency of sampling: smaller RF sizes sample finer details from the photoreceptor mosaic, while larger sizes encode coarser scales of the same signal. Center-surround cells are essentially a biological implementation of spatial filtering. Spatial filtering is a very broad term, encompassing any kind of filtering operations that depend on the local content of the signal and are not globally constant. Almost all existing image processing and computational photography techniques include some kind of spatial image processing. Modern denoising, local contrast enhancement, scale decomposition, exposure fusion, HDR tone mapping are some of them. Most of these methods have some common grounds with the basic computational models of the early stages of the HVS. However these similarities are not always so evident. In this paper, we start from the computational model of the first stages of HVS, developed by Grossberg [24], and we adapt it for image processing operations. Explicitly modeling HVS is out of the scope of this paper. We rather draw inspiration from it in order to address real-world imaging problems. More specifically, we define a framework, inspired by Grossberg’s theory, that describes center-surround signal interactions. We show that such a framework can give rise to existing spatial image processing techniques, as many of them are special cases of it. This gives a more unified view between image processing and biological vision models, highlighting their common ground and showing other potential applications that can be developed. Modeling Center-Surround RFs Traditionally, center-surround RFs have been modeled as Difference of Gaussians (DoG) [13]. This linear operator essentially approximates the Laplacian operator, by subtracting two Gaussians of different sigmas, centered in the same position. DoG is at the heart of many computer vision and image processing algorithms, such as edge detection [12], scale-space construction [1] and local feature detectors [11]. Contrary to the linear response of the DoG operator though, the center-surround cells of the HVS exhibit non-linear response in regards to their inputs. Interestingly, their nonlinear response is thought to contribute to illumination invariance and contrast enhancement [24]. According to the standard retinal model [6, 21], the output Vi j of an ON-center OFF-surround cell at grid position (i, j), obeying the membrane equations of physiology is given by dVi j (t) dt = gleak ( Vrest −Vi j ) +Ci j ( Eex−Vi j ) +Si j ( Einh−Vi j )\",\"PeriodicalId\":326060,\"journal\":{\"name\":\"Retinex at 50\",\"volume\":\"85 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-02-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Retinex at 50\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2352/ISSN.2470-1173.2016.6.RETINEX-020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Retinex at 50","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2352/ISSN.2470-1173.2016.6.RETINEX-020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本文提出了一个受人类视觉系统的中心-环绕对抗性感受野启发的计算框架。研究表明，从实际像素值(中心)和像素邻域(环绕)的低通估计出发，利用受分流抑制机制启发的映射函数，可以实现一些广泛使用的空间图像处理技术，包括自适应色调映射、局部对比度增强、文本二值化和局部特征检测。因此，它突出了这些看似不同的应用与人类视觉系统早期阶段的关系，并得出了对其特征的见解。人类视觉系统(HVS)中存在丰富的中心-环绕对抗性感受野(RFs)。它们存在于许多区域，如视网膜、膝状外侧核、V1或更高的视觉区域。这似乎是HVS用于局部信号比较的典型策略，不仅在视觉方面，而且在其他感官领域也是如此。中心-环绕细胞的rf包括两个独立的同心圆区域，采样光感受器马赛克(即中心和环绕)，它们对细胞的最终输出起拮抗作用。ON中心环绕细胞的输出随着中心感光细胞活性的增加而增加，而输出随着周围感光细胞活性的增加而减少。相反，对于OFF中心-环绕细胞，中心较高的光感受器活性会对其输出产生负面影响，而周围光感受器活性的增加则会增加其输出。两个区域的大小决定了采样的空间频率:较小的RF尺寸从光感受器马赛克中采样更精细的细节，而较大的RF尺寸编码相同信号的较粗尺度。中心环绕细胞本质上是空间滤波的生物实现。空间滤波是一个非常广泛的术语，包括依赖于信号的局部内容而不是全局恒定的任何类型的滤波操作。几乎所有现有的图像处理和计算摄影技术都包括某种形式的空间图像处理。现代去噪、局部对比度增强、尺度分解、曝光融合、HDR色调映射就是其中的一些方法。这些方法大多与HVS早期阶段的基本计算模型有一些共同点。然而，这些相似之处并不总是那么明显。在本文中，我们从Grossberg[24]开发的HVS第一阶段的计算模型开始，并将其应用于图像处理操作。对HVS进行显式建模超出了本文的研究范围。我们宁愿从中汲取灵感，以解决现实世界的成像问题。更具体地说，我们定义了一个框架，受格罗斯伯格理论的启发，描述了中心环绕信号的相互作用。我们表明，这样的框架可以产生现有的空间图像处理技术，因为它们中的许多都是它的特殊情况。这在图像处理和生物视觉模型之间提供了一个更统一的视图，突出了它们的共同点，并展示了其他可以开发的潜在应用。传统上，中心环绕rf被建模为高斯差分(DoG)[13]。这个线性算子本质上近似于拉普拉斯算子，通过减去两个不同西格玛的高斯算子，以同一位置为中心。DoG是许多计算机视觉和图像处理算法的核心，如边缘检测[12]、尺度空间构建[1]和局部特征检测器[11]。与DoG算子的线性响应相反，HVS的中心环绕单元在其输入方面表现出非线性响应。有趣的是，它们的非线性响应被认为有助于照明不变性和对比度增强[24]。根据标准视网膜模型[6,21]，网格位置(i, j) ON-center - OFF-surround细胞的输出Vi j服从膜生理学方程为:dVi j (t) dt = gleak (Vrest−Vi j) +Ci j (Eex−Vi j) +Si j (Einh−Vi j)

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A center-surround framework for spatial image processing

This paper presents a computational framework inspired by the center-surround antagonistic receptive fields of the human visual system. It demonstrates that, starting from the actual pixel value (center) and a low-pass estimation of the pixel’s neighborhood (surround) and using a mapping function inspired by the shunting inhibition mechanism, some widely used spatial image processing techniques can be implemented, including adaptive tone-mapping, local contrast enhancement, text binarization and local feature detection. As a result, it highlights the relations of these seemingly different applications with the early stages of the human visual system and draws insights about their characteristics. Introduction Center-surround antagonistic Receptive Fields (RFs) are abundant in the Human Visual System (HVS). They have been found in many areas, such as the retina, the Lateral Geniculate Nucleus, V1 or in higher visual areas. It seems that this is a typical strategy that the HVS employs for local signal comparisons, not only in vision but in other sensory areas as well. The RFs of center-surround cells comprise two separate concentric regions sampling the photoreceptor mosaic (namely the center and the surround) that act antagonistically on the final output of the cell. ON center-surround cells exhibit increased output with higher photoreceptor activity on their center and decreased output with increased activity on their surround. Conversely, for OFF center-surround cells, higher photoreceptor activity on the center has a negative impact on their output, whereas, increased photoreceptor activity on the surround increases their output. The size of the two regions defines the spatial frequency of sampling: smaller RF sizes sample finer details from the photoreceptor mosaic, while larger sizes encode coarser scales of the same signal. Center-surround cells are essentially a biological implementation of spatial filtering. Spatial filtering is a very broad term, encompassing any kind of filtering operations that depend on the local content of the signal and are not globally constant. Almost all existing image processing and computational photography techniques include some kind of spatial image processing. Modern denoising, local contrast enhancement, scale decomposition, exposure fusion, HDR tone mapping are some of them. Most of these methods have some common grounds with the basic computational models of the early stages of the HVS. However these similarities are not always so evident. In this paper, we start from the computational model of the first stages of HVS, developed by Grossberg [24], and we adapt it for image processing operations. Explicitly modeling HVS is out of the scope of this paper. We rather draw inspiration from it in order to address real-world imaging problems. More specifically, we define a framework, inspired by Grossberg’s theory, that describes center-surround signal interactions. We show that such a framework can give rise to existing spatial image processing techniques, as many of them are special cases of it. This gives a more unified view between image processing and biological vision models, highlighting their common ground and showing other potential applications that can be developed. Modeling Center-Surround RFs Traditionally, center-surround RFs have been modeled as Difference of Gaussians (DoG) [13]. This linear operator essentially approximates the Laplacian operator, by subtracting two Gaussians of different sigmas, centered in the same position. DoG is at the heart of many computer vision and image processing algorithms, such as edge detection [12], scale-space construction [1] and local feature detectors [11]. Contrary to the linear response of the DoG operator though, the center-surround cells of the HVS exhibit non-linear response in regards to their inputs. Interestingly, their nonlinear response is thought to contribute to illumination invariance and contrast enhancement [24]. According to the standard retinal model [6, 21], the output Vi j of an ON-center OFF-surround cell at grid position (i, j), obeying the membrane equations of physiology is given by dVi j (t) dt = gleak ( Vrest −Vi j ) +Ci j ( Eex−Vi j ) +Si j ( Einh−Vi j )

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Retinex at 50

自引率

0.00%

发文量

期刊最新文献

The Role of Lightness Perception in Determining the Perceived Contrast of Real World Scenes The Oriented Difference-of-Gaussians Model of Brightness Perception Designator Retinex, Milano Retinex and the locality issue A generalized white-patch model for fast color cast detection in natural images Statistical Aspects of Space Sampling in Retinex models