Several numerical algorithms have been developed in the literature and employed for curve reconstruction. However, these techniques are developed within the discrete setting, namely the super-resolved image is defined on a finer grid than the observed images. Conversely, off-the-grid (or gridless) optimisation does not rely on a fine grid and offers a tractable theoretical and numerical framework. In this work, we present a gridless method accounting for the reconstruction of both open and closed curves, based on the latest theoretical developments in off-the-grid curve reconstruction. This paper also shows Γ-convergence results of the discretised surrogate functional towards the continuous energy we coined CROC.
Title: A Γ-Convergence Result and an Off-the-Grid Charge Algorithm for Curve Reconstruction in Inverse Problems
Authors: Bastien Laville, Laure Blanc-Féraud, Gilles Aubert
Pub Date: 2024-05-07. DOI: 10.1007/s10851-024-01190-1
Pub Date: 2024-04-29. DOI: 10.1007/s10851-024-01186-x
Paolo Piras, Nicolas Guigui, Valerio Varano
Comparing the deformations of different beating hearts is a challenging task. Since in clinical practice an impaired condition is often recognised from (local and global) deformation parameters, the deformation of the heart during one beat can be compared among different individuals in the same ordination space more effectively if initial inter-individual form (shape + size) differences are filtered out. This is even more true when the shape of the cardiac trajectory itself is under consideration. This need is met by applying a geometric machinery named "parallel transport" in the field of differential geometry. In recent years several parallel transport methods have been applied to cardiological data acquired via echocardiography, CT scan or magnetic resonance. Concomitantly, some efforts have been made to compare different parallel transport algorithms on a variety of toy examples and real deformational data. Here we face the problem of comparing the heavily used LDDMM parallel transport with the recently proposed Riemannian "TPS space" in the context of the deformation of the right ventricle. Using local tensor diagnostics and global energy-based and shape distance-based parameters, we explored how well original deformations are maintained in transported data for four systo-diastolic deformations belonging to one healthy subject and three individuals affected by tetralogy of Fallot, atrial septal defect and pulmonary hypertension. We did the same in a larger dataset relative to the left ventricle of 82 healthy subjects and 21 patients affected by hypertrophic cardiomyopathy. In particular, we contrasted the TPS space with classic LDDMM and a modified LDDMM able to manage spherical differences. Our results point toward a clear superiority of the TPS space over classic LDDMM; the modified LDDMM performs similarly, as it better maintains the chosen diagnostics.
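Parallel transport is easiest to picture on the unit sphere, where it has a closed form along great-circle geodesics. The sketch below illustrates the concept only — the LDDMM and TPS-space transports compared in the paper operate on far richer shape spaces — and the function names are ours:

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sphere_parallel_transport(p, q, v):
    # closed-form parallel transport on the unit sphere: move the tangent
    # vector v at p to q along the minimizing great-circle geodesic
    c = dot(p, q)
    if abs(1.0 + c) < 1e-12:
        raise ValueError("antipodal points: transport is not unique")
    s = dot(v, q) / (1.0 + c)
    return [vi - s * (pi + qi) for vi, pi, qi in zip(v, p, q)]
```

Transporting the geodesic's own direction from e1 to e2 yields −e1 (the rotated geodesic velocity), while the normal direction e3 is carried unchanged — transport preserves lengths and angles.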
Title: Comparison of Different Parallel Transport Methods for the Study of Deformations in 3D Cardiac Data
Pub Date: 2024-04-25. DOI: 10.1007/s10851-024-01189-8
Snehal Bhayani, Janne Heikkilä, Zuzana Kukelova
Title: Correction: Sparse Resultant-Based Minimal Solvers in Computer Vision and Their Connection with the Action Matrix
Pub Date: 2024-04-23. DOI: 10.1007/s10851-024-01183-0
Ariane Fazeny, Daniel Tenbrinck, Kseniia Lukin, Martin Burger
The aim of this paper is to revisit the definition of differential operators on hypergraphs, which are a natural extension of graphs in systems based on interactions beyond pairs. In particular, we focus on the definition of Laplacian and p-Laplace operators for oriented and unoriented hypergraphs, their basic properties, variational structure, and their scale spaces. We illustrate that diffusion equations on hypergraphs are possible models for different applications such as information flow on social networks or image processing. Moreover, the spectral analysis and scale spaces induced by these operators provide a potential method to further analyze complex data and their multiscale structure. The quest for spectral analysis and suitable scale spaces on hypergraphs motivates in particular a definition of differential operators with trivial first eigenfunction and thus more interpretable second eigenfunctions. This property is not automatically satisfied in existing definitions of hypergraph p-Laplacians, and we hence provide a novel axiomatic approach that extends previous definitions and can be specialized to satisfy such (or other) desired properties.
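For readers unfamiliar with hypergraph Laplacians, one standard unoriented construction (a clique-style expansion, L = D_v − H W D_e⁻¹ Hᵀ; the paper's axiomatic operators generalise beyond it) already exhibits the trivial-first-eigenfunction property emphasised above, since every row of L sums to zero. A minimal sketch:

```python
def hypergraph_laplacian(n, hyperedges, weights):
    # L = D_v - H W D_e^{-1} H^T for an unoriented hypergraph: each
    # hyperedge e of weight w adds w to the diagonal of its vertices and
    # subtracts w/|e| for every ordered vertex pair of e (including i = j)
    L = [[0.0] * n for _ in range(n)]
    for e, w in zip(hyperedges, weights):
        d = len(e)
        for i in e:
            L[i][i] += w
            for j in e:
                L[i][j] -= w / d
    return L

def diffusion_step(L, u, tau):
    # one explicit Euler step of the diffusion equation du/dt = -L u
    return [ui - tau * sum(Lij * uj for Lij, uj in zip(row, u))
            for row, ui in zip(L, u)]

# toy hypergraph on 4 vertices with one 3-vertex and one 2-vertex hyperedge
L = hypergraph_laplacian(4, [(0, 1, 2), (2, 3)], [1.0, 2.0])
```

Because the rows (and, by symmetry, columns) of L sum to zero, constants lie in its kernel and diffusion preserves the mean of u — the property the abstract ties to interpretable second eigenfunctions.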
Title: Hypergraph p-Laplacians and Scale Spaces
The joint problem of reconstruction/feature extraction is a challenging task in image processing. It consists in jointly restoring an image and extracting its features. In this work, we first propose a novel non-smooth and non-convex variational formulation of the problem. For this purpose, we introduce a versatile generalised Gaussian prior whose parameters, including its exponent, are space-variant. Second, we design an alternating proximal-based optimisation algorithm that efficiently exploits the structure of the proposed non-convex objective function. We also analyse the convergence of this algorithm. As shown in numerical experiments conducted on joint deblurring/segmentation tasks, the proposed method provides high-quality results.
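Pixel-wise, a space-variant generalised Gaussian prior reduces to a weighted sum of |x_i|^{p_i} terms with per-pixel scale and exponent. A toy sketch under that reading (names ours, not the paper's notation):

```python
def gg_penalty(x, alpha, p):
    # space-variant generalised Gaussian prior: sum_i alpha_i * |x_i| ** p_i
    # with per-pixel scale alpha_i > 0 and exponent p_i > 0; p_i = 2 acts as
    # a smooth Gaussian prior, while p_i <= 1 promotes sparsity and makes
    # the term non-smooth (and non-convex for p_i < 1)
    return sum(a * abs(xi) ** pi for xi, a, pi in zip(x, alpha, p))
```

Letting the exponent vary in space is what allows one functional to behave quadratically in smooth regions and sparsity-promoting near features, which is the motivation for the joint recovery/feature-extraction formulation.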
Title: A Variational Approach for Joint Image Recovery and Feature Extraction Based on Spatially Varying Generalised Gaussian Models
Authors: Émilie Chouzenoux, Marie-Caroline Corbineau, Jean-Christophe Pesquet, Gabriele Scrivanti
Pub Date: 2024-04-12. DOI: 10.1007/s10851-024-01184-z
Pub Date: 2024-04-09. DOI: 10.1007/s10851-024-01185-y
Patrick M. Jensen, J. Andreas Bærentzen, Anders B. Dahl, Vedrana A. Dahl
Dynamic 3D imaging is increasingly used to study evolving objects. We address the problem of detecting and tracking simple objects that merge or split in time. Common solutions involve detecting topological changes. Instead, we solve the problem in 4D by exploiting the observation that if objects only merge or only split, they appear as a single component in 4D. This allows us to initiate a topologically simple 3D hypersurface and deform it to fit the surface of all objects at all times. This gives an extremely compact representation of the objects’ evolution. We test our method on artificial 4D images and compare it to other segmentation methods. We also apply our method to a 4D X-ray data set to quantify evolving topology. Our method performs comparably to existing methods with better resource use and improved robustness.
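The key observation — objects that only merge (or only split) form a single connected component in 4D — can be checked with ordinary connected-component labelling over 4D integer points. This toy stand-in uses axis-aligned neighbours and is not the paper's deformable-hypersurface method:

```python
from collections import deque

def label_4d(voxels):
    # connected components of a set of integer 4D points (x, y, z, t),
    # using the 8-neighbourhood of +/-1 steps along each of the 4 axes
    voxels = set(voxels)
    labels, current = {}, 0
    for start in voxels:
        if start in labels:
            continue
        current += 1
        labels[start] = current
        queue = deque([start])
        while queue:
            pt = queue.popleft()
            for ax in range(4):
                for step in (-1, 1):
                    nb = tuple(c + step * (k == ax) for k, c in enumerate(pt))
                    if nb in voxels and nb not in labels:
                        labels[nb] = current
                        queue.append(nb)
    return labels

# two blobs separated in x at t=0 that merge at t=1: one 4D component
merging = [(0, 0, 0, 0), (2, 0, 0, 0),                  # t = 0, separated
           (0, 0, 0, 1), (1, 0, 0, 1), (2, 0, 0, 1)]    # t = 1, bridged
```

At t=0 alone the two blobs are distinct components; adding the t=1 slice that bridges them fuses everything into a single 4D component, which is why a single topologically simple hypersurface can enclose the whole evolution.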
Title: Finding Space-Time Boundaries with Deformable Hypersurfaces
Pub Date: 2024-04-01. DOI: 10.1007/s10851-024-01175-0
Kristina Schaefer, Joachim Weickert
We introduce regularised diffusion–shock (RDS) inpainting as a modification of diffusion–shock inpainting from our SSVM 2023 conference paper. RDS inpainting combines two carefully chosen components: homogeneous diffusion and coherence-enhancing shock filtering. It benefits from the complementary synergy of its building blocks: The shock term propagates edge data with perfect sharpness and directional accuracy over large distances due to its high degree of anisotropy. Homogeneous diffusion fills large areas efficiently. The second order equation underlying RDS inpainting inherits a maximum–minimum principle from its components, which is also fulfilled in the discrete case, in contrast to competing anisotropic methods. The regularisation addresses the largest drawback of the original model: It allows a drastic reduction in model parameters without any loss in quality. Furthermore, we extend RDS inpainting to vector-valued data. Our experiments show a performance that is comparable to or better than many inpainting methods based on partial differential equations and related integrodifferential models, including anisotropic processes of second or fourth order.
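The homogeneous-diffusion building block is simple to illustrate in isolation: an explicit 1D scheme that diffuses only at inpainting-domain pixels converges to the discrete harmonic (here linear) interpolant and respects the maximum–minimum principle. A sketch with no shock term (a drastic simplification of RDS):

```python
def diffusion_inpaint_1d(u, known, tau=0.45, iters=2000):
    # explicit homogeneous diffusion u_i += tau * (u_{i-1} - 2 u_i + u_{i+1})
    # applied only at inpainting-domain pixels (known data held fixed);
    # tau <= 0.5 keeps the scheme stable
    u = list(u)
    for _ in range(iters):
        nxt = u[:]
        for i in range(1, len(u) - 1):
            if not known[i]:
                nxt[i] = u[i] + tau * (u[i - 1] - 2.0 * u[i] + u[i + 1])
        u = nxt
    return u

filled = diffusion_inpaint_1d([0.0, 0.0, 0.0, 0.0, 1.0],
                              [True, False, False, False, True])
```

The steady state interpolates linearly between the known endpoints, and every iterate stays within the range of the data — the discrete maximum–minimum principle the abstract highlights.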
Title: Regularised Diffusion–Shock Inpainting
Pub Date: 2024-03-24. DOI: 10.1007/s10851-024-01181-2
Plug-and-Play (PnP) and Regularization-by-Denoising (RED) are recent paradigms for image reconstruction that leverage the power of modern denoisers for image regularization. In particular, they have been shown to deliver state-of-the-art reconstructions with CNN denoisers. Since the regularization is performed in an ad-hoc manner, understanding the convergence of PnP and RED has been an active research area. It was shown in recent works that iterate convergence can be guaranteed if the denoiser is averaged or nonexpansive. However, integrating nonexpansivity with gradient-based learning is challenging, the core issue being that testing nonexpansivity is intractable. Using numerical examples, we show that existing CNN denoisers tend to violate the nonexpansive property, which can cause PnP or RED to diverge. In fact, algorithms for training nonexpansive denoisers either cannot guarantee nonexpansivity or are computationally intensive. In this work, we construct contractive and averaged image denoisers by unfolding splitting-based optimization algorithms applied to wavelet denoising and demonstrate that their regularization capacity for PnP and RED can be matched with CNN denoisers. To our knowledge, this is the first work to propose a simple framework for training contractive denoisers using network unfolding.
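Nonexpansivity — a Lipschitz constant of at most 1 — can at least be probed empirically by sampling the ratio ||D(x) − D(y)|| / ||x − y|| over random pairs; sampling only lower-bounds the true constant, so such a test can refute nonexpansivity but never certify it, consistent with the intractability noted above. A toy sketch with a 1-Lipschitz soft-thresholding map and a deliberately expansive map (examples ours):

```python
import random

def lipschitz_estimate(f, dim=8, trials=2000, seed=0):
    # empirical lower bound on the Lipschitz constant of f:
    # max over random pairs of ||f(x) - f(y)|| / ||x - y||
    rng = random.Random(seed)
    best = 0.0
    for _ in range(trials):
        x = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        y = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        num = sum((a - b) ** 2 for a, b in zip(f(x), f(y))) ** 0.5
        den = sum((a - b) ** 2 for a, b in zip(x, y)) ** 0.5
        best = max(best, num / den)
    return best

def soft_threshold(x, lam=0.1):
    # componentwise soft-thresholding: nonexpansive (1-Lipschitz)
    return [max(abs(v) - lam, 0.0) * (1.0 if v >= 0 else -1.0) for v in x]

def scale_up(x):
    # expansive map with Lipschitz constant 1.5
    return [1.5 * v for v in x]
```

An estimate above 1 (as for `scale_up`) proves the map is not nonexpansive — the failure mode the numerical examples above exhibit for some CNN denoisers.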
Title: Averaged Deep Denoisers for Image Regularization
Pub Date: 2024-03-23. DOI: 10.1007/s10851-024-01182-1
Snehal Bhayani, Janne Heikkilä, Zuzana Kukelova
Many computer vision applications require robust and efficient estimation of camera geometry from a minimal number of input data measurements. Minimal problems are usually formulated as complex systems of sparse polynomial equations. The systems are usually overdetermined and consist of polynomials with algebraically constrained coefficients. Most state-of-the-art efficient polynomial solvers are based on the action matrix method, which has been automated and highly optimized in recent years. On the other hand, the alternative theory of sparse resultants based on Newton polytopes has not been used so often for generating efficient solvers, primarily because the polytopes do not respect the constraints amongst the coefficients. To tackle this challenge, we propose a simple iterative scheme that tests various subsets of the Newton polytopes and searches for the most efficient solver. Moreover, we propose to use an extra polynomial with a special form to further improve solver efficiency via Schur complement computation. We show that for some camera geometry problems our resultant-based method leads to smaller and more stable solvers than the state-of-the-art Gröbner basis-based solvers, while being significantly smaller than the state-of-the-art resultant-based methods. The proposed method can be fully automated and incorporated into existing tools for the automatic generation of efficient polynomial solvers. It provides a competitive alternative to popular Gröbner basis-based methods for minimal problems in computer vision. Additionally, we study the conditions under which the minimal solvers generated by the state-of-the-art action matrix-based methods and the proposed extra-polynomial resultant-based method are equivalent. Specifically, we consider a step-by-step comparison between the approaches based on the action matrix and the sparse resultant, followed by a set of substitutions which would lead to equivalent minimal solvers.
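For two univariate polynomials, the resultant — zero exactly when they share a root — is the determinant of their Sylvester matrix; the sparse-resultant theory above generalises this elimination idea to polynomial systems. A minimal dense sketch (coefficient lists, highest degree first; an illustration, not the paper's solver):

```python
def sylvester(f, g):
    # Sylvester matrix of polynomials f (degree m) and g (degree n),
    # given as coefficient lists with the highest-degree term first:
    # n shifted copies of f stacked over m shifted copies of g
    m, n = len(f) - 1, len(g) - 1
    size = m + n
    M = [[0] * size for _ in range(size)]
    for i in range(n):
        for j, c in enumerate(f):
            M[i][i + j] = c
    for i in range(m):
        for j, c in enumerate(g):
            M[n + i][i + j] = c
    return M

def det(M):
    # cofactor expansion along the first row (fine for small matrices)
    if len(M) == 1:
        return M[0][0]
    return sum((-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
               for j in range(len(M)))

def resultant(f, g):
    return det(sylvester(f, g))
```

For f = x² − 1 and g = x − 2 the resultant is f(2) = 3 (nonzero: no common root), while against g = x − 1 it vanishes, detecting the shared root x = 1.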
Title: Sparse Resultant-Based Minimal Solvers in Computer Vision and Their Connection with the Action Matrix
Pub Date: 2024-03-19. DOI: 10.1007/s10851-024-01179-w
Eyal Gofer, Guy Gilboa
Inversion of operators is a fundamental concept in data processing. Inversion of linear operators is well studied, supported by established theory. When an inverse either does not exist or is not unique, generalized inverses are used. Most notable is the Moore–Penrose inverse, widely used in physics, statistics, and various fields of engineering. This work investigates generalized inversion of nonlinear operators. We first address broadly the desired properties of generalized inverses, guided by the Moore–Penrose axioms. We define the notion for general sets and then a refinement, termed pseudo-inverse, for normed spaces. We present conditions for existence and uniqueness of a pseudo-inverse and establish theoretical results investigating its properties, such as continuity, its value for operator compositions and projection operators, and others. Analytic expressions are given for the pseudo-inverse of some well-known, non-invertible, nonlinear operators, such as hard- or soft-thresholding and ReLU. We analyze a neural layer and discuss relations to wavelet thresholding. Next, the Drazin inverse, and a relaxation, are investigated for operators with equal domain and range. We present scenarios where inversion is expressible as a linear combination of forward applications of the operator. Such scenarios arise for classes of nonlinear operators with vanishing polynomials, similar to the minimal or characteristic polynomials for matrices. Inversion using forward applications may facilitate the development of new efficient algorithms for approximating generalized inversion of complex nonlinear operators.
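As a concrete instance of a nonlinear pseudo-inverse, scalar soft-thresholding S_λ(x) = sign(x)·max(|x| − λ, 0) admits a simple minimal-norm right inverse. The sketch below follows that convention, which may differ in detail from the paper's axiomatic definition:

```python
def soft(x, lam):
    # scalar soft-thresholding: shrink |x| by lam, clip to zero
    s = abs(x) - lam
    return (s if x >= 0 else -s) if s > 0 else 0.0

def soft_pinv(y, lam):
    # minimal-norm right inverse: among all x with soft(x, lam) == y,
    # return the one of smallest magnitude (0 when y == 0, since every
    # |x| <= lam is thresholded to 0)
    if y == 0.0:
        return 0.0
    return y + (lam if y > 0 else -lam)
```

Under this convention soft(soft_pinv(y)) = y for every y (a true right inverse), while soft_pinv(soft(x)) maps x to itself outside the dead zone and to 0 inside it — a projection-like behaviour echoing the Moore–Penrose axioms discussed above.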
Title: Generalized Inversion of Nonlinear Operators