首页 > 最新文献

Journal of Mathematical Imaging and Vision最新文献

英文 中文
Assessing Hierarchies by Their Consistent Segmentations 通过一致的细分来评估层次结构
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-03-18 DOI: 10.1007/s10851-024-01176-z
Zeev Gutman, Ritvik Vij, Laurent Najman, Michael Lindenbaum

Current approaches to generic segmentation start by creating a hierarchy of nested image partitions and then specifying a segmentation from it. Our first contribution is to describe several ways, most of them new, for specifying segmentations using the hierarchy elements. Then, we consider the best hierarchy-induced segmentation specified by a limited number of hierarchy elements. We focus on a common quality measure for binary segmentations, the Jaccard index (also known as IoU). Optimizing the Jaccard index is highly nontrivial, and yet we propose an efficient approach for doing exactly that. This way we get algorithm-independent upper bounds on the quality of any segmentation created from the hierarchy. We found that the obtainable segmentation quality varies significantly depending on the way that the segments are specified by the hierarchy elements, and that representing a segmentation with only a few hierarchy elements is often possible.

目前的通用分割方法首先是创建嵌套图像分区的层次结构,然后从中指定分割。我们的第一个贡献是描述了几种使用层次元素指定分割的方法,其中大部分是新方法。然后,我们考虑由有限数量的层次结构元素指定的最佳层次结构诱导分割。我们将重点放在二元分割的常用质量度量上,即 Jaccard 指数(也称为 IoU)。优化 Jaccard 指数并非易事,但我们提出了一种高效的方法来实现这一目标。通过这种方法,我们可以获得与算法无关的分层质量上限。我们发现,可获得的分割质量会因层次结构元素指定分割的方式不同而有很大差异,而且通常只需几个层次结构元素就能代表一个分割。
{"title":"Assessing Hierarchies by Their Consistent Segmentations","authors":"Zeev Gutman, Ritvik Vij, Laurent Najman, Michael Lindenbaum","doi":"10.1007/s10851-024-01176-z","DOIUrl":"https://doi.org/10.1007/s10851-024-01176-z","url":null,"abstract":"<p>Current approaches to generic segmentation start by creating a hierarchy of nested image partitions and then specifying a segmentation from it. Our first contribution is to describe several ways, most of them new, for specifying segmentations using the hierarchy elements. Then, we consider the best hierarchy-induced segmentation specified by a limited number of hierarchy elements. We focus on a common quality measure for binary segmentations, the Jaccard index (also known as IoU). Optimizing the Jaccard index is highly nontrivial, and yet we propose an efficient approach for doing exactly that. This way we get algorithm-independent upper bounds on the quality of any segmentation created from the hierarchy. We found that the obtainable segmentation quality varies significantly depending on the way that the segments are specified by the hierarchy elements, and that representing a segmentation with only a few hierarchy elements is often possible.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140151337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Primal–Dual Hybrid Gradient Algorithm with Adaptive Step Sizes 自适应步长的随机原始-双重混合梯度算法
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-03-16 DOI: 10.1007/s10851-024-01174-1

Abstract

In this work, we propose a new primal–dual algorithm with adaptive step sizes. The stochastic primal–dual hybrid gradient (SPDHG) algorithm with constant step sizes has become widely applied in large-scale convex optimization across many scientific fields due to its scalability. While the product of the primal and dual step sizes is subject to an upper-bound in order to ensure convergence, the selection of the ratio of the step sizes is critical in applications. Up-to-now there is no systematic and successful way of selecting the primal and dual step sizes for SPDHG. In this work, we propose a general class of adaptive SPDHG (A-SPDHG) algorithms and prove their convergence under weak assumptions. We also propose concrete parameters-updating strategies which satisfy the assumptions of our theory and thereby lead to convergent algorithms. Numerical examples on computed tomography demonstrate the effectiveness of the proposed schemes.

摘要 在这项工作中,我们提出了一种新的具有自适应步长的初等双梯度算法。具有恒定步长的随机主双混合梯度(SPDHG)算法因其可扩展性,已在许多科学领域的大规模凸优化中得到广泛应用。为了确保收敛性,原始步长和二元步长的乘积受制于一个上限值,而步长比例的选择在应用中至关重要。迄今为止,还没有一种系统而成功的方法来选择 SPDHG 的原始步长和对偶步长。在这项研究中,我们提出了一类自适应 SPDHG(A-SPDHG)算法,并证明了它们在弱假设条件下的收敛性。我们还提出了具体的参数更新策略,这些策略满足我们理论中的假设,从而导致算法收敛。计算断层扫描的数值示例证明了所提方案的有效性。
{"title":"Stochastic Primal–Dual Hybrid Gradient Algorithm with Adaptive Step Sizes","authors":"","doi":"10.1007/s10851-024-01174-1","DOIUrl":"https://doi.org/10.1007/s10851-024-01174-1","url":null,"abstract":"<h3>Abstract</h3> <p>In this work, we propose a new primal–dual algorithm with adaptive step sizes. The stochastic primal–dual hybrid gradient (SPDHG) algorithm with constant step sizes has become widely applied in large-scale convex optimization across many scientific fields due to its scalability. While the product of the primal and dual step sizes is subject to an upper-bound in order to ensure convergence, the selection of the ratio of the step sizes is critical in applications. Up-to-now there is no systematic and successful way of selecting the primal and dual step sizes for SPDHG. In this work, we propose a general class of adaptive SPDHG (A-SPDHG) algorithms and prove their convergence under weak assumptions. We also propose concrete parameters-updating strategies which satisfy the assumptions of our theory and thereby lead to convergent algorithms. Numerical examples on computed tomography demonstrate the effectiveness of the proposed schemes. </p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140151418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Product of Gaussian Mixture Diffusion Models 高斯混合扩散模型的乘积
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-03-15 DOI: 10.1007/s10851-024-01180-3
Martin Zach, Erich Kobler, Antonin Chambolle, Thomas Pock

In this work, we tackle the problem of estimating the density ( f_X ) of a random variable ( X ) by successive smoothing, such that the smoothed random variable ( Y ) fulfills the diffusion partial differential equation ( (partial _t - Delta _1)f_Y(,cdot ,, t) = 0 ) with initial condition ( f_Y(,cdot ,, 0) = f_X ). We propose a product-of-experts-type model utilizing Gaussian mixture experts and study configurations that admit an analytic expression for ( f_Y (,cdot ,, t) ). In particular, with a focus on image processing, we derive conditions for models acting on filter, wavelet, and shearlet responses. Our construction naturally allows the model to be trained simultaneously over the entire diffusion horizon using empirical Bayes. We show numerical results for image denoising where our models are competitive while being tractable, interpretable, and having only a small number of learnable parameters. As a by-product, our models can be used for reliable noise level estimation, allowing blind denoising of images corrupted by heteroscedastic noise.

在这项工作中,我们要解决的问题是通过连续平滑来估计随机变量 ( X ) 的密度 ( f_X )、使得平滑后的随机变量( Y) 满足扩散偏微分方程( (partial _t - Delta _1)f_Y(,cdot ,, t) = 0 ),初始条件为( f_Y(,cdot ,, 0) = f_X )。我们提出了一种利用高斯混合物专家的专家产品型模型,并研究了允许对 ( f_Y (,cdot , t) ) 进行解析表达的配置。特别是,以图像处理为重点,我们推导出了作用于滤波、小波和小剪响应的模型条件。我们的构造自然允许使用经验贝叶斯在整个扩散范围内同时训练模型。我们展示了图像去噪的数值结果,在这些结果中,我们的模型是有竞争力的,同时也是可操作、可解释的,并且只有少量可学习参数。作为副产品,我们的模型可用于可靠的噪声水平估计,从而对受异速噪声干扰的图像进行盲去噪。
{"title":"Product of Gaussian Mixture Diffusion Models","authors":"Martin Zach, Erich Kobler, Antonin Chambolle, Thomas Pock","doi":"10.1007/s10851-024-01180-3","DOIUrl":"https://doi.org/10.1007/s10851-024-01180-3","url":null,"abstract":"<p>In this work, we tackle the problem of estimating the density <span>( f_X )</span> of a random variable <span>( X )</span> by successive smoothing, such that the smoothed random variable <span>( Y )</span> fulfills the diffusion partial differential equation <span>( (partial _t - Delta _1)f_Y(,cdot ,, t) = 0 )</span> with initial condition <span>( f_Y(,cdot ,, 0) = f_X )</span>. We propose a product-of-experts-type model utilizing Gaussian mixture experts and study configurations that admit an analytic expression for <span>( f_Y (,cdot ,, t) )</span>. In particular, with a focus on image processing, we derive conditions for models acting on filter, wavelet, and shearlet responses. Our construction naturally allows the model to be trained simultaneously over the entire diffusion horizon using empirical Bayes. We show numerical results for image denoising where our models are competitive while being tractable, interpretable, and having only a small number of learnable parameters. As a by-product, our models can be used for reliable noise level estimation, allowing blind denoising of images corrupted by heteroscedastic noise.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140151336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Batch-Less Stochastic Gradient Descent for Compressive Learning of Deep Regularization for Image Denoising 用于图像去噪的深度正则化压缩学习的无批次随机梯度下降技术
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-03-13 DOI: 10.1007/s10851-024-01178-x
Hui Shi, Yann Traonmilin, Jean-François Aujol

We consider the problem of denoising with the help of prior information taken from a database of clean signals or images. Denoising with variational methods is very efficient if a regularizer well-adapted to the nature of the data is available. Thanks to the maximum a posteriori Bayesian framework, such regularizer can be systematically linked with the distribution of the data. With deep neural networks (DNN), complex distributions can be recovered from a large training database. To reduce the computational burden of this task, we adapt the compressive learning framework to the learning of regularizers parametrized by DNN. We propose two variants of stochastic gradient descent (SGD) for the recovery of deep regularization parameters from a heavily compressed database. These algorithms outperform the initially proposed method that was limited to low-dimensional signals, each iteration using information from the whole database. They also benefit from classical SGD convergence guarantees. Thanks to these improvements we show that this method can be applied for patch-based image denoising.

我们考虑的问题是利用从干净信号或图像数据库中获取的先验信息进行去噪。如果有一个非常适合数据性质的正则化器,使用变分法去噪就会非常有效。得益于最大后验贝叶斯框架,这种正则器可以与数据分布系统地联系起来。利用深度神经网络(DNN),可以从大型训练数据库中恢复复杂的分布。为了减轻这项任务的计算负担,我们将压缩学习框架调整为以 DNN 为参数的正则学习。我们提出了两种随机梯度下降(SGD)变体,用于从严重压缩的数据库中恢复深度正则化参数。这些算法优于最初提出的方法,后者仅限于低维信号,每次迭代都使用来自整个数据库的信息。它们还受益于经典的 SGD 收敛保证。得益于这些改进,我们证明这种方法可用于基于补丁的图像去噪。
{"title":"Batch-Less Stochastic Gradient Descent for Compressive Learning of Deep Regularization for Image Denoising","authors":"Hui Shi, Yann Traonmilin, Jean-François Aujol","doi":"10.1007/s10851-024-01178-x","DOIUrl":"https://doi.org/10.1007/s10851-024-01178-x","url":null,"abstract":"<p>We consider the problem of denoising with the help of prior information taken from a database of clean signals or images. Denoising with variational methods is very efficient if a regularizer well-adapted to the nature of the data is available. Thanks to the maximum a posteriori Bayesian framework, such regularizer can be systematically linked with the distribution of the data. With deep neural networks (DNN), complex distributions can be recovered from a large training database. To reduce the computational burden of this task, we adapt the compressive learning framework to the learning of regularizers parametrized by DNN. We propose two variants of stochastic gradient descent (SGD) for the recovery of deep regularization parameters from a heavily compressed database. These algorithms outperform the initially proposed method that was limited to low-dimensional signals, each iteration using information from the <i>whole database</i>. They also benefit from classical SGD convergence guarantees. Thanks to these improvements we show that this method can be applied for patch-based image denoising.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140114838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sufficient Conditions for Topology-Preserving Parallel Reductions on the Face-Centered Cubic Grid 面心立方网格上拓扑保全并行还原的充分条件
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-03-12 DOI: 10.1007/s10851-024-01177-y
Gábor Karai, Péter Kardos, Kálmán Palágyi

Topology preservation is a crucial issue in parallel reductions that transform binary pictures by changing only a set of black points to white at a time. In this paper, we present sufficient conditions for topology-preserving parallel reductions on the three types of pictures of the unconventional 3D face-centered cubic (FCC) grid. Some conditions provide methods of verifying that a given parallel reduction always preserves the topology, and the remaining ones directly provide deletion rules of topology-preserving parallel reductions, and make us possible to generate topologically correct thinning algorithms. We give local characterizations of P-simple points, whose simultaneous deletion preserves the topology, and the relationships among the existing universal sufficient conditions for arbitrary types of binary pictures and our new FCC-specific results are also established.

在通过一次只将一组黑点变为白点来变换二值图片的并行还原中,拓扑结构的保持是一个关键问题。在本文中,我们提出了对非常规三维面心立方网格(FCC)的三类图片进行拓扑保存并行还原的充分条件。其中一些条件提供了验证给定并行还原是否始终保持拓扑结构的方法,其余条件直接提供了保持拓扑结构的并行还原的删除规则,并使我们有可能生成拓扑结构正确的减薄算法。我们给出了同时删除可保留拓扑的 P 简单点的局部特征,还建立了任意类型二元图片的现有通用充分条件与我们针对 FCC 的新结果之间的关系。
{"title":"Sufficient Conditions for Topology-Preserving Parallel Reductions on the Face-Centered Cubic Grid","authors":"Gábor Karai, Péter Kardos, Kálmán Palágyi","doi":"10.1007/s10851-024-01177-y","DOIUrl":"https://doi.org/10.1007/s10851-024-01177-y","url":null,"abstract":"<p>Topology preservation is a crucial issue in parallel reductions that transform binary pictures by changing only a set of black points to white at a time. In this paper, we present sufficient conditions for topology-preserving parallel reductions on the three types of pictures of the unconventional 3D face-centered cubic (FCC) grid. Some conditions provide methods of verifying that a given parallel reduction always preserves the topology, and the remaining ones directly provide deletion rules of topology-preserving parallel reductions, and make us possible to generate topologically correct thinning algorithms. We give local characterizations of P-simple points, whose simultaneous deletion preserves the topology, and the relationships among the existing universal sufficient conditions for arbitrary types of binary pictures and our new FCC-specific results are also established.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140114971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Riesz Networks: Scale-Invariant Neural Networks in a Single Forward Pass 里兹网络一次前向传递中的规模不变神经网络
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-02-29 DOI: 10.1007/s10851-024-01171-4
Tin Barisin, Katja Schladitz, Claudia Redenbach

Scale invariance of an algorithm refers to its ability to treat objects equally independently of their size. For neural networks, scale invariance is typically achieved by data augmentation. However, when presented with a scale far outside the range covered by the training set, neural networks may fail to generalize. Here, we introduce the Riesz network, a novel scale- invariant neural network. Instead of standard 2d or 3d convolutions for combining spatial information, the Riesz network is based on the Riesz transform which is a scale-equivariant operation. As a consequence, this network naturally generalizes to unseen or even arbitrary scales in a single forward pass. As an application example, we consider detecting and segmenting cracks in tomographic images of concrete. In this context, ‘scale’ refers to the crack thickness which may vary strongly even within the same sample. To prove its scale invariance, the Riesz network is trained on one fixed crack width. We then validate its performance in segmenting simulated and real tomographic images featuring a wide range of crack widths. An additional experiment is carried out on the MNIST Large Scale data set.

算法的尺度不变性是指算法能够平等对待不同大小的对象。对于神经网络来说,规模不变性通常是通过数据增强来实现的。然而,当遇到远远超出训练集覆盖范围的尺度时,神经网络可能无法泛化。在此,我们介绍一种新型尺度不变神经网络--Riesz 网络。Riesz 网络以 Riesz 变换为基础,而不是以标准的 2d 或 3d 卷积来组合空间信息,Riesz 变换是一种尺度不变运算。因此,该网络只需一次前向传递,就能自然地泛化到未见过的甚至任意的尺度。作为一个应用实例,我们考虑检测和分割混凝土断层图像中的裂缝。在这种情况下,"尺度 "指的是裂缝厚度,即使在同一个样本中,裂缝厚度也会有很大变化。为了证明其尺度不变性,我们在一个固定的裂缝宽度上对 Riesz 网络进行了训练。然后,我们验证了它在分割具有多种裂缝宽度的模拟和真实断层图像时的性能。我们还在 MNIST 大尺度数据集上进行了额外的实验。
{"title":"Riesz Networks: Scale-Invariant Neural Networks in a Single Forward Pass","authors":"Tin Barisin, Katja Schladitz, Claudia Redenbach","doi":"10.1007/s10851-024-01171-4","DOIUrl":"https://doi.org/10.1007/s10851-024-01171-4","url":null,"abstract":"<p>Scale invariance of an algorithm refers to its ability to treat objects equally independently of their size. For neural networks, scale invariance is typically achieved by data augmentation. However, when presented with a scale far outside the range covered by the training set, neural networks may fail to generalize. Here, we introduce the Riesz network, a novel scale- invariant neural network. Instead of standard 2d or 3d convolutions for combining spatial information, the Riesz network is based on the Riesz transform which is a scale-equivariant operation. As a consequence, this network naturally generalizes to unseen or even arbitrary scales in a single forward pass. As an application example, we consider detecting and segmenting cracks in tomographic images of concrete. In this context, ‘scale’ refers to the crack thickness which may vary strongly even within the same sample. To prove its scale invariance, the Riesz network is trained on one fixed crack width. We then validate its performance in segmenting simulated and real tomographic images featuring a wide range of crack widths. An additional experiment is carried out on the MNIST Large Scale data set.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140009764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Prediction–Correction Primal–Dual Hybrid Gradient Algorithm for Solving Convex Minimization Problems with Linear Constraints 解决线性约束条件下凸最小化问题的新型预测-校正原始-双重混合梯度算法
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-02-24 DOI: 10.1007/s10851-024-01173-2

Abstract

The primal–dual hybrid gradient (PDHG) algorithm has been applied for solving linearly constrained convex problems. However, it was shown that without some additional assumptions, convergence may fail. In this work, we propose a new competitive prediction–correction primal–dual hybrid gradient algorithm to solve this kind of problem. Under some conditions, we prove the global convergence for the proposed algorithm with the rate of O(1/T) in a nonergodic sense, and also in the ergodic sense, in terms of the objective function value gap and the constraint violation. Comparative performance analysis of our method with other related methods on some matrix completion and wavelet-based image inpainting test problems shows the outperformance of our approach, in terms of iteration number and CPU time.

摘要 原始-双重混合梯度(PDHG)算法已被用于求解线性约束凸问题。然而,研究表明,如果没有一些额外的假设,收敛可能会失败。在这项研究中,我们提出了一种新的竞争性预测-修正原始-双重混合梯度算法来解决这类问题。在某些条件下,我们证明了所提算法的全局收敛性,在非遍历意义上收敛率为 O(1/T),而且在遍历意义上,在目标函数值差距和约束违反方面也是如此。我们的方法与其他相关方法在一些矩阵补全和基于小波的图像绘制测试问题上的性能对比分析表明,我们的方法在迭代次数和 CPU 时间上都优于其他方法。
{"title":"A New Prediction–Correction Primal–Dual Hybrid Gradient Algorithm for Solving Convex Minimization Problems with Linear Constraints","authors":"","doi":"10.1007/s10851-024-01173-2","DOIUrl":"https://doi.org/10.1007/s10851-024-01173-2","url":null,"abstract":"<h3>Abstract</h3> <p>The primal–dual hybrid gradient (PDHG) algorithm has been applied for solving linearly constrained convex problems. However, it was shown that without some additional assumptions, convergence may fail. In this work, we propose a new competitive prediction–correction primal–dual hybrid gradient algorithm to solve this kind of problem. Under some conditions, we prove the global convergence for the proposed algorithm with the rate of <em>O</em>(1/<em>T</em>) in a nonergodic sense, and also in the ergodic sense, in terms of the objective function value gap and the constraint violation. Comparative performance analysis of our method with other related methods on some matrix completion and wavelet-based image inpainting test problems shows the outperformance of our approach, in terms of iteration number and CPU time.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139953149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
About the Incorporation of Topological Prescriptions in CNNs for Medical Image Semantic Segmentation 关于将拓扑规定纳入用于医学图像语义分割的 CNN
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-02-21 DOI: 10.1007/s10851-024-01172-3
Zoé Lambert, Carole Le Guyader

Incorporating prior knowledge into a segmentation task, whether it be under the form of geometrical constraints (area/volume penalisation, convexity enforcement, etc.) or of topological constraints (to preserve the contextual relations between objects, to monitor the number of connected components), proves to increase accuracy in medical image segmentation. In particular, it allows to compensate for the issue of weak boundary definition, of imbalanced classes, and to be more in line with anatomical consistency even though the data do not explicitly exhibit those features. This observation underpins the introduced contribution that aims, in a hybrid setting, to leverage the best of both worlds that variational methods and supervised deep learning approaches embody: (a) versatility and adaptability in the mathematical formulation of the problem to encode geometrical/topological constraints, (b) interpretability of the results for the former formalism, while (c) more efficient and effective processing models, (d) ability to become more proficient at learning intricate features and executing more computationally intensive tasks, for the latter one. To be more precise, a unified variational framework involving topological prescriptions in the training of convolutional neural networks through the design of a suitable penalty in the loss function is provided. These topological constraints are implicitly enforced by viewing the segmentation procedure as a registration task between the processed image and its associated ground truth under incompressibility conditions, thus making them homeomorphic. A very preliminary version (Lambert et al., in Calatroni, Donatelli, Morigi, Prato, Santacesaria (eds) Scale space and variational methods in computer vision, Springer, Berlin, 2023, pp. 363–375) of this work has been published in the proceedings of the Ninth International Conference on Scale Space and Variational Methods in Computer Vision, 2023. It contained neither all the theoretical results, nor the detailed related proofs, nor did it include the numerical analysis of the designed algorithm. Besides these more involved developments in the present version, a more complete, systematic and thorough analysis of the numerical experiments is also conducted, addressing several issues: (i) limited amount of labelled data in the training phase, (ii) low contrast or imbalanced classes exhibited by the data, and (iii) explainability of the results.

在分割任务中加入先验知识,无论是几何约束(面积/体积惩罚、凸度执行等)还是拓扑约束(保留对象之间的上下文关系、监控连接成分的数量),都能提高医学影像分割的准确性。特别是,它可以弥补边界定义不清、类别不平衡的问题,并更符合解剖学的一致性,即使数据并没有明确显示出这些特征。这一观察结果支持了所介绍的贡献,其目的是在混合设置中利用变分方法和监督深度学习方法所体现的两个世界的优点:(a)问题数学表述的多功能性和适应性,以编码几何/拓扑约束;(b)前一种形式主义的结果的可解释性;而(c)后一种形式主义的更高效和有效的处理模型;(d)更熟练地学习复杂特征和执行计算密集型任务的能力。更准确地说,本文提供了一个统一的变分框架,通过在损失函数中设计适当的惩罚,在卷积神经网络的训练中涉及拓扑规定。这些拓扑约束是通过将分割过程视为在不可压缩条件下处理过的图像与其相关的地面实况之间的配准任务来隐含执行的,从而使它们具有同构性。这项工作的初步版本(Lambert 等人,载于 Calatroni、Donatelli、Morigi、Prato、Santacesaria(编)《计算机视觉中的尺度空间和变分方法》,施普林格,柏林,2023 年,第 363-375 页)已发表在 2023 年第九届计算机视觉中的尺度空间和变分方法国际会议论文集上。它既不包含所有理论结果,也不包含详细的相关证明,更不包括对所设计算法的数值分析。除了这些涉及面更广的发展之外,本版本还对数值实验进行了更完整、系统和透彻的分析,解决了以下几个问题:(i) 训练阶段标记数据量有限;(ii) 数据显示的低对比度或不平衡类别;(iii) 结果的可解释性。
{"title":"About the Incorporation of Topological Prescriptions in CNNs for Medical Image Semantic Segmentation","authors":"Zoé Lambert, Carole Le Guyader","doi":"10.1007/s10851-024-01172-3","DOIUrl":"https://doi.org/10.1007/s10851-024-01172-3","url":null,"abstract":"<p>Incorporating prior knowledge into a segmentation task, whether it be under the form of geometrical constraints (area/volume penalisation, convexity enforcement, etc.) or of topological constraints (to preserve the contextual relations between objects, to monitor the number of connected components), proves to increase accuracy in medical image segmentation. In particular, it allows to compensate for the issue of weak boundary definition, of imbalanced classes, and to be more in line with anatomical consistency even though the data do not explicitly exhibit those features. This observation underpins the introduced contribution that aims, in a hybrid setting, to leverage the best of both worlds that variational methods and supervised deep learning approaches embody: (a) versatility and adaptability in the mathematical formulation of the problem to encode geometrical/topological constraints, (b) interpretability of the results for the former formalism, while (c) more efficient and effective processing models, (d) ability to become more proficient at learning intricate features and executing more computationally intensive tasks, for the latter one. To be more precise, a unified variational framework involving topological prescriptions in the training of convolutional neural networks through the design of a suitable penalty in the loss function is provided. These topological constraints are implicitly enforced by viewing the segmentation procedure as a registration task between the processed image and its associated ground truth under incompressibility conditions, thus making them homeomorphic. A very preliminary version (Lambert et al., in Calatroni, Donatelli, Morigi, Prato, Santacesaria (eds) Scale space and variational methods in computer vision, Springer, Berlin, 2023, pp. 363–375) of this work has been published in the proceedings of the Ninth International Conference on Scale Space and Variational Methods in Computer Vision, 2023. It contained neither all the theoretical results, nor the detailed related proofs, nor did it include the numerical analysis of the designed algorithm. Besides these more involved developments in the present version, a more complete, systematic and thorough analysis of the numerical experiments is also conducted, addressing several issues: (i) limited amount of labelled data in the training phase, (ii) low contrast or imbalanced classes exhibited by the data, and (iii) explainability of the results.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139923724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geodesic Tracking via New Data-Driven Connections of Cartan Type for Vascular Tree Tracking 通过用于血管树跟踪的 Cartan 型新数据驱动连接进行大地跟踪
IF 2 4区 数学 Q1 Mathematics Pub Date : 2024-01-13 DOI: 10.1007/s10851-023-01170-x
Nicky J. van den Berg, Bart M. N. Smets, Gautam Pai, Jean-Marie Mirebeau, Remco Duits

We introduce a data-driven version of the plus Cartan connection on the homogeneous space ({mathbb {M}}_2) of 2D positions and orientations. We formulate a theorem that describes all shortest and straight curves (parallel velocity and parallel momentum, respectively) with respect to this new data-driven connection and corresponding Riemannian manifold. Then we use these shortest curves for geodesic tracking of complex vasculature in multi-orientation image representations defined on ({mathbb {M}}_{2}). The data-driven Cartan connection characterizes the Hamiltonian flow of all geodesics. It also allows for improved adaptation to curvature and misalignment of the (lifted) vessel structure that we track via globally optimal geodesics. We compute these geodesics numerically via steepest descent on distance maps on ({mathbb {M}}_2) that we compute by a new modified anisotropic fast-marching method.Our experiments range from tracking single blood vessels with fixed endpoints to tracking complete vascular trees in retinal images. Single vessel tracking is performed in a single run in the multi-orientation image representation, where we project the resulting geodesics back onto the underlying image. The complete vascular tree tracking requires only two runs and avoids prior segmentation, placement of extra anchor points, and dynamic switching between geodesic models. Altogether we provide a geodesic tracking method using a single, flexible, transparent, data-driven geodesic model providing globally optimal curves which correctly follow highly complex vascular structures in retinal images. All experiments in this article can be reproduced via documented Mathematica notebooks available at van den Berg (Data-driven left-invariant tracking in Mathematica, 2022).

我们在二维位置和方向的同质空间 ({mathbb {M}}_2) 上引入了数据驱动版本的加 Cartan 连接。我们提出了一个定理,描述了关于这个新的数据驱动连接和相应的黎曼流形的所有最短曲线和直线(分别是平行速度和平行动量)。然后,我们利用这些最短曲线对定义在 ({mathbb {M}}_{2}) 上的多方向图像表示中的复杂脉管进行大地跟踪。数据驱动的 Cartan 连接描述了所有测地线的哈密顿流。它还允许我们通过全局最优大地线跟踪(抬升的)血管结构,从而改善对曲率和错位的适应性。我们通过对 ({mathbb {M}}_2) 上的距离图进行最陡下降来数值计算这些大地线,这些大地线是我们通过一种新的改良各向异性快速行进方法计算得出的。单根血管的跟踪是在多方向图像表示法中一次运行完成的,我们将得到的大地线投影到底层图像上。完整血管树的跟踪只需运行两次,并避免了事先分割、放置额外锚点和在大地模型之间动态切换。总之,我们提供了一种使用单一、灵活、透明、数据驱动的测地线模型进行测地线跟踪的方法,它提供了全局最优曲线,能正确跟踪视网膜图像中高度复杂的血管结构。本文中的所有实验都可以通过 van den Berg 网站上的 Mathematica 笔记本(Mathematica 中的数据驱动左不变跟踪,2022 年)重现。
{"title":"Geodesic Tracking via New Data-Driven Connections of Cartan Type for Vascular Tree Tracking","authors":"Nicky J. van den Berg, Bart M. N. Smets, Gautam Pai, Jean-Marie Mirebeau, Remco Duits","doi":"10.1007/s10851-023-01170-x","DOIUrl":"https://doi.org/10.1007/s10851-023-01170-x","url":null,"abstract":"<p>We introduce a data-driven version of the plus Cartan connection on the homogeneous space <span>({mathbb {M}}_2)</span> of 2D positions and orientations. We formulate a theorem that describes all shortest and straight curves (parallel velocity and parallel momentum, respectively) with respect to this new data-driven connection and corresponding Riemannian manifold. Then we use these shortest curves for geodesic tracking of complex vasculature in multi-orientation image representations defined on <span>({mathbb {M}}_{2})</span>. The data-driven Cartan connection characterizes the Hamiltonian flow of all geodesics. It also allows for improved adaptation to curvature and misalignment of the (lifted) vessel structure that we track via globally optimal geodesics. We compute these geodesics numerically via steepest descent on distance maps on <span>({mathbb {M}}_2)</span> that we compute by a new modified anisotropic fast-marching method.Our experiments range from tracking single blood vessels with fixed endpoints to tracking complete vascular trees in retinal images. Single vessel tracking is performed in a single run in the multi-orientation image representation, where we project the resulting geodesics back onto the underlying image. The complete vascular tree tracking requires only two runs and avoids prior segmentation, placement of extra anchor points, and dynamic switching between geodesic models. Altogether we provide a geodesic tracking method using a single, flexible, transparent, data-driven geodesic model providing globally optimal curves which correctly follow highly complex vascular structures in retinal images. All experiments in this article can be reproduced via documented <i>Mathematica</i> notebooks available at van den Berg (Data-driven left-invariant tracking in <i>Mathematica</i>, 2022).</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139461111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mathematical Properties of Pyramid-Transform-Based Resolution Conversion and Its Applications 基于金字塔变换的分辨率转换的数学特性及其应用
IF 2 4区 数学 Q1 Mathematics Pub Date : 2023-12-25 DOI: 10.1007/s10851-023-01166-7
Kento Hosoya, Kouki Nozawa, Hayato Itoh, Atsushi Imiya

In this paper, we aim to clarify the statistical and geometric properties of linear resolution conversion for registration between different resolutions observed using the same modality. The pyramid transform is achieved by smoothing and downsampling. The dual operation of the pyramid transform is achieved by linear smoothing after upsampling. The rational-order pyramid transform is decomposed into upsampling for smoothing and the conventional integer-order pyramid transform. By controlling the ratio between upsampling for smoothing and downsampling in the pyramid transform, the rational-order pyramid transform is computed. The tensor expression of the multiway pyramid transform implies that the transform yields orthogonal base systems for any ratio of the rational pyramid transform. The numerical evaluation of the transform shows that the rational-order pyramid transform preserves the normalised distribution of greyscale in images.

本文旨在阐明线性分辨率转换的统计和几何特性,以便在使用同一模式观测到的不同分辨率之间进行配准。金字塔变换是通过平滑和下采样实现的。金字塔变换的双重操作是通过上采样后的线性平滑来实现的。有理阶金字塔变换分解为用于平滑的上采样和传统的整数阶金字塔变换。通过控制金字塔变换中平滑上采样和下采样的比例,可以计算出有理阶金字塔变换。多向金字塔变换的张量表达式意味着,对于有理阶金字塔变换的任何比率,该变换都能产生正交的基础系统。对变换的数值评估表明,有理阶金字塔变换保留了图像中灰度的归一化分布。
{"title":"Mathematical Properties of Pyramid-Transform-Based Resolution Conversion and Its Applications","authors":"Kento Hosoya, Kouki Nozawa, Hayato Itoh, Atsushi Imiya","doi":"10.1007/s10851-023-01166-7","DOIUrl":"https://doi.org/10.1007/s10851-023-01166-7","url":null,"abstract":"<p>In this paper, we aim to clarify the statistical and geometric properties of linear resolution conversion for registration between different resolutions observed using the same modality. The pyramid transform is achieved by smoothing and downsampling. The dual operation of the pyramid transform is achieved by linear smoothing after upsampling. The rational-order pyramid transform is decomposed into upsampling for smoothing and the conventional integer-order pyramid transform. By controlling the ratio between upsampling for smoothing and downsampling in the pyramid transform, the rational-order pyramid transform is computed. The tensor expression of the multiway pyramid transform implies that the transform yields orthogonal base systems for any ratio of the rational pyramid transform. The numerical evaluation of the transform shows that the rational-order pyramid transform preserves the normalised distribution of greyscale in images.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139035005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Mathematical Imaging and Vision
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1