首页 > 最新文献

Image Analysis & Stereology最新文献

英文 中文
PU-NET DEEP LEARNING ARCHITECTURE FOR GLIOMAS BRAIN TUMOUR SEGMENTATION IN MAGNETIC RESONANCE IMAGES 脑胶质瘤磁共振图像分割的Pu-net深度学习架构
4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-11-02 DOI: 10.5566/ias.2879
Yamina Azzi, Abdelouhab Moussaoui, Mohand-Tahar Kechadi
Automatic medical image segmentation is one of the main tasks for many organs and pathology structure delineation. It is also a crucial technique in the posterior clinical examination of brain tumours, like applying radiotherapy or tumour restrictions. Various image segmentation techniques have been proposed and applied to different image types of images. Recently, it has been shown that the deep learning approach accurately segments images, and its implementation is usually straightforward. In this paper, we proposed a novel approach, called PU-net, for automatic brain tumour segmentation in multi-modal magnetic resonance images (MRI) based on deep learning. We introduced an input processing block to a customised fully convolutional network derived from the U-Net network to handle the multi-modal inputs. We performed experiments over the Brats brain tumour dataset collected in 2018 and achieved dice scores of 0.905,0.827, and 0.803 for the whole tumour (WT), tumour core (TC), and enhancing tumour (ET) classes, respectively. This study also provides promising results compared to the traditional machine learning methods, such as support vector machines (SVM), random forest (RF) and other deep learning methods used in this context.
医学图像的自动分割是许多器官和病理结构描绘的主要任务之一。它也是脑肿瘤后临床检查的关键技术,如应用放射治疗或肿瘤限制。各种图像分割技术已经被提出并应用于不同的图像类型。最近,研究表明,深度学习方法可以准确地分割图像,并且其实现通常很简单。在本文中,我们提出了一种基于深度学习的多模态磁共振图像(MRI)自动脑肿瘤分割的新方法,称为PU-net。我们将输入处理块引入自定义的全卷积网络,该网络源自U-Net网络,用于处理多模态输入。我们在2018年收集的Brats脑肿瘤数据集上进行了实验,在整个肿瘤(WT)、肿瘤核心(TC)和增强肿瘤(ET)类别上分别获得了0.905、0.827和0.803的骰子分数。与传统的机器学习方法(如支持向量机(SVM)、随机森林(RF)和在此背景下使用的其他深度学习方法)相比,本研究也提供了有希望的结果。
{"title":"PU-NET DEEP LEARNING ARCHITECTURE FOR GLIOMAS BRAIN TUMOUR SEGMENTATION IN MAGNETIC RESONANCE IMAGES","authors":"Yamina Azzi, Abdelouhab Moussaoui, Mohand-Tahar Kechadi","doi":"10.5566/ias.2879","DOIUrl":"https://doi.org/10.5566/ias.2879","url":null,"abstract":"Automatic medical image segmentation is one of the main tasks for many organs and pathology structure delineation. It is also a crucial technique in the posterior clinical examination of brain tumours, like applying radiotherapy or tumour restrictions. Various image segmentation techniques have been proposed and applied to different image types of images. Recently, it has been shown that the deep learning approach accurately segments images, and its implementation is usually straightforward. In this paper, we proposed a novel approach, called PU-net, for automatic brain tumour segmentation in multi-modal magnetic resonance images (MRI) based on deep learning. We introduced an input processing block to a customised fully convolutional network derived from the U-Net network to handle the multi-modal inputs. We performed experiments over the Brats brain tumour dataset collected in 2018 and achieved dice scores of 0.905,0.827, and 0.803 for the whole tumour (WT), tumour core (TC), and enhancing tumour (ET) classes, respectively. This study also provides promising results compared to the traditional machine learning methods, such as support vector machines (SVM), random forest (RF) and other deep learning methods used in this context.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"99 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136017964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample-balanced and IoU-guided anchor-free visual tracking 样本平衡和iou引导无锚视觉跟踪
4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-11-01 DOI: 10.5566/ias.2929
Jueyu Zhu, Yu Qin, Kai Wang, Gao zhi Zeng
Siamese network-based visual tracking algorithms have achieved excellent performance in recent years, but challenges such as fast target motion, shape and scale variations have made the tracking extremely difficult. The regression of anchor-free tracking has low computational complexity, strong real-time performance, and is suitable for visual tracking. Based on the anchor-free siamese tracking framework, this paper firstly introduces balance factors and modulation coefficients into the cross-entropy loss function to solve the classification inaccuracy caused by the imbalance between positive and negative samples as well as the imbalance between hard and easy samples during the training process, so that the model focuses more on the positive samples and the hard samples that make the major contribution to the training. Secondly, the intersection over union (IoU) loss function of the regression branch is improved, not only focusing on the IoU between the predicted box and the ground truth box, but also considering the aspect ratios of the two boxes and the minimum bounding box area that accommodate the two, which guides the generation of more accurate regression offsets. The overall loss of classification and regression is iteratively minimized and improves the accuracy and robustness of visual tracking. Experiments on four public datasets, OTB2015, VOT2016, UAV123 and GOT-10k, show that the proposed algorithm achieves the state-of-the-art performance.
基于Siamese网络的视觉跟踪算法近年来取得了很好的效果,但目标快速运动、形状和尺度变化等挑战使得跟踪非常困难。无锚跟踪回归计算复杂度低,实时性强,适合于视觉跟踪。基于无锚暹罗跟踪框架,本文首先在交叉熵损失函数中引入平衡因子和调制系数,解决训练过程中正负样本不平衡、难易样本不平衡造成的分类不准确问题,使模型更加关注对训练贡献较大的正样本和难样本。其次,改进回归分支的IoU (intersection over union)损失函数,不仅关注预测框与地面真值框之间的IoU,还考虑了两个框的宽高比和容纳两者的最小边界框面积,从而指导生成更精确的回归偏移量。迭代最小化了分类和回归的总体损失,提高了视觉跟踪的准确性和鲁棒性。在OTB2015、VOT2016、UAV123和GOT-10k四个公共数据集上的实验表明,该算法达到了最先进的性能。
{"title":"Sample-balanced and IoU-guided anchor-free visual tracking","authors":"Jueyu Zhu, Yu Qin, Kai Wang, Gao zhi Zeng","doi":"10.5566/ias.2929","DOIUrl":"https://doi.org/10.5566/ias.2929","url":null,"abstract":"Siamese network-based visual tracking algorithms have achieved excellent performance in recent years, but challenges such as fast target motion, shape and scale variations have made the tracking extremely difficult. The regression of anchor-free tracking has low computational complexity, strong real-time performance, and is suitable for visual tracking. Based on the anchor-free siamese tracking framework, this paper firstly introduces balance factors and modulation coefficients into the cross-entropy loss function to solve the classification inaccuracy caused by the imbalance between positive and negative samples as well as the imbalance between hard and easy samples during the training process, so that the model focuses more on the positive samples and the hard samples that make the major contribution to the training. Secondly, the intersection over union (IoU) loss function of the regression branch is improved, not only focusing on the IoU between the predicted box and the ground truth box, but also considering the aspect ratios of the two boxes and the minimum bounding box area that accommodate the two, which guides the generation of more accurate regression offsets. The overall loss of classification and regression is iteratively minimized and improves the accuracy and robustness of visual tracking. Experiments on four public datasets, OTB2015, VOT2016, UAV123 and GOT-10k, show that the proposed algorithm achieves the state-of-the-art performance.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"46 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135763452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Existence and approximation of densities of chord length- and cross section area distributions 弦长和横截面积分布密度的存在和近似
4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-10-27 DOI: 10.5566/ias.2923
Thomas Van der Jagt, Geurt Jongbloed, Martina Vittorietti
In various stereological problems an $n$-dimensional convex body is intersected with an $(n-1)$-dimensional Isotropic Uniformly Random (IUR) hyperplane. In this paper the cumulative distribution function associated with the $(n-1)$-dimensional volume of such a random section is studied. This distribution is also known as chord length distribution and cross section area distribution in the planar and spatial case respectively. For various classes of convex bodies it is shown that these distribution functions are absolutely continuous with respect to Lebesgue measure. A Monte Carlo simulation scheme is proposed for approximating the corresponding probability density functions.
在各种立体问题中,n维凸体与n (n-1)维各向同性均匀随机超平面相交。本文研究了这种随机截面$(n-1)$维体积的累积分布函数。这种分布在平面和空间情况下分别称为弦长分布和横截面积分布。对于各种类型的凸体,证明了这些分布函数相对于勒贝格测度是绝对连续的。提出了一种蒙特卡罗模拟方案来近似相应的概率密度函数。
{"title":"Existence and approximation of densities of chord length- and cross section area distributions","authors":"Thomas Van der Jagt, Geurt Jongbloed, Martina Vittorietti","doi":"10.5566/ias.2923","DOIUrl":"https://doi.org/10.5566/ias.2923","url":null,"abstract":"In various stereological problems an $n$-dimensional convex body is intersected with an $(n-1)$-dimensional Isotropic Uniformly Random (IUR) hyperplane. In this paper the cumulative distribution function associated with the $(n-1)$-dimensional volume of such a random section is studied. This distribution is also known as chord length distribution and cross section area distribution in the planar and spatial case respectively. For various classes of convex bodies it is shown that these distribution functions are absolutely continuous with respect to Lebesgue measure. A Monte Carlo simulation scheme is proposed for approximating the corresponding probability density functions.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"34 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136319960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
IMPROVEMENT PROCEDURE FOR IMAGE SEGMENTATION OF FRUITS AND VEGETABLES BASED ON THE OTSU METHOD. 基于otsu方法的果蔬图像分割改进程序。
4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-10-26 DOI: 10.5566/ias.2939
Osbaldo Vite-Chávez, Jorge Flores-Troncoso, Reynel Olivera-Reyna, Jorge Ulises Munoz
Currently, there are significant challenges in the classification, recognition, and detection of fruits and vegetables. An important step to solve this problem is to obtain an accurate segmentation of the object of interest. However, the background and object separation in a gray image shows high errors for some thresholding techniques due to uneven or poorly conditioned lighting. An accepted strategy to decrease the error segmentation is to select the channel of a RGB image with high contrast. This paper presents the results of an experimental procedure based on a binary segmentation enhancement by using the Otsu method. The procedure was carried out with images of real agricultural products with and without additional noise to corroborate the robustness of the proposed strategy. The experimental tests were performed by using our database of RGB images of agricultural products under uncontrolled illumination. The results exhibit that the best segmentation is based on the selection of the Blue channel of the RGB test images due to its better contrast. Here, the quantitative results are measured by applying the Jaccard and Dice metrics based on the ground-truth images as optimal reference. Most of the results show an average percentage improvement difference greater than 45.5% in two experimental tests.
目前,水果和蔬菜的分类、识别和检测面临着重大挑战。解决这一问题的一个重要步骤是获得感兴趣对象的准确分割。然而,由于光照不均匀或条件不佳,某些阈值分割技术在灰度图像中显示出很高的误差。一种公认的减小分割误差的策略是选择具有高对比度的RGB图像的通道。本文给出了一种基于Otsu方法的二值分割增强实验程序的结果。该程序与真实农产品的图像进行,有或没有额外的噪声,以证实所提出的策略的鲁棒性。实验采用我们的非受控光照下的农产品RGB图像数据库进行。结果表明,选择RGB测试图像的蓝色通道具有较好的对比度,因此分割效果最好。在这里,定量的结果是通过应用Jaccard和Dice度量来衡量的,这些度量基于ground-truth图像作为最佳参考。在两次实验测试中,大多数结果的平均改进百分比差异大于45.5%。
{"title":"IMPROVEMENT PROCEDURE FOR IMAGE SEGMENTATION OF FRUITS AND VEGETABLES BASED ON THE OTSU METHOD.","authors":"Osbaldo Vite-Chávez, Jorge Flores-Troncoso, Reynel Olivera-Reyna, Jorge Ulises Munoz","doi":"10.5566/ias.2939","DOIUrl":"https://doi.org/10.5566/ias.2939","url":null,"abstract":"Currently, there are significant challenges in the classification, recognition, and detection of fruits and vegetables. An important step to solve this problem is to obtain an accurate segmentation of the object of interest. However, the background and object separation in a gray image shows high errors for some thresholding techniques due to uneven or poorly conditioned lighting. An accepted strategy to decrease the error segmentation is to select the channel of a RGB image with high contrast. This paper presents the results of an experimental procedure based on a binary segmentation enhancement by using the Otsu method. The procedure was carried out with images of real agricultural products with and without additional noise to corroborate the robustness of the proposed strategy. The experimental tests were performed by using our database of RGB images of agricultural products under uncontrolled illumination. The results exhibit that the best segmentation is based on the selection of the Blue channel of the RGB test images due to its better contrast. Here, the quantitative results are measured by applying the Jaccard and Dice metrics based on the ground-truth images as optimal reference. Most of the results show an average percentage improvement difference greater than 45.5% in two experimental tests.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134971598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Completed Multiply Threshold Encoding Pattern for Texture Classification 一种完整的纹理分类多重阈值编码模式
4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-10-22 DOI: 10.5566/ias.2824
Bin Li, Yibing Li, Q.M.Jonathan Wu
The binary pattern family has drawn wide attention for texture representation due to its promising performance and simple operation. However, most binary pattern methods focus on local neighborhoods but ignore center pixels. Even if some studies introduce the center based sub-pattern to provide complementary information, extant center based sub-patterns are much weaker than other local neighborhood based sub-patterns. This severe unbalance limits the classification performance of fusion features significantly. To alleviate this problem, this paper designs a multiply threshold center pattern (MTCP) to provide a more discriminative and complementary local texture representation with a compact form. First, a multiply thresholds encoding strategy is designed to encode the center pixel that generates three 1-bit binary patterns. Second, it adopts a compact multi-pattern encoding strategy to combine them into the 3-bit MTCP. Furthermore, this paper proposes a completed multiply threshold encoding pattern by fusing the MTCP, local sign pattern, and local magnitude pattern. Comprehensive experimental evaluations on three popular texture classification benchmarks confirm that the completed multiply threshold encoding pattern achieves superior texture classification performance.
二进制模式族以其良好的性能和简单的操作引起了纹理表示领域的广泛关注。然而,大多数二进制模式方法只关注局部邻域,而忽略中心像素。即使一些研究引入了基于中心的子模式来提供补充信息,现有的基于中心的子模式也远远弱于其他基于局部邻域的子模式。这种严重的不平衡严重限制了融合特征的分类性能。为了解决这一问题,本文设计了一种多阈值中心模式(MTCP),以紧凑的形式提供了一种更具区别性和互补性的局部纹理表示。首先,设计了一种多阈值编码策略,对生成三个1位二进制模式的中心像素进行编码。其次,采用紧凑的多模式编码策略,将它们组合成3位MTCP;在此基础上,提出了一种融合MTCP、局部符号模式和局部幅度模式的完整的多阈值编码模式。对三种常用的纹理分类基准进行了综合实验评估,验证了完成的多重阈值编码模式具有较好的纹理分类性能。
{"title":"A Completed Multiply Threshold Encoding Pattern for Texture Classification","authors":"Bin Li, Yibing Li, Q.M.Jonathan Wu","doi":"10.5566/ias.2824","DOIUrl":"https://doi.org/10.5566/ias.2824","url":null,"abstract":"The binary pattern family has drawn wide attention for texture representation due to its promising performance and simple operation. However, most binary pattern methods focus on local neighborhoods but ignore center pixels. Even if some studies introduce the center based sub-pattern to provide complementary information, extant center based sub-patterns are much weaker than other local neighborhood based sub-patterns. This severe unbalance limits the classification performance of fusion features significantly. To alleviate this problem, this paper designs a multiply threshold center pattern (MTCP) to provide a more discriminative and complementary local texture representation with a compact form. First, a multiply thresholds encoding strategy is designed to encode the center pixel that generates three 1-bit binary patterns. Second, it adopts a compact multi-pattern encoding strategy to combine them into the 3-bit MTCP. Furthermore, this paper proposes a completed multiply threshold encoding pattern by fusing the MTCP, local sign pattern, and local magnitude pattern. Comprehensive experimental evaluations on three popular texture classification benchmarks confirm that the completed multiply threshold encoding pattern achieves superior texture classification performance.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"54 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135464122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Image Analysis: 22nd Scandinavian Conference, SCIA 2023, Sirkka, Finland, April 18–21, 2023, Proceedings, Part II 图像分析:第22届斯堪的纳维亚会议,SCIA 2023, Sirkka,芬兰,4月18-21日,2023,会议录,第二部分
IF 0.9 4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-01-01 DOI: 10.1007/978-3-031-31438-4
{"title":"Image Analysis: 22nd Scandinavian Conference, SCIA 2023, Sirkka, Finland, April 18–21, 2023, Proceedings, Part II","authors":"","doi":"10.1007/978-3-031-31438-4","DOIUrl":"https://doi.org/10.1007/978-3-031-31438-4","url":null,"abstract":"","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"53 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76336346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Image Analysis: 22nd Scandinavian Conference, SCIA 2023, Sirkka, Finland, April 18–21, 2023, Proceedings, Part I. 图像分析:第22届斯堪的纳维亚会议,SCIA 2023, Sirkka,芬兰,4月18-21日,2023,会议录,第一部分。
IF 0.9 4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2023-01-01 DOI: 10.1007/978-3-031-31435-3
{"title":"Image Analysis: 22nd Scandinavian Conference, SCIA 2023, Sirkka, Finland, April 18–21, 2023, Proceedings, Part I.","authors":"","doi":"10.1007/978-3-031-31435-3","DOIUrl":"https://doi.org/10.1007/978-3-031-31435-3","url":null,"abstract":"","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"13 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87482325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-Step Method for Assessing Similarity of Random Sets 随机集相似度评估的两步法
IF 0.9 4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2021-12-15 DOI: 10.5566/ias.2600
Vesna Gotovac Đogaš, K. Helisova, B. Radović, J. Stanek, M. Zikmundová, Kateřina Brejchová
The paper concerns a new statistical method for assessing dissimilarity of two random sets based on one realisation of each of them. The method focuses on shapes of the components of the random sets, namely on the curvature of their boundaries together with the ratios of their perimeters and areas. Theoretical background is introduced and then, the method is described, justified by a simulation study and applied to real data of two different types of tissue - mammary cancer and mastopathy.
本文研究了一种基于两个随机集的一种实现来评估两个随机集的不相似性的新统计方法。该方法关注随机集组成部分的形状,即它们边界的曲率以及它们的周长和面积的比值。介绍了该方法的理论背景,并通过仿真研究验证了该方法的正确性,并将其应用于乳腺癌和乳腺病变两种不同类型组织的实际数据中。
{"title":"Two-Step Method for Assessing Similarity of Random Sets","authors":"Vesna Gotovac Đogaš, K. Helisova, B. Radović, J. Stanek, M. Zikmundová, Kateřina Brejchová","doi":"10.5566/ias.2600","DOIUrl":"https://doi.org/10.5566/ias.2600","url":null,"abstract":"The paper concerns a new statistical method for assessing dissimilarity of two random sets based on one realisation of each of them. The method focuses on shapes of the components of the random sets, namely on the curvature of their boundaries together with the ratios of their perimeters and areas. Theoretical background is introduced and then, the method is described, justified by a simulation study and applied to real data of two different types of tissue - mammary cancer and mastopathy.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"206 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80419146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved Model Configuration Strategies for Kannada Handwritten Numeral Recognition 改进的卡纳达语手写数字识别模型配置策略
IF 0.9 4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2021-12-15 DOI: 10.5566/ias.2586
G. D. Upadhye, U. Kulkarni, D. Mane
Handwritten numeral recognition has been an important area in the domain of pattern classification. The task becomes even more daunting when working with non-Roman numerals. While convolutional neural networks are the preferred choice for modeling the image data, the conception of techniques to obtain faster convergence and accurate results still poses an enigma to the researchers. In this paper, we present new methods for the initialization and the optimization of the traditional convolutional neural network architecture to obtain better results for Kannada numeral images. Specifically, we propose two different methods- an encoderdecoder setup for unsupervised training and weight initialization, and a particle swarm optimization strategy for choosing the ideal architecture configuration of the CNN. Unsupervised initial training of the architecture helps for a faster convergence owing to more task-suited weights as compared to random initialization while the optimization strategy is helpful to reduce the time required for the manual iterative approach of architecture selection. The proposed setup is trained on varying handwritten Kannada numerals. The proposed approaches are evaluated on two different datasets: a standard Dig-MNIST dataset and a custom-built dataset. Significant improvements across multiple performance metrics are observed in our proposed system over the traditional CNN training setup. The improvement in results makes a strong case for relying on such methods for faster and more accurate training and inference of digit classification, especially when working in the absence of transfer learning.
手写体数字识别一直是模式分类领域的一个重要研究方向。当处理非罗马数字时,这项任务变得更加艰巨。虽然卷积神经网络是图像数据建模的首选,但获得更快收敛和准确结果的技术概念仍然是研究人员的一个谜。在本文中,我们提出了新的方法来初始化和优化传统的卷积神经网络架构,以获得更好的结果对卡纳达数字图像。具体来说,我们提出了两种不同的方法——用于无监督训练和权值初始化的编码器和解码器设置,以及用于选择CNN理想架构配置的粒子群优化策略。与随机初始化相比,无监督的体系结构初始化训练具有更适合任务的权重,有助于更快的收敛,而优化策略有助于减少手动迭代体系结构选择方法所需的时间。所提出的设置是在不同的手写卡纳达数字上进行训练的。提出的方法在两个不同的数据集上进行了评估:一个标准的Dig-MNIST数据集和一个定制的数据集。与传统的CNN训练设置相比,我们提出的系统在多个性能指标上有了显著的改进。结果的改进为依赖这些方法进行更快、更准确的数字分类训练和推理提供了强有力的理由,特别是在没有迁移学习的情况下。
{"title":"Improved Model Configuration Strategies for Kannada Handwritten Numeral Recognition","authors":"G. D. Upadhye, U. Kulkarni, D. Mane","doi":"10.5566/ias.2586","DOIUrl":"https://doi.org/10.5566/ias.2586","url":null,"abstract":"Handwritten numeral recognition has been an important area in the domain of pattern classification. The task becomes even more daunting when working with non-Roman numerals. While convolutional neural networks are the preferred choice for modeling the image data, the conception of techniques to obtain faster convergence and accurate results still poses an enigma to the researchers. In this paper, we present new methods for the initialization and the optimization of the traditional convolutional neural network architecture to obtain better results for Kannada numeral images. Specifically, we propose two different methods- an encoderdecoder setup for unsupervised training and weight initialization, and a particle swarm optimization strategy for choosing the ideal architecture configuration of the CNN. Unsupervised initial training of the architecture helps for a faster convergence owing to more task-suited weights as compared to random initialization while the optimization strategy is helpful to reduce the time required for the manual iterative approach of architecture selection. The proposed setup is trained on varying handwritten Kannada numerals. The proposed approaches are evaluated on two different datasets: a standard Dig-MNIST dataset and a custom-built dataset. Significant improvements across multiple performance metrics are observed in our proposed system over the traditional CNN training setup. The improvement in results makes a strong case for relying on such methods for faster and more accurate training and inference of digit classification, especially when working in the absence of transfer learning.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"2 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85264895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian Approach to Morphological Models Characterization 形态学模型表征的贝叶斯方法
IF 0.9 4区 计算机科学 Q4 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2021-12-15 DOI: 10.5566/ias.2641
B. Figliuzzi, Antoine Montaux-Lambert, F. Willot, Grégoire Naudin, Pierre Dupuis, B. Querleux, E. Huguet
Morphological models are commonly used to describe microstructures observed in heterogeneous materials. Usually, these models depend upon a set of parameters that must be chosen carefully to match experimental observations conducted on the microstructure. A common approach to perform the parameters determination is to try to minimize an objective function, usually taken to be the discrepancy between measurements computed on the simulations and on the experimental observations, respectively. In this article, we present a Bayesian approach for determining the parameters of morphological models, based upon the definition of a posterior distribution for the parameters. A Monte Carlo Markov Chains (MCMC) algorithm is then used to generate samples from the posterior distribution and to identify a set of optimal parameters. We show on several examples that the Bayesian approach allows us to properly identify the optimal parameters of distinct morphological models and to identify potential correlations between the parameters of the models.
形态学模型通常用于描述在非均质材料中观察到的微观结构。通常,这些模型依赖于一组必须仔细选择的参数,以匹配对微观结构进行的实验观察。执行参数确定的一种常用方法是尽量使目标函数最小化,目标函数通常被认为是分别在模拟和实验观察中计算的测量值之间的差异。在本文中,我们提出了一种贝叶斯方法来确定形态学模型的参数,基于参数后验分布的定义。然后使用蒙特卡洛马尔可夫链(MCMC)算法从后验分布中生成样本并识别一组最优参数。我们通过几个例子表明,贝叶斯方法使我们能够正确识别不同形态模型的最佳参数,并识别模型参数之间的潜在相关性。
{"title":"A Bayesian Approach to Morphological Models Characterization","authors":"B. Figliuzzi, Antoine Montaux-Lambert, F. Willot, Grégoire Naudin, Pierre Dupuis, B. Querleux, E. Huguet","doi":"10.5566/ias.2641","DOIUrl":"https://doi.org/10.5566/ias.2641","url":null,"abstract":"Morphological models are commonly used to describe microstructures observed in heterogeneous materials. Usually, these models depend upon a set of parameters that must be chosen carefully to match experimental observations conducted on the microstructure. A common approach to perform the parameters determination is to try to minimize an objective function, usually taken to be the discrepancy between measurements computed on the simulations and on the experimental observations, respectively. In this article, we present a Bayesian approach for determining the parameters of morphological models, based upon the definition of a posterior distribution for the parameters. A Monte Carlo Markov Chains (MCMC) algorithm is then used to generate samples from the posterior distribution and to identify a set of optimal parameters. We show on several examples that the Bayesian approach allows us to properly identify the optimal parameters of distinct morphological models and to identify potential correlations between the parameters of the models.","PeriodicalId":49062,"journal":{"name":"Image Analysis & Stereology","volume":"1 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73071814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Image Analysis & Stereology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1