首页 > 最新文献

Int. J. Image Graph.最新文献

英文 中文
Feature Extraction and Selection in Archaeological Images for Automatic Annotation 面向自动标注的考古图像特征提取与选择
Pub Date : 2021-04-10 DOI: 10.1142/S0219467822500061
M. Salah, Ameni Yengui, M. Neji
In this paper, we present two steps in the process of automatic annotation in archeological images. These steps are feature extraction and feature selection. We focus our research on archeological images which are very much studied in our days. It presents the most important steps in the process of automatic annotation in an image. Feature extraction techniques are applied to get the feature that will be used in classifying and recognizing the images. Also, the selection of characteristics reduces the number of unattractive characteristics. However, we reviewed various images of feature extraction techniques to analyze the archaeological images. Each feature represents one or more feature descriptors in the archeological images. We focus on the descriptor shape of the archaeological objects extraction in the images using contour method-based shape recognition of the monuments. So, the feature selection stage serves to acquire the most interesting characteristics to improve the accuracy of the classification. In the feature selection section, we present a comparative study between feature selection techniques. Then we give our proposal of application of methods of selection of the characteristics of the archaeological images. Finally, we calculate the performance of two steps already mentioned: the extraction of characteristics and the selection of characteristics.
本文介绍了考古图像自动标注的两个步骤。这两个步骤分别是特征提取和特征选择。我们的研究重点是考古图像,这些图像在我们的时代被研究得非常多。介绍了图像自动标注过程中最重要的步骤。特征提取技术用于提取图像分类和识别所需的特征。此外,特征的选择减少了不吸引人的特征的数量。然而,我们回顾了各种图像特征提取技术来分析考古图像。每个特征代表考古图像中的一个或多个特征描述符。利用基于轮廓法的古迹形状识别方法,重点研究了图像中考古物体的描述符形状提取。因此,特征选择阶段是为了获取最有趣的特征,以提高分类的准确性。在特征选择部分,我们对特征选择技术进行了比较研究。在此基础上,提出了考古图像特征选择方法的应用建议。最后,我们计算了特征提取和特征选择两个步骤的性能。
{"title":"Feature Extraction and Selection in Archaeological Images for Automatic Annotation","authors":"M. Salah, Ameni Yengui, M. Neji","doi":"10.1142/S0219467822500061","DOIUrl":"https://doi.org/10.1142/S0219467822500061","url":null,"abstract":"In this paper, we present two steps in the process of automatic annotation in archeological images. These steps are feature extraction and feature selection. We focus our research on archeological images which are very much studied in our days. It presents the most important steps in the process of automatic annotation in an image. Feature extraction techniques are applied to get the feature that will be used in classifying and recognizing the images. Also, the selection of characteristics reduces the number of unattractive characteristics. However, we reviewed various images of feature extraction techniques to analyze the archaeological images. Each feature represents one or more feature descriptors in the archeological images. We focus on the descriptor shape of the archaeological objects extraction in the images using contour method-based shape recognition of the monuments. So, the feature selection stage serves to acquire the most interesting characteristics to improve the accuracy of the classification. In the feature selection section, we present a comparative study between feature selection techniques. Then we give our proposal of application of methods of selection of the characteristics of the archaeological images. Finally, we calculate the performance of two steps already mentioned: the extraction of characteristics and the selection of characteristics.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122291388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Kumaraswamy Distribution Based Bi-histogram Equalization for Enhancement of Microscopic Images 基于Kumaraswamy分布的双直方图均衡化显微图像增强
Pub Date : 2021-04-10 DOI: 10.1142/S0219467822500036
M. Suresha, D. Raghukumar, Subramanya Kuppa
Among all image enhancement techniques, histogram equalization is the most used technique. However, preserving brightness is the main issue, and it creates a weird look by destroying its originality. This paper proposes a new method that has command on the brightness issue of histogram equalization to enhance the quality of microscopic images. The method splits the histogram of each color channel into two sub-histograms based on their mean as the threshold and supplanting their cumulative distribution with Kumaraswamy distribution. The proposed method is tested with color microscopic images of cancer-affected lymph nodes gathered from Biological Image Repository IICBU, and objective and subjective assessments confirm that the proposed approach performs more efficiently compared to other state-of-the-art methods.
在所有的图像增强技术中,直方图均衡化是最常用的技术。然而,保持亮度是主要问题,它破坏了它的原创性,造成了奇怪的外观。本文提出了一种新的方法来解决直方图均衡化的亮度问题,以提高显微图像的质量。该方法将每个颜色通道的直方图以其均值为阈值分割成两个子直方图,并用Kumaraswamy分布代替它们的累积分布。采用生物图像库IICBU收集的癌症影响淋巴结的彩色显微图像对所提出的方法进行了测试,客观和主观评估证实,与其他最先进的方法相比,所提出的方法更有效。
{"title":"Kumaraswamy Distribution Based Bi-histogram Equalization for Enhancement of Microscopic Images","authors":"M. Suresha, D. Raghukumar, Subramanya Kuppa","doi":"10.1142/S0219467822500036","DOIUrl":"https://doi.org/10.1142/S0219467822500036","url":null,"abstract":"Among all image enhancement techniques, histogram equalization is the most used technique. However, preserving brightness is the main issue, and it creates a weird look by destroying its originality. This paper proposes a new method that has command on the brightness issue of histogram equalization to enhance the quality of microscopic images. The method splits the histogram of each color channel into two sub-histograms based on their mean as the threshold and supplanting their cumulative distribution with Kumaraswamy distribution. The proposed method is tested with color microscopic images of cancer-affected lymph nodes gathered from Biological Image Repository IICBU, and objective and subjective assessments confirm that the proposed approach performs more efficiently compared to other state-of-the-art methods.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127983647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Image Authentication Using Block Truncation Coding in Lifting Wavelet Domain 基于提升小波域的块截断编码图像认证
Pub Date : 2021-04-10 DOI: 10.1142/S0219467822500115
Anuj Bhardwaj, Vivek Singh Verma, Sandesh Gupta
Image watermarking is one of the most accepted solutions protecting image authenticity. The method presented in this paper not only provides the desired outcome also efficient in terms of memory requirements and preserving image characteristics. This scheme effectively utilizes the concepts of block truncation coding (BTC) and lifting wavelet transform (LWT). The BTC method is applied to observe the binary watermark image corresponding to its gray-scale image. Whereas, the LWT is incorporated to transform the cover image from spatial coordinates to corresponding transform coordinates. In this, a quantization-based approach for watermark bit embedding is applied. And, the extraction of binary watermark data from the attacked watermarked image is based on adaptive thresholding. To show the effectiveness of the proposed scheme, the experiment over different benchmark images is performed. The experimental results and the comparison with state-of-the-art schemes depict not only the good imperceptibility but also high robustness against various attacks.
图像水印是目前公认的保护图像真实性的解决方案之一。本文提出的方法不仅提供了期望的结果,而且在内存要求和保持图像特征方面效率高。该方案有效利用了块截断编码(BTC)和提升小波变换(LWT)的概念。采用BTC方法观察其灰度图像对应的二值水印图像。利用LWT将覆盖图像从空间坐标转换为相应的变换坐标。在此基础上,提出了一种基于量化的水印嵌入方法。基于自适应阈值法从被攻击的水印图像中提取二值水印数据。为了验证该方法的有效性,在不同的基准图像上进行了实验。实验结果和与现有方案的比较表明,该方案不仅具有良好的隐蔽性,而且对各种攻击具有较高的鲁棒性。
{"title":"Image Authentication Using Block Truncation Coding in Lifting Wavelet Domain","authors":"Anuj Bhardwaj, Vivek Singh Verma, Sandesh Gupta","doi":"10.1142/S0219467822500115","DOIUrl":"https://doi.org/10.1142/S0219467822500115","url":null,"abstract":"Image watermarking is one of the most accepted solutions protecting image authenticity. The method presented in this paper not only provides the desired outcome also efficient in terms of memory requirements and preserving image characteristics. This scheme effectively utilizes the concepts of block truncation coding (BTC) and lifting wavelet transform (LWT). The BTC method is applied to observe the binary watermark image corresponding to its gray-scale image. Whereas, the LWT is incorporated to transform the cover image from spatial coordinates to corresponding transform coordinates. In this, a quantization-based approach for watermark bit embedding is applied. And, the extraction of binary watermark data from the attacked watermarked image is based on adaptive thresholding. To show the effectiveness of the proposed scheme, the experiment over different benchmark images is performed. The experimental results and the comparison with state-of-the-art schemes depict not only the good imperceptibility but also high robustness against various attacks.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126951988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Generating Spectrum Images from Different Types - Visible, Thermal, and Infrared Based on Autoencoder Architecture (GVTI-AE) 基于自动编码器架构(GVTI-AE)的可见光、热成像和红外光谱图像生成
Pub Date : 2021-04-10 DOI: 10.1142/S021946782250005X
S. Jameel, Jafar Majidpour
Recently, numerous challenging problems have existed for transforming different image types (thermal infrared (TIR), visible spectrum, and near-infrared (NIR)). Other types of cameras may lack the ability and features of certain types of frequently-used cameras that produce different types of images. Based on camera features, different applications might emerge from observing a scenario under specific conditions (darkness, fog, night, day, and artificial light). We need to jump from one field to another to understand the scenario better. This paper proposes a fully automatic model (GVTI-AE) to manipulate the transformation into different types of vibrant, realistic images using the AutoEncoder method, which requires neither pre-nor post-processing or any user input. The experiments carried out using the GVTI-AE model showed that the perceptually realistic results produced in the widely available datasets (Tecnocampus Hand Image Database, Carl dataset, and IRIS Thermal/Visible Face Database).
近年来,不同类型的图像(热红外(TIR)、可见光谱和近红外(NIR))的转换存在许多具有挑战性的问题。其他类型的相机可能缺乏某些类型的常用相机的能力和功能,产生不同类型的图像。根据相机的功能,在特定条件下(黑暗、雾、夜晚、白天和人造光)观察场景可能会出现不同的应用程序。我们需要从一个领域跳到另一个领域,以便更好地理解这个场景。本文提出了一种全自动模型(GVTI-AE),利用AutoEncoder方法将图像转换成不同类型的充满活力的逼真图像,该模型既不需要预处理,也不需要后处理,也不需要用户输入。使用GVTI-AE模型进行的实验表明,在广泛可用的数据集(Tecnocampus Hand Image Database, Carl dataset和IRIS Thermal/Visible Face Database)中产生的感知逼真的结果。
{"title":"Generating Spectrum Images from Different Types - Visible, Thermal, and Infrared Based on Autoencoder Architecture (GVTI-AE)","authors":"S. Jameel, Jafar Majidpour","doi":"10.1142/S021946782250005X","DOIUrl":"https://doi.org/10.1142/S021946782250005X","url":null,"abstract":"Recently, numerous challenging problems have existed for transforming different image types (thermal infrared (TIR), visible spectrum, and near-infrared (NIR)). Other types of cameras may lack the ability and features of certain types of frequently-used cameras that produce different types of images. Based on camera features, different applications might emerge from observing a scenario under specific conditions (darkness, fog, night, day, and artificial light). We need to jump from one field to another to understand the scenario better. This paper proposes a fully automatic model (GVTI-AE) to manipulate the transformation into different types of vibrant, realistic images using the AutoEncoder method, which requires neither pre-nor post-processing or any user input. The experiments carried out using the GVTI-AE model showed that the perceptually realistic results produced in the widely available datasets (Tecnocampus Hand Image Database, Carl dataset, and IRIS Thermal/Visible Face Database).","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134471301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Breast Tumor Detection in MR Images Based on Density 基于密度的MR图像乳腺肿瘤检测
Pub Date : 2021-04-10 DOI: 10.1142/S0219467822500012
N. Shrivastava, J. Bharti
Breast cancer is dangerous in women. It is generally found after the symptoms appear. Detecting the breast cancer at an early stage and understanding the treatment are the most important strategies to prevent death from cancer. Generally, for detection of breast cancer, breast Magnetic Resonance Image (MRI) takes place. It is one of the best approaches to detect tumor in women. In this research paper, a combination of selection methods for seed region growing image segmentation is suggested to detect breast tumor. The suggested method has been divided into following parts: First, the pre-processing of breast image is performed. Second, the automatic threshold for binarization process is calculated. Third, the number of seed points and its position in the breast image are determined automatically using density of pixels value. Fourth, a method for calculation of threshold value is proposed for the purpose of region creation in seed region growing. For the evaluation purpose, the proposed method was applied and tested on the RIDER MRI breast dataset from National Biomedical Imaging Archive (NBIA). After the test was performed, it was observed that proposed algorithm gives 90% accuracy, 88% True Negative Fraction, 91% True Positive Fraction, 10% Misclassification Rate, 94% Precision and 86% Relative Overlap which is better than other existing methods. It not only gives better evaluation measure but also provides segmentation method for multiple tumor detection.
乳腺癌对女性来说是危险的。一般在症状出现后才发现。早期发现乳腺癌并了解治疗方法是预防癌症死亡的最重要策略。一般来说,为了检测乳腺癌,需要进行乳房磁共振成像(MRI)。这是检测女性肿瘤的最佳方法之一。本文提出了一种结合选择方法的种子区域生长图像分割方法来检测乳腺肿瘤。该方法主要分为以下几个部分:首先,对乳房图像进行预处理。其次,计算二值化过程的自动阈值;第三,利用像素密度值自动确定乳房图像中的种子点个数和位置;第四,提出了一种用于种子区域生长中区域创建的阈值计算方法。为了评估目的,将所提出的方法应用于美国国家生物医学成像档案馆(NBIA)的RIDER MRI乳房数据集并进行了测试。经过测试,该算法的准确率为90%,真负分数为88%,真正分数为91%,误分类率为10%,精度为94%,相对重叠率为86%,优于现有方法。它不仅提供了更好的评价指标,而且为多发肿瘤的检测提供了分割方法。
{"title":"Breast Tumor Detection in MR Images Based on Density","authors":"N. Shrivastava, J. Bharti","doi":"10.1142/S0219467822500012","DOIUrl":"https://doi.org/10.1142/S0219467822500012","url":null,"abstract":"Breast cancer is dangerous in women. It is generally found after the symptoms appear. Detecting the breast cancer at an early stage and understanding the treatment are the most important strategies to prevent death from cancer. Generally, for detection of breast cancer, breast Magnetic Resonance Image (MRI) takes place. It is one of the best approaches to detect tumor in women. In this research paper, a combination of selection methods for seed region growing image segmentation is suggested to detect breast tumor. The suggested method has been divided into following parts: First, the pre-processing of breast image is performed. Second, the automatic threshold for binarization process is calculated. Third, the number of seed points and its position in the breast image are determined automatically using density of pixels value. Fourth, a method for calculation of threshold value is proposed for the purpose of region creation in seed region growing. For the evaluation purpose, the proposed method was applied and tested on the RIDER MRI breast dataset from National Biomedical Imaging Archive (NBIA). After the test was performed, it was observed that proposed algorithm gives 90% accuracy, 88% True Negative Fraction, 91% True Positive Fraction, 10% Misclassification Rate, 94% Precision and 86% Relative Overlap which is better than other existing methods. It not only gives better evaluation measure but also provides segmentation method for multiple tumor detection.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128163851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Automated Image Denoising Model: Contribution Towards Optimized Internal and External Basis 自动图像去噪模型:对优化内外部基础的贡献
Pub Date : 2021-04-05 DOI: 10.1142/S0219467821500510
S. E. Kuzhali, D. Suresh
For handling digital images for various applications, image denoising is considered as a fundamental pre-processing step. Diverse image denoising algorithms have been introduced in the past few decades. The main intent of this proposal is to develop an effective image denoising model on the basis of internal and external patches. This model adopts Non-local means (NLM) for performing the denoising, which uses redundant information of the image in pixel or spatial domain to reduce the noise. While performing the image denoising using NLM, “denoising an image patch using the other noisy patches within the noisy image is done for internal denoising and denoising a patch using the external clean natural patches is done for external denoising”. Here, the selection of optimal block from the entire datasets including internal noisy images and external clean natural images is decided by a new hybrid optimization algorithm. The two renowned optimization algorithms Chicken Swarm Optimization (CSO), and Dragon Fly Algorithm (DA) are merged, and the new hybrid algorithm Rooster-based Levy Updated DA (RLU-DA) is adopted. The experimental results in terms of some relevant performance measures show the promising results of the proposed model with remarkable stability and high accuracy.
为了处理各种应用的数字图像,图像去噪被认为是一个基本的预处理步骤。在过去的几十年里,各种各样的图像去噪算法被引入。本文的主要目的是建立一种有效的基于内部和外部补丁的图像去噪模型。该模型采用非局部均值(Non-local means, NLM)进行去噪,利用图像在像素域或空间域的冗余信息去噪。在使用NLM进行图像去噪时,“使用噪声图像内的其他噪声块对图像patch进行去噪以进行内部去噪,使用外部干净的自然块对patch进行去噪以进行外部去噪”。采用一种新的混合优化算法,从包含内部噪声图像和外部干净自然图像的整个数据集中选择最优块。将鸡群算法(Chicken Swarm optimization, CSO)和蜻蜓算法(Dragon Fly Algorithm, DA)两种著名的优化算法进行合并,采用了基于公鸡的Levy Updated DA (RLU-DA)混合算法。相关性能指标的实验结果表明,该模型具有良好的稳定性和较高的精度。
{"title":"Automated Image Denoising Model: Contribution Towards Optimized Internal and External Basis","authors":"S. E. Kuzhali, D. Suresh","doi":"10.1142/S0219467821500510","DOIUrl":"https://doi.org/10.1142/S0219467821500510","url":null,"abstract":"For handling digital images for various applications, image denoising is considered as a fundamental pre-processing step. Diverse image denoising algorithms have been introduced in the past few decades. The main intent of this proposal is to develop an effective image denoising model on the basis of internal and external patches. This model adopts Non-local means (NLM) for performing the denoising, which uses redundant information of the image in pixel or spatial domain to reduce the noise. While performing the image denoising using NLM, “denoising an image patch using the other noisy patches within the noisy image is done for internal denoising and denoising a patch using the external clean natural patches is done for external denoising”. Here, the selection of optimal block from the entire datasets including internal noisy images and external clean natural images is decided by a new hybrid optimization algorithm. The two renowned optimization algorithms Chicken Swarm Optimization (CSO), and Dragon Fly Algorithm (DA) are merged, and the new hybrid algorithm Rooster-based Levy Updated DA (RLU-DA) is adopted. The experimental results in terms of some relevant performance measures show the promising results of the proposed model with remarkable stability and high accuracy.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115005684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reminiscent Net: Conditional GAN-based Old Image De-Creasing 怀旧网络:基于条件gan的旧图像降噪
Pub Date : 2021-03-18 DOI: 10.1142/S0219467821500509
O. Ramwala, Smeet A. Dhakecha, C. Paunwala, M. Paunwala
Documents are an essential source of valuable information and knowledge, and photographs are a great way of reminiscing old memories and past events. However, it becomes difficult to preserve the quality of such ancient documents and old photographs for an extremely long time, as these images usually get damaged or creased due to various extrinsic effects. Utilizing image editing software like Photoshop to manually reconstruct such old photographs and documents is a strenuous and an enduring process. This paper attempts to leverage the generative modeling capabilities of Conditional Generative Adversarial Networks by utilizing specialized architectures for the Generator and the Discriminator. The proposed Reminiscent Net has a U-Net-based Generator with numerous feature maps for complete information transfer with the incorporation of location and contextual details, and the absence of dense layers allows utilization of diverse sized images. Implementation of the PatchGAN-based Discriminator that penalizes the image at the scale of patches has been proposed. NADAM optimizer has been implemented to enable faster and better convergence of the loss function. The proposed method produces visually appealing de-creased images, and experiments indicate that the architecture performs better than various novel approaches, both qualitatively and quantitatively.
文件是有价值的信息和知识的重要来源,照片是回忆旧记忆和过去事件的好方法。然而,由于各种外在影响,这些古老的文件和旧照片通常会受到损坏或皱褶,因此很难长时间保持其质量。利用Photoshop等图像编辑软件手动重建这些旧照片和文档是一个艰苦而持久的过程。本文试图利用条件生成对抗网络的生成建模能力,利用生成器和鉴别器的专门架构。提出的联想网络有一个基于u -Net的生成器,其中包含许多特征图,用于完整的信息传输,并结合了位置和上下文细节,并且没有密集层,可以利用不同大小的图像。提出了一种基于patchgan的判别器的实现,该判别器可以在补丁的尺度上对图像进行惩罚。为了使损失函数更快更好的收敛,实现了NADAM优化器。该方法产生了视觉上吸引人的减少图像,实验表明,该体系结构在定性和定量上都优于各种新方法。
{"title":"Reminiscent Net: Conditional GAN-based Old Image De-Creasing","authors":"O. Ramwala, Smeet A. Dhakecha, C. Paunwala, M. Paunwala","doi":"10.1142/S0219467821500509","DOIUrl":"https://doi.org/10.1142/S0219467821500509","url":null,"abstract":"Documents are an essential source of valuable information and knowledge, and photographs are a great way of reminiscing old memories and past events. However, it becomes difficult to preserve the quality of such ancient documents and old photographs for an extremely long time, as these images usually get damaged or creased due to various extrinsic effects. Utilizing image editing software like Photoshop to manually reconstruct such old photographs and documents is a strenuous and an enduring process. This paper attempts to leverage the generative modeling capabilities of Conditional Generative Adversarial Networks by utilizing specialized architectures for the Generator and the Discriminator. The proposed Reminiscent Net has a U-Net-based Generator with numerous feature maps for complete information transfer with the incorporation of location and contextual details, and the absence of dense layers allows utilization of diverse sized images. Implementation of the PatchGAN-based Discriminator that penalizes the image at the scale of patches has been proposed. NADAM optimizer has been implemented to enable faster and better convergence of the loss function. The proposed method produces visually appealing de-creased images, and experiments indicate that the architecture performs better than various novel approaches, both qualitatively and quantitatively.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126445827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Multilevel Image Thresholding Method Using the Darwinian Cuckoo Search Algorithm 基于达尔文布谷鸟搜索算法的多级图像阈值分割方法
Pub Date : 2021-03-17 DOI: 10.1142/S0219467821500522
E. Ehsaeyan, A. Zolghadrasli
Image segmentation is a prime operation to understand the content of images. Multilevel thresholding is applied in image segmentation because of its speed and accuracy. In this paper, a novel multilevel thresholding algorithm based on Cuckoo search (CS) is introduced. One of the major drawbacks of metaheuristic algorithms is the stagnation phenomenon which leads to a fall into local optimums and premature convergence. To overcome this shortcoming, the idea of Darwinian theory is incorporated with CS algorithm to increase the diversity and quality of the individuals without decreasing the convergence speed of CS algorithm. A policy of encouragement and punishment is considered to lead searching agents in the search space and reduce the computational time. The algorithm is implemented based on dividing the population into specified groups and each group tries to find a better location. Ten test images are selected to verify the ability of our algorithm using the famous energy curve method. Two popular entropies criteria, Otsu and Kapur, are employed to evaluate the capability of the introduced algorithm. Eight different search algorithms are also implemented and compared with our method. Experimental results manifest that DCS is a powerful tool for multilevel thresholding and the obtained results outperform the CS algorithm and other heuristic search methods.
图像分割是理解图像内容的基本操作。多层阈值分割因其速度快、精度高而被广泛应用于图像分割。提出了一种基于布谷鸟搜索(Cuckoo search, CS)的多层阈值分割算法。元启发式算法的主要缺点之一是停滞现象,导致陷入局部最优和过早收敛。为了克服这一缺点,CS算法中加入了达尔文理论的思想,在不降低CS算法收敛速度的前提下,增加了个体的多样性和质量。采用奖惩策略引导搜索主体进入搜索空间,减少计算时间。该算法是基于将人口划分为特定的组,每个组试图找到一个更好的位置来实现的。选取10张测试图像,利用著名的能量曲线法验证算法的能力。采用两种常用的熵准则Otsu和Kapur来评价所引入算法的性能。还实现了八种不同的搜索算法,并与我们的方法进行了比较。实验结果表明,DCS是一种强大的多级阈值分割工具,所得结果优于CS算法和其他启发式搜索方法。
{"title":"A Multilevel Image Thresholding Method Using the Darwinian Cuckoo Search Algorithm","authors":"E. Ehsaeyan, A. Zolghadrasli","doi":"10.1142/S0219467821500522","DOIUrl":"https://doi.org/10.1142/S0219467821500522","url":null,"abstract":"Image segmentation is a prime operation to understand the content of images. Multilevel thresholding is applied in image segmentation because of its speed and accuracy. In this paper, a novel multilevel thresholding algorithm based on Cuckoo search (CS) is introduced. One of the major drawbacks of metaheuristic algorithms is the stagnation phenomenon which leads to a fall into local optimums and premature convergence. To overcome this shortcoming, the idea of Darwinian theory is incorporated with CS algorithm to increase the diversity and quality of the individuals without decreasing the convergence speed of CS algorithm. A policy of encouragement and punishment is considered to lead searching agents in the search space and reduce the computational time. The algorithm is implemented based on dividing the population into specified groups and each group tries to find a better location. Ten test images are selected to verify the ability of our algorithm using the famous energy curve method. Two popular entropies criteria, Otsu and Kapur, are employed to evaluate the capability of the introduced algorithm. Eight different search algorithms are also implemented and compared with our method. Experimental results manifest that DCS is a powerful tool for multilevel thresholding and the obtained results outperform the CS algorithm and other heuristic search methods.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131271595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HDR Image Compression by Multi-Scale down Sampling of Intensity Levels 强度等级多尺度降采样的HDR图像压缩
Pub Date : 2021-03-15 DOI: 10.1142/S0219467821500480
A. Swamy, N. Shylashree
HDR images are inherently very large in size compared to normal images. Hence, storage and communication overheads of HDR images are expensive to be used in mobile devices. Hence, invariably image compression is adopted for HDR images. In this paper, HDR image compression is achieved by down sampling the intensity levels while maintaining the dynamic range same as that of the original. This aspect retains the edge information of the images almost intact. Spatial down-sampling process is used to reduce the number of intensity samples. Consequently, this operation lowers the bit depth required to store the corresponding index file which in turn results in image compression.
与普通图像相比,HDR图像本质上是非常大的。因此,在移动设备中使用HDR图像的存储和通信开销是昂贵的。因此,对于HDR图像,一律采用图像压缩。在本文中,HDR图像压缩是通过降低采样强度等级来实现的,同时保持与原始图像相同的动态范围。这方面几乎完整地保留了图像的边缘信息。采用空间降采样的方法来减少强度样本的数量。因此,该操作降低了存储相应索引文件所需的位深度,从而导致图像压缩。
{"title":"HDR Image Compression by Multi-Scale down Sampling of Intensity Levels","authors":"A. Swamy, N. Shylashree","doi":"10.1142/S0219467821500480","DOIUrl":"https://doi.org/10.1142/S0219467821500480","url":null,"abstract":"HDR images are inherently very large in size compared to normal images. Hence, storage and communication overheads of HDR images are expensive to be used in mobile devices. Hence, invariably image compression is adopted for HDR images. In this paper, HDR image compression is achieved by down sampling the intensity levels while maintaining the dynamic range same as that of the original. This aspect retains the edge information of the images almost intact. Spatial down-sampling process is used to reduce the number of intensity samples. Consequently, this operation lowers the bit depth required to store the corresponding index file which in turn results in image compression.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121581848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Real-Time Computer Vision-Based Bangla Vehicle License Plate Recognition using Contour Analysis and Prediction Algorithm 基于实时计算机视觉的孟加拉车牌轮廓分析与预测算法识别
Pub Date : 2021-03-13 DOI: 10.1142/S021946782150042X
Masud Pervej, Sabuj Das, Md. Parvez Hossain, Md. Atikuzzaman, Md. Mahin, Muhammad Aminur Rahaman
Computer vision-based recognition of Bangle vehicle license plates (LPs) is an arduous task in dirty and muddy situations. This paper proposes an efficient method for real-time computer vision-based recognition of Bangla vehicle LPs using contour analysis and prediction algorithms. The method initially applies gray scaling the input RGB images, histogram equalization to improve the grayscale image quality, edge detection using Sobel edge detector, and adaptive thresholding to convert it to a binary image. The system localizes the vehicle LP based on the maximum rectangular contour area and converts it into a predefined size. Noise removal technique using morphological dilation and erosion operation is used, followed by Gaussian filtering on binary image to improve the image quality further. The system clusters the two-lined LP into seven clusters. The sub-clustering is applied on specific clusters and makes 68 individual sub-clusters. The system extracts vector contour (VC) from each 68 individual classes. After VC extraction, the system normalizes it into a q predefined length. The system applies inter co-relation function (ICF) to categorize each sub-cluster to its previously defined individual class. For that, it calculates the maximum similarity between test and previously trained VCs. The system applies the dependency prediction algorithm in parallel to predict the whole string (district name) in the cluster-1 based on previously categorized class or classes (starting character or characters of the district part). [Formula: see text] (Metro) or (null) from cluster-2, “-” (hyphen) from cluster-3 and 6 are predicted automatically as their positions are fixed. The system is trained using 68 classes in which each class contains 100 samples generated by the augmentation technique. The system is tested using another set of 68 classes with a total of [Formula: see text] images acquiring the recognition accuracy of 96.62% with the mean computational cost of 8.363[Formula: see text]ms/f. The system is also tested using 500 vehicle whole Bangla LPs achieving the mean whole LP recognition accuracy of 95.41% with a mean computational cost of 35.803[Formula: see text]ms/f.
在肮脏泥泞的环境中,基于计算机视觉的车牌识别是一项艰巨的任务。本文提出了一种利用轮廓分析和预测算法对孟加拉车辆lp进行实时计算机视觉识别的有效方法。该方法首先对输入的RGB图像进行灰度化,通过直方图均衡化来提高灰度图像的质量,使用Sobel边缘检测器进行边缘检测,并采用自适应阈值法将其转换为二值图像。该系统根据最大矩形轮廓面积定位车辆LP,并将其转换为预定义的尺寸。采用形态扩张和侵蚀运算去噪技术,对二值图像进行高斯滤波,进一步提高图像质量。该系统将双线LP分成7个集群。子聚类应用于特定的集群,形成68个单独的子聚类。系统从每68个单独的类中提取向量轮廓(VC)。VC提取后,系统将其归一化为q个预定义长度。该系统利用ICF (inter - co-relation function)将每个子集群划分为其先前定义的单个类。为此,它计算测试和之前训练过的vc之间的最大相似度。系统并行应用依赖预测算法,根据先前分类的类(或类的起始字符或区域部分的字符),预测cluster-1中的整个字符串(区域名称)。[公式:见文本](Metro)或(null)来自cluster-2,“-”(连字符)来自cluster-3和6被自动预测,因为它们的位置是固定的。该系统使用68个类进行训练,每个类包含100个由增强技术生成的样本。采用另一组共68类[公式:见文]图像对系统进行测试,识别准确率为96.62%,平均计算代价为8.363 ms/f[公式:见文]。用500辆整车孟加拉语LP对该系统进行了测试,平均识别准确率为95.41%,平均计算成本为35.803 ms/f。
{"title":"Real-Time Computer Vision-Based Bangla Vehicle License Plate Recognition using Contour Analysis and Prediction Algorithm","authors":"Masud Pervej, Sabuj Das, Md. Parvez Hossain, Md. Atikuzzaman, Md. Mahin, Muhammad Aminur Rahaman","doi":"10.1142/S021946782150042X","DOIUrl":"https://doi.org/10.1142/S021946782150042X","url":null,"abstract":"Computer vision-based recognition of Bangle vehicle license plates (LPs) is an arduous task in dirty and muddy situations. This paper proposes an efficient method for real-time computer vision-based recognition of Bangla vehicle LPs using contour analysis and prediction algorithms. The method initially applies gray scaling the input RGB images, histogram equalization to improve the grayscale image quality, edge detection using Sobel edge detector, and adaptive thresholding to convert it to a binary image. The system localizes the vehicle LP based on the maximum rectangular contour area and converts it into a predefined size. Noise removal technique using morphological dilation and erosion operation is used, followed by Gaussian filtering on binary image to improve the image quality further. The system clusters the two-lined LP into seven clusters. The sub-clustering is applied on specific clusters and makes 68 individual sub-clusters. The system extracts vector contour (VC) from each 68 individual classes. After VC extraction, the system normalizes it into a q predefined length. The system applies inter co-relation function (ICF) to categorize each sub-cluster to its previously defined individual class. For that, it calculates the maximum similarity between test and previously trained VCs. The system applies the dependency prediction algorithm in parallel to predict the whole string (district name) in the cluster-1 based on previously categorized class or classes (starting character or characters of the district part). [Formula: see text] (Metro) or (null) from cluster-2, “-” (hyphen) from cluster-3 and 6 are predicted automatically as their positions are fixed. The system is trained using 68 classes in which each class contains 100 samples generated by the augmentation technique. The system is tested using another set of 68 classes with a total of [Formula: see text] images acquiring the recognition accuracy of 96.62% with the mean computational cost of 8.363[Formula: see text]ms/f. The system is also tested using 500 vehicle whole Bangla LPs achieving the mean whole LP recognition accuracy of 95.41% with a mean computational cost of 35.803[Formula: see text]ms/f.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"17 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125916277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Int. J. Image Graph.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1