Int. J. Image Graph.最新文献_第5页

Image Matting: A Comprehensive Survey on Techniques, Comparative Analysis, Applications and Future Scope 图像抠图:技术综述、比较分析、应用及未来前景

Int. J. Image Graph.

Pub Date : 2021-12-15 DOI: 10.1142/s0219467823500110

D. C. Lepcha, Bhawna Goyal, Ayush Dogra

In the era of rapid growth of technologies, image matting plays a key role in image and video editing along with image composition. In many significant real-world applications such as film production, it has been widely used for visual effects, virtual zoom, image translation, image editing and video editing. With recent advancements in digital cameras, both professionals and consumers have become increasingly involved in matting techniques to facilitate image editing activities. Image matting plays an important role to estimate alpha matte in the unknown region to distinguish foreground from the background region of an image using an input image and the corresponding trimap of an image which represents a foreground and unknown region. Numerous image matting techniques have been proposed recently to extract high-quality matte from image and video sequences. This paper illustrates a systematic overview of the current image and video matting techniques mostly emphasis on the current and advanced algorithms proposed recently. In general, image matting techniques have been categorized according to their underlying approaches, namely, sampling-based, propagation-based, combination of sampling and propagation-based and deep learning-based algorithms. The traditional image matting algorithms depend primarily on color information to predict alpha matte such as sampling-based, propagation-based or combination of sampling and propagation-based algorithms. However, these techniques mostly use low-level features and suffer from high-level background which tends to produce unwanted artifacts when color is same or semi-transparent in the foreground object. Image matting techniques based on deep learning have recently introduced to address the shortcomings of traditional algorithms. Rather than simply depending on the color information, it uses deep learning mechanism to estimate the alpha matte using an input image and the trimap of an image. A comprehensive survey on recent image matting algorithms and in-depth comparative analysis of these algorithms has been thoroughly discussed in this paper.

在技术飞速发展的时代，图像抠图与图像合成一样，在图像和视频编辑中起着至关重要的作用。在电影制作等许多重要的现实应用中，它已被广泛用于视觉效果、虚拟变焦、图像翻译、图像编辑和视频编辑。随着数码相机的进步，专业人士和消费者都越来越多地参与抠图技术，以促进图像编辑活动。图像抠图对未知区域的alpha抠图进行估计，利用输入图像和对应的表示前景和未知区域的三图来区分图像的前景和背景区域。为了从图像和视频序列中提取高质量的图像，最近提出了许多图像抠图技术。本文系统地概述了当前图像和视频抠图技术，重点介绍了当前和最近提出的先进算法。一般来说，图像抠图技术根据其底层方法进行分类，即基于采样的算法、基于传播的算法、基于采样与传播相结合的算法和基于深度学习的算法。传统的图像抠图算法主要依靠颜色信息来预测alpha哑光，如基于采样、基于传播或基于采样和基于传播的组合算法。然而，这些技术大多使用低级特征，并受到高级背景的影响，当前景对象的颜色相同或半透明时，往往会产生不必要的伪影。近年来，基于深度学习的图像抠图技术被引入，以解决传统算法的不足。它不是简单地依赖于颜色信息，而是使用深度学习机制来使用输入图像和图像的trimap来估计alpha哑光。本文对近年来的图像抠图算法进行了全面的综述，并对这些算法进行了深入的比较分析。

{"title":"Image Matting: A Comprehensive Survey on Techniques, Comparative Analysis, Applications and Future Scope","authors":"D. C. Lepcha, Bhawna Goyal, Ayush Dogra","doi":"10.1142/s0219467823500110","DOIUrl":"https://doi.org/10.1142/s0219467823500110","url":null,"abstract":"In the era of rapid growth of technologies, image matting plays a key role in image and video editing along with image composition. In many significant real-world applications such as film production, it has been widely used for visual effects, virtual zoom, image translation, image editing and video editing. With recent advancements in digital cameras, both professionals and consumers have become increasingly involved in matting techniques to facilitate image editing activities. Image matting plays an important role to estimate alpha matte in the unknown region to distinguish foreground from the background region of an image using an input image and the corresponding trimap of an image which represents a foreground and unknown region. Numerous image matting techniques have been proposed recently to extract high-quality matte from image and video sequences. This paper illustrates a systematic overview of the current image and video matting techniques mostly emphasis on the current and advanced algorithms proposed recently. In general, image matting techniques have been categorized according to their underlying approaches, namely, sampling-based, propagation-based, combination of sampling and propagation-based and deep learning-based algorithms. The traditional image matting algorithms depend primarily on color information to predict alpha matte such as sampling-based, propagation-based or combination of sampling and propagation-based algorithms. However, these techniques mostly use low-level features and suffer from high-level background which tends to produce unwanted artifacts when color is same or semi-transparent in the foreground object. Image matting techniques based on deep learning have recently introduced to address the shortcomings of traditional algorithms. Rather than simply depending on the color information, it uses deep learning mechanism to estimate the alpha matte using an input image and the trimap of an image. A comprehensive survey on recent image matting algorithms and in-depth comparative analysis of these algorithms has been thoroughly discussed in this paper.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129284146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Retinal Blood Vessel Extraction Using a New Enhancement Technique of Modified Convolution Filters and Sauvola Thresholding 基于改进卷积滤波和索沃拉阈值增强技术的视网膜血管提取

Int. J. Image Graph.

Pub Date : 2021-12-10 DOI: 10.1142/s0219467823500067

Hadrians Kesuma Putra, B. Suprihatin

The retinal blood vessels in humans are major components with different shapes and sizes. The extraction of the blood vessels from the retina is an important step to identify the type or nature of the pattern of the diseases in the retina. Furthermore, the retinal blood vessel was also used for diagnosis, detection, and classification. The most recent solution in this topic is to enable retinal image improvement or enhancement by a convolution filter and Sauvola threshold. In image enhancement, gamma correction is applied before filtering the retinal fundus. After that, the image should be transformed to a gray channel to enhance pictorial clarity using contrast-limited histogram equalization. For filter, this paper combines two convolution filters, namely sharpen and smooth filters. The Sauvola threshold, the morphology, and the medium filter are applied to extract blood vessels from the retinal image. This paper uses DRIVE and STARE datasets. The accuracies of the proposed method are 95.37% for DRIVE with a runtime of 1.77[Formula: see text]s and 95.17% for STARE with 2.05[Formula: see text]s runtime. Based on the result, it concludes that the proposed method is good enough to achieve average calculation parameters of a low time quality, quick, and significant.

视网膜血管是人体的主要组成部分，具有不同的形状和大小。从视网膜中提取血管是识别视网膜疾病类型或性质的重要步骤。此外，视网膜血管也被用于诊断、检测和分类。在这个主题中，最新的解决方案是通过卷积滤波器和索沃拉阈值来实现视网膜图像的改进或增强。在图像增强中，在过滤视网膜眼底之前进行伽玛校正。之后，将图像转换为灰度通道，使用对比度限制直方图均衡化来增强图像清晰度。对于滤波器，本文结合了两种卷积滤波器，即锐化滤波器和平滑滤波器。利用索沃拉阈值、形态学和介质滤波对视网膜图像进行血管提取。本文使用DRIVE和STARE数据集。对于运行时间为1.77的DRIVE和运行时间为2.05的STARE，本文方法的准确率分别为95.37%和95.17%。结果表明，该方法能够较好地实现低时间质量、快速、显著的平均计算参数。

{"title":"Retinal Blood Vessel Extraction Using a New Enhancement Technique of Modified Convolution Filters and Sauvola Thresholding","authors":"Hadrians Kesuma Putra, B. Suprihatin","doi":"10.1142/s0219467823500067","DOIUrl":"https://doi.org/10.1142/s0219467823500067","url":null,"abstract":"The retinal blood vessels in humans are major components with different shapes and sizes. The extraction of the blood vessels from the retina is an important step to identify the type or nature of the pattern of the diseases in the retina. Furthermore, the retinal blood vessel was also used for diagnosis, detection, and classification. The most recent solution in this topic is to enable retinal image improvement or enhancement by a convolution filter and Sauvola threshold. In image enhancement, gamma correction is applied before filtering the retinal fundus. After that, the image should be transformed to a gray channel to enhance pictorial clarity using contrast-limited histogram equalization. For filter, this paper combines two convolution filters, namely sharpen and smooth filters. The Sauvola threshold, the morphology, and the medium filter are applied to extract blood vessels from the retinal image. This paper uses DRIVE and STARE datasets. The accuracies of the proposed method are 95.37% for DRIVE with a runtime of 1.77[Formula: see text]s and 95.17% for STARE with 2.05[Formula: see text]s runtime. Based on the result, it concludes that the proposed method is good enough to achieve average calculation parameters of a low time quality, quick, and significant.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128799382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fusion-Based Semantic Segmentation Using Deep Learning Architecture in Case of Very Small Training Dataset 在非常小的训练数据集情况下基于深度学习架构的融合语义分割

Int. J. Image Graph.

Pub Date : 2021-11-20 DOI: 10.1142/s0219467822500437

G. R. Padalkar, M. Khambete

Semantic segmentation is a pre-processing step in computer vision-based applications. It is the task of assigning a predefined class label to every pixel of an image. Several supervised and unsupervised algorithms are available to classify pixels of an image into predefined object classes. The algorithms, such as random forest and SVM are used to obtain the semantic segmentation. Recently, convolutional neural network (CNN)-based architectures have become popular for the tasks of object detection, object recognition, and segmentation. These deep architectures perform semantic segmentation with far better accuracy than the algorithms that were used earlier. CNN-based deep learning architectures require a large dataset for training. In real life, some of the applications may not have sufficient good quality samples for training of deep learning architectures e.g. medical applications. Such a requirement initiated a need to have a technique of effective training of deep learning architecture in case of a very small dataset. Class imbalance is another challenge in the process of training deep learning architecture. Due to class imbalance, the classifier overclassifies classes with large samples. In this paper, the challenge of training a deep learning architecture with a small dataset and class imbalance is addressed by novel fusion-based semantic segmentation technique which improves segmentation of minor and major classes.

语义分割是基于计算机视觉的应用程序中的预处理步骤。它的任务是为图像的每个像素分配预定义的类标签。有几种有监督和无监督算法可用于将图像的像素分类为预定义的对象类。采用随机森林和支持向量机等算法进行语义分割。最近，基于卷积神经网络(CNN)的架构在目标检测、目标识别和分割任务中变得非常流行。这些深度架构执行语义分割的准确性远远高于之前使用的算法。基于cnn的深度学习架构需要大量的数据集进行训练。在现实生活中，一些应用程序可能没有足够的高质量样本来训练深度学习架构，例如医疗应用程序。这样的需求引发了对在非常小的数据集情况下有效训练深度学习架构的技术的需求。类不平衡是深度学习架构训练过程中的另一个挑战。由于类不平衡，分类器对大样本的类进行过分类。本文提出了一种新的基于融合的语义分割技术，该技术改进了小类和主要类的分割，解决了在小数据集和类不平衡的情况下训练深度学习架构的挑战。

{"title":"Fusion-Based Semantic Segmentation Using Deep Learning Architecture in Case of Very Small Training Dataset","authors":"G. R. Padalkar, M. Khambete","doi":"10.1142/s0219467822500437","DOIUrl":"https://doi.org/10.1142/s0219467822500437","url":null,"abstract":"Semantic segmentation is a pre-processing step in computer vision-based applications. It is the task of assigning a predefined class label to every pixel of an image. Several supervised and unsupervised algorithms are available to classify pixels of an image into predefined object classes. The algorithms, such as random forest and SVM are used to obtain the semantic segmentation. Recently, convolutional neural network (CNN)-based architectures have become popular for the tasks of object detection, object recognition, and segmentation. These deep architectures perform semantic segmentation with far better accuracy than the algorithms that were used earlier. CNN-based deep learning architectures require a large dataset for training. In real life, some of the applications may not have sufficient good quality samples for training of deep learning architectures e.g. medical applications. Such a requirement initiated a need to have a technique of effective training of deep learning architecture in case of a very small dataset. Class imbalance is another challenge in the process of training deep learning architecture. Due to class imbalance, the classifier overclassifies classes with large samples. In this paper, the challenge of training a deep learning architecture with a small dataset and class imbalance is addressed by novel fusion-based semantic segmentation technique which improves segmentation of minor and major classes.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122701292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Machine Learning with Data Science-Enabled Lung Cancer Diagnosis and Classification Using Computed Tomography Images 机器学习与数据科学支持肺癌诊断和分类使用计算机断层扫描图像

Int. J. Image Graph.

Pub Date : 2021-11-10 DOI: 10.1142/s0219467822400022

S. Kiran, Inderjeet Kaur, K. Thangaraj, V. Saveetha, R. Grace, N. Arulkumar

In recent times, the healthcare industry has been generating a significant amount of data in distinct formats, such as electronic health records (EHR), clinical trials, genetic data, payments, scientific articles, wearables, and care management databases. Data science is useful for analysis (pattern recognition, hypothesis testing, risk valuation) and prediction. The major, primary usage of data science in the healthcare domain is in medical imaging. At the same time, lung cancer diagnosis has become a hot research topic, as automated disease detection poses numerous benefits. Although numerous approaches have existed in the literature for lung cancer diagnosis, the design of a novel model to automatically identify lung cancer is a challenging task. In this view, this paper designs an automated machine learning (ML) with data science-enabled lung cancer diagnosis and classification (MLDS-LCDC) using computed tomography (CT) images. The presented model initially employs Gaussian filtering (GF)-based pre-processing technique on the CT images collected from the lung cancer database. Besides, they are fed into the normalized cuts (Ncuts) technique where the nodule in the pre-processed image can be determined. Moreover, the oriented FAST and rotated BRIEF (ORB) technique is applied as a feature extractor. At last, sunflower optimization-based wavelet neural network (SFO-WNN) model is employed for the classification of lung cancer. In order to examine the diagnostic outcome of the MLDS-LCDC model, a set of experiments were carried out and the results are investigated in terms of different aspects. The resultant values demonstrated the effectiveness of the MLDS-LCDC model over the other state-of-the-art methods with the maximum sensitivity of 97.01%, specificity of 98.64%, and accuracy of 98.11%.

最近，医疗保健行业一直在以不同格式生成大量数据，例如电子健康记录(EHR)、临床试验、基因数据、支付、科学文章、可穿戴设备和护理管理数据库。数据科学对分析(模式识别、假设检验、风险评估)和预测很有用。数据科学在医疗保健领域的主要用途是医学成像。与此同时，肺癌诊断已成为一个热门的研究课题，因为自动化疾病检测带来了许多好处。虽然文献中已有许多方法用于肺癌诊断，但设计一种新的模型来自动识别肺癌是一项具有挑战性的任务。在这种观点下，本文设计了一个使用计算机断层扫描(CT)图像的具有数据科学支持的肺癌诊断和分类(MLDS-LCDC)的自动机器学习(ML)。该模型首先采用基于高斯滤波(GF)的预处理技术对肺癌数据库中的CT图像进行预处理。此外，它们被输入到归一化切割(Ncuts)技术中，可以确定预处理图像中的结节。此外，采用定向FAST和旋转BRIEF (ORB)技术作为特征提取器。最后，采用基于向日葵优化的小波神经网络(SFO-WNN)模型对肺癌进行分类。为了检验MLDS-LCDC模型的诊断效果，我们进行了一组实验，并从不同方面对结果进行了研究。结果表明，MLDS-LCDC模型的灵敏度为97.01%，特异性为98.64%，准确度为98.11%，优于其他最先进的方法。

{"title":"Machine Learning with Data Science-Enabled Lung Cancer Diagnosis and Classification Using Computed Tomography Images","authors":"S. Kiran, Inderjeet Kaur, K. Thangaraj, V. Saveetha, R. Grace, N. Arulkumar","doi":"10.1142/s0219467822400022","DOIUrl":"https://doi.org/10.1142/s0219467822400022","url":null,"abstract":"In recent times, the healthcare industry has been generating a significant amount of data in distinct formats, such as electronic health records (EHR), clinical trials, genetic data, payments, scientific articles, wearables, and care management databases. Data science is useful for analysis (pattern recognition, hypothesis testing, risk valuation) and prediction. The major, primary usage of data science in the healthcare domain is in medical imaging. At the same time, lung cancer diagnosis has become a hot research topic, as automated disease detection poses numerous benefits. Although numerous approaches have existed in the literature for lung cancer diagnosis, the design of a novel model to automatically identify lung cancer is a challenging task. In this view, this paper designs an automated machine learning (ML) with data science-enabled lung cancer diagnosis and classification (MLDS-LCDC) using computed tomography (CT) images. The presented model initially employs Gaussian filtering (GF)-based pre-processing technique on the CT images collected from the lung cancer database. Besides, they are fed into the normalized cuts (Ncuts) technique where the nodule in the pre-processed image can be determined. Moreover, the oriented FAST and rotated BRIEF (ORB) technique is applied as a feature extractor. At last, sunflower optimization-based wavelet neural network (SFO-WNN) model is employed for the classification of lung cancer. In order to examine the diagnostic outcome of the MLDS-LCDC model, a set of experiments were carried out and the results are investigated in terms of different aspects. The resultant values demonstrated the effectiveness of the MLDS-LCDC model over the other state-of-the-art methods with the maximum sensitivity of 97.01%, specificity of 98.64%, and accuracy of 98.11%.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122498734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Deep Learning-Based Classification of Malignant and Benign Cells in Dermatoscopic Images via Transfer Learning Approach 基于迁移学习方法的皮肤镜图像中恶性和良性细胞深度学习分类

Int. J. Image Graph.

Pub Date : 2021-11-08 DOI: 10.1142/s0219467822500413

V. Kumar, V. Mishra, Monika Arora

The inhibition of healthy cells creating improper controlling process of the human body system indicates the occurrence of growth of cancerous cells. The cluster of such cells leads to the development of tumor. The observation of this type of abnormal skin pigmentation is done using an effective tool called Dermoscopy. However, these dermatoscopic images possess a great challenge for diagnosis. Considering the characteristics of dermatoscopic images, transfer learning is an appropriate approach of automatically classifying the images based on the respective categories. An automatic identification of skin cancer not only saves human life but also helps in detecting its growth at an earlier stage which saves medical practitioner’s effort and time. A newly predicted model has been proposed for classifying the skin cancer as benign or malignant by DCNN with transfer learning and its pre-trained models such as VGG 16, VGG 19, ResNet 50, ResNet 101, and Inception V3. The proposed methodology aims at examining the efficiency of pre-trained models and transfer learning approach for the classification tasks and opens new dimensions of research in the field of medicines using imaging technique which can be implementable in real-time applications.

对健康细胞的抑制导致人体系统控制过程不正常，表明癌细胞的生长发生。这种细胞的聚集导致肿瘤的发生。观察这种类型的异常皮肤色素沉着是使用一种有效的工具，称为皮肤镜检查。然而，这些皮肤镜图像对诊断具有很大的挑战。考虑到皮肤镜图像的特点，迁移学习是一种基于各自类别的图像自动分类的合适方法。自动识别皮肤癌不仅可以挽救人类的生命，而且有助于在早期发现皮肤癌的生长，节省医生的精力和时间。通过迁移学习及其预训练模型(如VGG 16、VGG 19、ResNet 50、ResNet 101和Inception V3)，提出了一种新的预测模型，用于将皮肤癌分类为良性或恶性。所提出的方法旨在检查分类任务的预训练模型和迁移学习方法的效率，并为使用可在实时应用中实现的成像技术在医学领域的研究开辟了新的维度。

{"title":"Deep Learning-Based Classification of Malignant and Benign Cells in Dermatoscopic Images via Transfer Learning Approach","authors":"V. Kumar, V. Mishra, Monika Arora","doi":"10.1142/s0219467822500413","DOIUrl":"https://doi.org/10.1142/s0219467822500413","url":null,"abstract":"The inhibition of healthy cells creating improper controlling process of the human body system indicates the occurrence of growth of cancerous cells. The cluster of such cells leads to the development of tumor. The observation of this type of abnormal skin pigmentation is done using an effective tool called Dermoscopy. However, these dermatoscopic images possess a great challenge for diagnosis. Considering the characteristics of dermatoscopic images, transfer learning is an appropriate approach of automatically classifying the images based on the respective categories. An automatic identification of skin cancer not only saves human life but also helps in detecting its growth at an earlier stage which saves medical practitioner’s effort and time. A newly predicted model has been proposed for classifying the skin cancer as benign or malignant by DCNN with transfer learning and its pre-trained models such as VGG 16, VGG 19, ResNet 50, ResNet 101, and Inception V3. The proposed methodology aims at examining the efficiency of pre-trained models and transfer learning approach for the classification tasks and opens new dimensions of research in the field of medicines using imaging technique which can be implementable in real-time applications.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"275 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134280055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Thermal Blended Facial Expression Analysis and Recognition System Using Deformed Thermal Facial Areas 基于热变形面部区域的热混合面部表情分析与识别系统

Int. J. Image Graph.

Pub Date : 2021-11-06 DOI: 10.1142/s0219467822500498

P. Saha, D. Bhattacharjee, B. K. De, M. Nasipuri

There are many research works in visible as well as thermal facial expression analysis and recognition. Several facial expression databases have been designed in both modalities. However, little attention has been given for analyzing blended facial expressions in the thermal infrared spectrum. In this paper, we have introduced a Visual-Thermal Blended Facial Expression Database (VTBE) that contains visual and thermal face images with both basic and blended facial expressions. The database contains 12 posed blended facial expressions and spontaneous six basic facial expressions in both modalities. In this paper, we have proposed Deformed Thermal Facial Area (DTFA) in thermal expressive face image and make an analysis to differentiate between basic and blended expressions using DTFA. Here, the fusion of DTFA and Deformed Visual Facial Area (DVFA) has been proposed combining the features of both modalities and experiments and has been conducted on this new database. However, to show the effectiveness of our proposed approach, we have compared our method with state-of-the-art methods using USTC-NVIE database. Experiment results reveal that our approach is superior to state-of-the-art methods.

在可见和热态面部表情分析与识别方面有很多研究工作。在这两种模式下已经设计了几个面部表情数据库。然而，对混合面部表情的热红外光谱分析却很少受到重视。在本文中，我们介绍了一个视觉-热混合面部表情数据库(VTBE)，该数据库包含具有基本面部表情和混合面部表情的视觉和热面部图像。该数据库包含12种姿势的混合面部表情和自发的6种基本面部表情。本文提出了热表达人脸图像中的变形热面面积(DTFA)，并利用该方法对基本表情和混合表情进行了区分分析。在此，结合两种模式和实验的特点，提出了DTFA和变形视觉面部区域(DVFA)的融合，并在这个新的数据库上进行了。然而，为了证明我们提出的方法的有效性，我们使用USTC-NVIE数据库将我们的方法与最先进的方法进行了比较。实验结果表明，我们的方法优于最先进的方法。

{"title":"A Thermal Blended Facial Expression Analysis and Recognition System Using Deformed Thermal Facial Areas","authors":"P. Saha, D. Bhattacharjee, B. K. De, M. Nasipuri","doi":"10.1142/s0219467822500498","DOIUrl":"https://doi.org/10.1142/s0219467822500498","url":null,"abstract":"There are many research works in visible as well as thermal facial expression analysis and recognition. Several facial expression databases have been designed in both modalities. However, little attention has been given for analyzing blended facial expressions in the thermal infrared spectrum. In this paper, we have introduced a Visual-Thermal Blended Facial Expression Database (VTBE) that contains visual and thermal face images with both basic and blended facial expressions. The database contains 12 posed blended facial expressions and spontaneous six basic facial expressions in both modalities. In this paper, we have proposed Deformed Thermal Facial Area (DTFA) in thermal expressive face image and make an analysis to differentiate between basic and blended expressions using DTFA. Here, the fusion of DTFA and Deformed Visual Facial Area (DVFA) has been proposed combining the features of both modalities and experiments and has been conducted on this new database. However, to show the effectiveness of our proposed approach, we have compared our method with state-of-the-art methods using USTC-NVIE database. Experiment results reveal that our approach is superior to state-of-the-art methods.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"28 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113967944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Image Copy-Move Forgery Detection Using Combination of Scale-Invariant Feature Transform and Local Binary Pattern Features 结合尺度不变特征变换和局部二值模式特征的图像复制移动伪造检测

Int. J. Image Graph.

Pub Date : 2021-11-03 DOI: 10.1142/s0219467822500486

Marziye Shahrokhi, Alireza Akoushideh, A. Shahbahrami

Today, manipulating, storing, and sending digital images are simple and easy because of the development of digital imaging devices from hardware and software points of view. Digital images are used in different contexts of people’s lives such as news, forensics, and so on. Therefore, the reliability of received images is a question that often occupies the viewer’s mind and the authenticity of digital images is increasingly important. Detecting a forged image as a genuine one as well as detecting a genuine image as a forged one can sometimes have irreparable consequences. For example, an image that is available from the scene of a crime can lead to a wrong decision if it is detected incorrectly. In this paper, we propose a combination method to improve the accuracy of copy–move forgery detection (CMFD) reducing the false positive rate (FPR) based on texture attributes. The proposed method uses a combination of the scale-invariant feature transform (SIFT) and local binary pattern (LBP). Consideration of texture features around the keypoints detected by the SIFT algorithm can be effective to reduce the incorrect matches and improve the accuracy of CMFD. In addition, to find more and better keypoints some pre-processing methods have been proposed. This study was evaluated on the COVERAGE, GRIP, and MICC-F220 databases. Experimental results show that the proposed method without clustering or segmentation and only with simple matching operations, has been able to earn the true positive rates of 98.75%, 95.45%, and 87% on the GRIP, MICC-F220, and COVERAGE datasets, respectively. Also, the proposed method, with FPRs from 17.75% to 3.75% on the GRIP dataset, has been able to achieve the best results.

今天，由于从硬件和软件的角度来看，数字成像设备的发展，操作、存储和发送数字图像变得简单易行。数字图像用于人们生活的不同环境，如新闻、法医等。因此，接收图像的可靠性是一个经常占据观众脑海的问题，数字图像的真实性也越来越重要。将伪造图像检测为真实图像以及将真实图像检测为伪造图像有时会产生不可挽回的后果。例如，从犯罪现场获得的图像如果被错误地检测到，可能会导致错误的决策。本文提出了一种基于纹理属性降低误报率的组合方法来提高复制-移动伪造检测(CMFD)的准确性。该方法将尺度不变特征变换(SIFT)和局部二值模式(LBP)相结合。考虑SIFT算法检测到的关键点周围的纹理特征可以有效地减少错误匹配，提高CMFD的精度。此外，为了找到更多更好的关键点，本文还提出了一些预处理方法。本研究在COVERAGE、GRIP和mic - f220数据库上进行评估。实验结果表明，该方法在GRIP、mic - f220和COVERAGE数据集上的真阳性率分别为98.75%、95.45%和87%，不需要聚类和分割，只需要进行简单的匹配操作。该方法在GRIP数据集上的fpr为17.75% ~ 3.75%，达到了最佳效果。

{"title":"Image Copy-Move Forgery Detection Using Combination of Scale-Invariant Feature Transform and Local Binary Pattern Features","authors":"Marziye Shahrokhi, Alireza Akoushideh, A. Shahbahrami","doi":"10.1142/s0219467822500486","DOIUrl":"https://doi.org/10.1142/s0219467822500486","url":null,"abstract":"Today, manipulating, storing, and sending digital images are simple and easy because of the development of digital imaging devices from hardware and software points of view. Digital images are used in different contexts of people’s lives such as news, forensics, and so on. Therefore, the reliability of received images is a question that often occupies the viewer’s mind and the authenticity of digital images is increasingly important. Detecting a forged image as a genuine one as well as detecting a genuine image as a forged one can sometimes have irreparable consequences. For example, an image that is available from the scene of a crime can lead to a wrong decision if it is detected incorrectly. In this paper, we propose a combination method to improve the accuracy of copy–move forgery detection (CMFD) reducing the false positive rate (FPR) based on texture attributes. The proposed method uses a combination of the scale-invariant feature transform (SIFT) and local binary pattern (LBP). Consideration of texture features around the keypoints detected by the SIFT algorithm can be effective to reduce the incorrect matches and improve the accuracy of CMFD. In addition, to find more and better keypoints some pre-processing methods have been proposed. This study was evaluated on the COVERAGE, GRIP, and MICC-F220 databases. Experimental results show that the proposed method without clustering or segmentation and only with simple matching operations, has been able to earn the true positive rates of 98.75%, 95.45%, and 87% on the GRIP, MICC-F220, and COVERAGE datasets, respectively. Also, the proposed method, with FPRs from 17.75% to 3.75% on the GRIP dataset, has been able to achieve the best results.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"93 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126051051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Machine Learning Techniques for Identifying Fetal Risk During Pregnancy 孕期胎儿风险识别的机器学习技术

Int. J. Image Graph.

Pub Date : 2021-10-27 DOI: 10.1142/s0219467822500450

S. Ravikumar, E. Kannan

Cardiotocography (CTG) is a biophysical method for assessing fetal condition that primarily relies on the recording and automated analysis of fetal heart activity. The quantitative description of the CTG signals is provided by computerized fetal monitoring systems. Even though effective conclusion generation methods for decision process support are still required to find out the fetal risk such as premature embryo, this proposed method and outcome data can confirm the assessment of the fetal state after birth. Low birth weight is quite possibly the main attribute that significantly depicts an unusual fetal result. These expectations are assessed in a constant experimental decision support system, providing valuable information that can be used to obtain additional information about the fetal state using machine learning techniques. The advancements in modern obstetric practice enabled the use of numerous reliable and robust machine learning approaches in classifying fetal heart rate signals. The Naïve Bayes (NB) classifier, support vector machine (SVM), decision trees (DT), and random forest (RF) are used in the proposed method. To assess these outcomes in the proposed method, some of the metrics such as precision, accuracy, F1 score, recall, sensitivity, logarithmic loss and mean absolute error have been taken. The above mentioned metrics will be helpful to predict the fetal risk.

心脏造影(CTG)是一种评估胎儿状况的生物物理方法，主要依赖于胎儿心脏活动的记录和自动分析。计算机胎儿监测系统提供CTG信号的定量描述。尽管尚需要有效的结论生成方法来支持决策过程，以发现早产等胎儿风险，但本文提出的方法和结局数据可以证实胎儿出生后状态的评估。低出生体重很可能是显著描述不寻常胎儿结果的主要属性。这些期望在一个恒定的实验决策支持系统中进行评估，提供有价值的信息，可用于使用机器学习技术获得有关胎儿状态的额外信息。现代产科实践的进步使得使用许多可靠和强大的机器学习方法来分类胎儿心率信号成为可能。该方法使用了Naïve贝叶斯(NB)分类器、支持向量机(SVM)、决策树(DT)和随机森林(RF)。为了评估所提出方法的这些结果，采用了一些指标，如精密度、准确度、F1分数、召回率、灵敏度、对数损失和平均绝对误差。上述指标将有助于预测胎儿风险。

{"title":"Machine Learning Techniques for Identifying Fetal Risk During Pregnancy","authors":"S. Ravikumar, E. Kannan","doi":"10.1142/s0219467822500450","DOIUrl":"https://doi.org/10.1142/s0219467822500450","url":null,"abstract":"Cardiotocography (CTG) is a biophysical method for assessing fetal condition that primarily relies on the recording and automated analysis of fetal heart activity. The quantitative description of the CTG signals is provided by computerized fetal monitoring systems. Even though effective conclusion generation methods for decision process support are still required to find out the fetal risk such as premature embryo, this proposed method and outcome data can confirm the assessment of the fetal state after birth. Low birth weight is quite possibly the main attribute that significantly depicts an unusual fetal result. These expectations are assessed in a constant experimental decision support system, providing valuable information that can be used to obtain additional information about the fetal state using machine learning techniques. The advancements in modern obstetric practice enabled the use of numerous reliable and robust machine learning approaches in classifying fetal heart rate signals. The Naïve Bayes (NB) classifier, support vector machine (SVM), decision trees (DT), and random forest (RF) are used in the proposed method. To assess these outcomes in the proposed method, some of the metrics such as precision, accuracy, F1 score, recall, sensitivity, logarithmic loss and mean absolute error have been taken. The above mentioned metrics will be helpful to predict the fetal risk.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131967935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Firefly Algorithm Optimized Functional Link Artificial Neural Network for ISA-Radar Image Recognition 萤火虫算法优化的功能链接人工神经网络ISA-Radar图像识别

Int. J. Image Graph.

Pub Date : 2021-10-25 DOI: 10.1142/s0219467822500449

Asma Elyounsi, H. Tlijani, M. Bouhlel

Traditional neural networks are very diverse and have been used during the last decades in the fields of data classification. These networks like MLP, back propagation neural networks (BPNN) and feed forward network have shown inability to scale with problem size and with the slow convergence rate. So in order to overcome these numbers of drawbacks, the use of higher order neural networks (HONNs) becomes the solution by adding input units along with a stronger functioning of other neural units in the network and transforms easily these input units to hidden layers. In this paper, a new metaheuristic method, Firefly (FFA), is applied to calculate the optimal weights of the Functional Link Artificial Neural Network (FLANN) by using the flashing behavior of fireflies in order to classify ISA-Radar target. The average classification result of FLANN-FFA which reached 96% shows the efficiency of the process compared to other tested methods.

传统的神经网络是非常多样化的，在过去的几十年里一直被用于数据分类领域。MLP、bp神经网络(back propagation neural networks, BPNN)和前馈网络(feed - forward network)等网络在问题规模和收敛速度方面表现出无法扩展的特点。因此，为了克服这些缺点，使用高阶神经网络(honn)成为解决方案，通过添加输入单元以及网络中其他神经单元的更强功能，并轻松地将这些输入单元转换为隐藏层。为了对ISA-Radar目标进行分类，提出了一种新的元启发式方法Firefly (FFA)，利用萤火虫的闪烁行为计算功能链路人工神经网络(FLANN)的最优权值。与其他测试方法相比，FLANN-FFA的平均分类结果达到96%，表明该方法的效率较高。

引用次数: 1

Comparative Analysis of Different Data Replication Strategies in Cloud Environment 云环境下不同数据复制策略的比较分析

Int. J. Image Graph.

Pub Date : 2021-10-23 DOI: 10.1142/s0219467822500425

K. Sasikumar, B. Vijayakumar

In this paper, we performed a comparative study of the different data replication strategies such as Adaptive Data Replication Strategy (ADRS), Dynamic Cost Aware Re-Replication and Rebalancing Strategy (DCR2S) and Efficient Placement Algorithm (EPA) in the cloud environment. The implementation of these three techniques is done in JAVA and the performance analysis is conducted to study the performance of those replication techniques by various parameters. The parameters used for the performance analysis of these three techniques are Load Variance, Response Time, Probability of File Availability, System Byte Effective Rate (SBER), Latency, and Fault Ratio. From the analysis, it is evaluated that by varying the number of file replicas, it shows deviations in the outcomes of these parameters. The comparative results were also analyzed.

在本文中，我们对不同的数据复制策略进行了比较研究，如自适应数据复制策略(ADRS)、动态成本感知重复制和再平衡策略(DCR2S)和高效放置算法(EPA)在云环境中。在JAVA中实现了这三种技术，并通过各种参数对这些复制技术的性能进行了性能分析。这三种技术的性能分析参数包括负载变化、响应时间、文件可用概率、SBER (System Byte Effective Rate)、时延和故障率。从分析中可以评估，通过改变文件副本的数量，可以显示这些参数的结果的偏差。并对对比结果进行了分析。

引用次数: 0