This paper presents an automated mango grading system with four stages: (1) pre-processing, (2) feature extraction, (3) optimal feature selection and (4) classification. First, the input image undergoes pre-processing, which covers reading, sizing, noise removal and segmentation. Features are then extracted from the pre-processed image. To make the system more effective, the optimal features are selected from the extracted set using a new hybrid optimization algorithm, the lion assisted firefly algorithm (LA-FF), which combines the lion algorithm (LA) and the firefly algorithm (FF). The optimal features are then passed to the classification stage, where an optimized deep convolutional neural network (CNN) is deployed. As a major contribution, the CNN configuration is fine-tuned by selecting the optimal number of convolutional layers, with the aim of improving the classification accuracy of the grading system. The LA-FF algorithm is also used for this fine-tuning, so that the classifier is optimized. Grading is evaluated on healthy-diseased, ripe-unripe and big-medium-very big cases with respect to type I and type II measures, and the performance of the proposed grading model is compared with other state-of-the-art models.
{"title":"Optimized deep learning model for mango grading: Hybridizing lion plus firefly algorithm","authors":"M. Tripathi, Dhananjay D. Maktedar","doi":"10.1049/IPR2.12163","DOIUrl":"https://doi.org/10.1049/IPR2.12163","url":null,"abstract":"This paper intends to present an automated mango grading system under four stages (1) pre-processing, (2) feature extraction, (3) optimal feature selection and (4) classification. Initially, the input image is subjected to the pre-processing phase, where the reading, sizing, noise removal and segmentation process happens. Subsequently, the features are extracted from the pre-processed image. To make the system more effective, from the extracted features, the optimal features are selected using a new hybrid optimization algorithm termed the lion assisted firefly algorithm (LA-FF), which is the combination of LA and FF, respectively. Then, the optimal features are given for the classification process, where the optimized deep convolutional neural network (CNN) is deployed. As a major contribution, the configuration of CNN is fine-tuned via selecting the optimal count of convolutional layers. This obviously enhances the classification accuracy in grading system. For fine-tuning the convolutional layers in the deep CNN, the LA-FF algorithm is used so that the classifier is optimized. 
The grading is evaluated on the basis of healthy-diseased, ripe-unripe and big-medium-very big cases with respect to type I and type II measures and the performance of the proposed grading model is compared over the other state-of-the-art models.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"16 1","pages":"1940-1956"},"PeriodicalIF":0.0,"publicationDate":"2021-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79318502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Low-light image enhancement is rapidly gaining research attention due to the increasing demands of extreme visual tasks in various applications. Although numerous methods exist to enhance image quality in low light, it remains unclear how to trade off between human observation and computer vision processing. In this work, an effective generative adversarial network structure is proposed for low-light image enhancement, comprising both the densely residual block (DRB) and the enhancing block (EB). Specifically, the proposed end-to-end image enhancement method, consisting of a generator and a discriminator, is trained using the hyper loss function. The DRB adopts residual and dense skip connections to connect and enhance the features extracted from different depths in the network, while the EB receives unique multi-scale features to ensure feature diversity. Additionally, increasing the feature sizes allows the discriminator to further distinguish between fake and real images at the patch level. The merits of the loss function are also studied with respect to recovering both contextual and local details. Extensive experimental results show that the method is capable of dealing with extremely low-light scenes and that the realistic feature generator outperforms several state-of-the-art methods in a number of qualitative and quantitative evaluation tests.
{"title":"Generative adversarial network for low-light image enhancement","authors":"Fei Li, Jiangbin Zheng, Yuan-fang Zhang","doi":"10.1049/IPR2.12124","DOIUrl":"https://doi.org/10.1049/IPR2.12124","url":null,"abstract":"Low-light image enhancement is rapidly gaining research attention due to the increasing demands of extreme visual tasks in various applications. Although numerous methods exist to enhance image qualities in low light, it is still undetermined how to trade-off between the human observation and computer vision processing. In this work, an effective generative adversarial network structure is proposed comprising both the densely residual block (DRB) and the enhancing block (EB) for low-light image enhancement. Specifically, the proposed end-to-end image enhancement method, consisting of a generator and a discriminator, is trained using the hyper loss function. The DRB adopts the residual and dense skip connections to connect and enhance the features extracted from different depths in the network while the EB receives unique multi-scale features to ensure feature diversity. Additionally, increasing the feature sizes allows the discriminator to further distinguish between fake and real images from the patch levels. The merits of the loss function are also studied to recover both contextual and local details. 
Extensive experimental results show that our method is capable of dealing with extremely low-light scenes and the realistic feature generator outperforms several state-of-the-art methods in a number of qualitative and quantitative evaluation tests.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"41 1","pages":"1542-1552"},"PeriodicalIF":0.0,"publicationDate":"2021-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76151305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The enhancement of light-defect images, such as extremely low-light, low-light and dim-light images, has always been a research hotspot. Most existing methods perform well under specific illuminations, but there is much room for improvement in processing light-defect images with different illuminations. Therefore, this study proposes an efficient framework based on deep learning to enhance various light-defect images. The proposed framework estimates the reflectance component and the illumination component. Next, we propose a generator guided by an attention mechanism in the reflectance part to repair the light defects in dark regions. In addition, we design a colour loss function to address colour distortion in the enhanced images. Finally, the illumination map of the light-defect images is adjusted adaptively. Extensive experiments demonstrate that our method can not only deal with images with different illuminations but also produce enhanced images with clearer details and richer colours. We also show its superiority by comparing it with state-of-the-art methods in both visual quality comparison and quantitative comparison on various datasets and real-world images.
{"title":"An efficient framework for deep learning-based light-defect image enhancement","authors":"Chengxu Ma, Daihui Li, Shangyou Zeng, Junbo Zhao, Hongyang Chen","doi":"10.1049/IPR2.12125","DOIUrl":"https://doi.org/10.1049/IPR2.12125","url":null,"abstract":"The enhancement of light-defect images such as extremely low-light, low-light and dim-light has always been a research hotspot. Most of the existing methods are excellent in specific illuminations, and there is much room for improvement in processing light-defect images with different illuminations. Therefore, this study proposes an efficient framework based on deep learning to enhance various light-defect images. The proposed framework estimates the reflectance component and illumination component. Next, we propose a generator guided by an attention mechanism in the reflectance part to repair the light-defect in the dark. In addition, we design a colour loss function for the problem of colour distortion in the enhanced images. Finally, the illumination map of the light-defect images is adjusted adaptively. Extensive experiments are conducted to demonstrate that our method can not only deal with the images with different illuminations but also enhance the images with clearer details and richer colours. 
At the same time, we prove its superiority by comparing it with state-of-the-art methods under both visual quality comparison and quantitative comparison of various datasets and real-world images.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"31 1","pages":"1553-1566"},"PeriodicalIF":0.0,"publicationDate":"2021-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85883502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
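The reflectance/illumination split described in the abstract above can be illustrated with a minimal sketch. It assumes the classical Retinex model (pixel = reflectance × illumination), takes the illumination as the channel maximum, and uses gamma correction as the adaptive adjustment. All three choices, and the function names, are illustrative assumptions; the paper's framework estimates these components with a learned network, which is not reproduced here.

```python
def decompose(pixel, eps=1e-6):
    """Split an RGB pixel (values in [0, 1]) into reflectance and illumination.

    Assumption: Retinex model I = R * L, with L taken as the channel maximum.
    """
    illumination = max(max(pixel), eps)  # eps avoids division by zero on black pixels
    reflectance = tuple(channel / illumination for channel in pixel)
    return reflectance, illumination


def adjust_illumination(illumination, gamma=0.5):
    """Brighten a dark illumination value; gamma < 1 lifts low values the most."""
    return illumination ** gamma


def enhance(pixel, gamma=0.5):
    """Recombine the unchanged reflectance with the adjusted illumination."""
    reflectance, illumination = decompose(pixel)
    new_illumination = adjust_illumination(illumination, gamma)
    return tuple(min(1.0, r * new_illumination) for r in reflectance)
```

Because only the illumination is changed, the ratios between colour channels are preserved, which is the usual motivation for enhancing in a decomposed domain.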
Segmenting natural images with high intensity inhomogeneity and complex background scenes using the level set method can be very challenging. This work proposes a new synthesis level set method for robust image segmentation based on the combination of Retinex-corrected saliency region information and edge information. First, Retinex theory is introduced to correct the saliency information extraction. Second, the Retinex-corrected saliency information is embedded into the level set method, because saliency makes a foreground object stand out relative to the background. Combined with the edge information, this makes the segmentation boundary more precise and smooth. Experiments indicate that the proposed segmentation algorithm is efficient, fast, reliable, and robust.
{"title":"Level set method with Retinex-corrected saliency embedded for image segmentation","authors":"Dongmei Liu, F. Chang, Huaxiang Zhang, Li Liu","doi":"10.1049/IPR2.12123","DOIUrl":"https://doi.org/10.1049/IPR2.12123","url":null,"abstract":"It can be a very challenging task when using level set method segmenting natural images with high intensity inhomogeneity and complex background scenes. A new synthesis level set method for robust image segmentation based on the combination of Retinex-corrected saliency region information and edge information is proposed in this work. First, the Retinex theory is introduced to correct the saliency information extraction. Second, the Retinex-corrected saliency information is embedded into the level set method due to its advantageous quality which makes a foreground object stand out relative to the backgrounds. Combined with the edge information, the boundary of segmentation will be more precise and smooth. Experiments indicate that the proposed segmentation algorithm is efficient, fast, reliable, and robust.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"80 1","pages":"1530-1541"},"PeriodicalIF":0.0,"publicationDate":"2021-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76828598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Tavakoli, A. Mehdizadeh, Reza Pourreza-Shahri, J. Dehmeshki
Retinal blood vessel segmentation and analysis is critical for the computer-aided diagnosis of diseases such as diabetic retinopathy. This study presents an automated unsupervised method for segmenting the retinal vasculature based on hybrid methods. The algorithm first applies a preprocessing step using morphological operators to enhance the vessel tree structure against a non-uniform image background. The main processing applies the Radon transform to overlapping windows, followed by vessel validation, vessel refinement and vessel reconstruction to achieve the final segmentation. The method was tested on three publicly available datasets and a local database comprising a total of 188 images. Segmentation performance was evaluated using three measures: accuracy, receiver operating characteristic (ROC) analysis, and the structural similarity index. ROC analysis resulted in area under curve values of 97.39%, 97.01%, and 97.12% for DRIVE, STARE, and CHASE-DB1, respectively. The accuracy values were 0.9688, 0.9646, and 0.9475 for the same datasets. Finally, the structural similarity index was computed for all four datasets, with average values of 0.9650 (DRIVE), 0.9641 (STARE), and 0.9625 (CHASE-DB1). These results are comparable with the best published results to date, exceeding their performance for several of the datasets; similar performance is found using accuracy.
{"title":"Unsupervised automated retinal vessel segmentation based on Radon line detector and morphological reconstruction","authors":"M. Tavakoli, A. Mehdizadeh, Reza Pourreza-Shahri, J. Dehmeshki","doi":"10.1049/IPR2.12119","DOIUrl":"https://doi.org/10.1049/IPR2.12119","url":null,"abstract":"Retinal blood vessel segmentation and analysis is critical for the computer-aided diagnosis of different diseases such as diabetic retinopathy. This study presents an automated unsupervised method for segmenting the retinal vasculature based on hybrid methods. The algorithm initially applies a preprocessing step using morphological operators to enhance the vessel tree structure against a non-uniform image background. The main processing applies the Radon transform to overlapping windows, followed by vessel validation, vessel refinement and vessel reconstruction to achieve the final segmentation. The method was tested on three publicly available datasets and a local database comprising a total of 188 images. Segmentation performance was evaluated using three measures: accuracy, receiver operating characteristic (ROC) analysis, and the structural similarity index. ROC analysis resulted in area under curve values of 97.39%, 97.01%, and 97.12%, for the DRIVE, STARE, and CHASE-DB1, respectively. Also, the results of accuracy were 0.9688, 0.9646, and 0.9475 for the same datasets. Finally, the average values of structural similarity index were computed for all four datasets, with average values of 0.9650 (DRIVE), 0.9641 (STARE), and 0.9625 (CHASE-DB1). 
These results compare with the best published results to date, exceeding their performance for several of the datasets; similar performance is found using accuracy.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"31 1","pages":"1484-1498"},"PeriodicalIF":0.0,"publicationDate":"2021-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91246397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nikunja Bihari Kar, D. Nayak, Korra Sathya Babu, Yudong Zhang
Facial expression recognition has been a long-standing problem in the field of computer vision. This paper proposes a new, simple scheme for effective recognition of facial expressions based on a hybrid feature descriptor and an improved classifier. Inspired by the success of the stationary wavelet transform in many computer vision tasks, the stationary wavelet transform is first applied to the pre-processed face image. Pyramid of histograms of oriented gradients features are then computed from the low-frequency stationary wavelet transform coefficients to capture more prominent details from facial images. The key idea of this hybrid feature descriptor is to exploit both spatial and frequency domain features that are also robust against illumination and noise. The relevant features are subsequently determined using linear discriminant analysis. For classification of facial expressions, a new parameter tuning strategy for the least squares support vector machine is proposed, using a contemporary optimisation technique called Jaya optimisation. Experimental evaluations are performed on the Japanese Female Facial Expression (JAFFE) and the Extended Cohn–Kanade (CK+) datasets, and the results based on a 5-fold stratified cross-validation test confirm the superiority of the proposed method over state-of-the-art approaches.
{"title":"A hybrid feature descriptor with Jaya optimised least squares SVM for facial expression recognition","authors":"Nikunja Bihari Kar, D. Nayak, Korra Sathya Babu, Yudong Zhang","doi":"10.1049/IPR2.12118","DOIUrl":"https://doi.org/10.1049/IPR2.12118","url":null,"abstract":"Facial expression recognition has been a long-standing problem in the field of computer vision. This paper proposes a new simple scheme for effective recognition of facial expressions based on a hybrid feature descriptor and an improved classifier. Inspired by the success of stationary wavelet transform in many computer vision tasks, stationary wavelet transform is first employed on the pre-processed face image. The pyramid of histograms of orientation gradient features is then computed from the low-frequency stationary wavelet transform coefficients to capture more prominent details from facial images. The key idea of this hybrid feature descriptor is to exploit both spatial and frequency domain features which at the same time are robust against illumination and noise. The relevant features are subsequently determined using linear discriminant analysis. A new least squares support vector machine parameter tuning strategy is proposed using a contemporary optimisation technique called Jaya optimisation for classification of facial expressions. 
Experimental evaluations are performed on Japanese female facial expression and the Extended Cohn–Kanade (CK+) datasets, and the results based on 5-fold stratified cross-validation test confirm the superiority of the proposed method over state-of-the-art approaches.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"13 1","pages":"1471-1483"},"PeriodicalIF":0.0,"publicationDate":"2021-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82036079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
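The Jaya-based parameter tuning mentioned in the abstract above can be sketched as follows. This is a minimal illustration of the standard Jaya update rule (move towards the best candidate and away from the worst, keeping a move only if it improves the objective). The objective `toy_error` is a hypothetical stand-in for a cross-validation error over two classifier parameters; the paper's actual objective, parameter ranges and population settings are not given in the abstract.

```python
import random


def jaya_minimise(objective, bounds, pop_size=20, iterations=100, seed=0):
    """Minimise `objective` over box `bounds` with the Jaya update rule."""
    rng = random.Random(seed)
    population = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(x) for x in population]
    for _ in range(iterations):
        best = population[scores.index(min(scores))]
        worst = population[scores.index(max(scores))]
        for i, x in enumerate(population):
            candidate = []
            for j, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                # Jaya step: attract to the best, repel from the worst.
                value = x[j] + r1 * (best[j] - abs(x[j])) - r2 * (worst[j] - abs(x[j]))
                candidate.append(min(hi, max(lo, value)))  # clip to the bounds
            score = objective(candidate)
            if score < scores[i]:  # greedy acceptance
                population[i], scores[i] = candidate, score
    k = scores.index(min(scores))
    return population[k], scores[k]


def toy_error(params):
    """Hypothetical stand-in for a validation error over two parameters."""
    regularisation, kernel_width = params
    return (regularisation - 10.0) ** 2 + (kernel_width - 0.5) ** 2
```

A call such as `jaya_minimise(toy_error, [(0.1, 100.0), (0.01, 10.0)])` returns the best parameter pair found and its score. Jaya has no algorithm-specific control parameters beyond population size and iteration count, which is one reason it is often chosen for this kind of tuning.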
Geetha Pavani Pappu, B. Biswal, M. Sairam, P. Biswal
This article presents an exclusive-disjunction-based detection of neovascularisation (NV), the formation of new blood vessels on the retinal surface. These vessels are thin and fragile and rupture easily, which can lead to permanent blindness. The proposed algorithm consists of two stages. In the first stage, the retinal images are classified into non-NV and NV using a multi-scale convolutional neural network. In the second stage, 13 relevant features are extracted from the vascular map of the NV images to obtain the pixel locations of new blood vessels, using a directional matched filter along with the Difference of Laplacian of Gaussian operator, followed by an exclusive disjunction function with adaptive thresholding of the vascular map. At the same time, the pixel locations of the optic disc (OD) are detected using the intensity distribution and variations in the retinal images. Finally, the pixel locations of the new blood vessels and the OD are compared for classification: if the pixel locations of new blood vessels fall inside the OD, they are labelled as NV on OD; otherwise they are labelled as NV elsewhere. The proposed algorithm achieved an accuracy of 99.5%, a specificity of 97.5%, a sensitivity of 98.9%, and an area under the curve of 94.2% when tested on 155 non-NV and 115 NV images.
{"title":"An exclusive-disjunction-based detection of neovascularisation using multi-scale CNN","authors":"Geetha Pavani Pappu, B. Biswal, M. Sairam, P. Biswal","doi":"10.1049/ipr2.12122","DOIUrl":"https://doi.org/10.1049/ipr2.12122","url":null,"abstract":"In this article, an exclusive-disjunction-based detection of neovascularisation (NV), which is the formation of new blood vessels on the retinal surfaces, is presented. These vessels, being thin and fragile, get ruptured easily leading to permanent blindness. The proposed algorithm consists of two stages. In the first stage, the retinal images are classified into non-NV and NV using multi-scale convolutional neural network, while in the second stage, 13 relevant features are extracted from the vascular map of NV images to achieve the pixel locations of new blood vessels using a directional matched filter along with the Difference of Laplacian of Gaussian operator followed by an exclusive disjunction function with adaptive thresholding of the vascular map. At the same time, the pixel locations of optic disc (OD) are detected using intensity distribution and variations on the retinal images. Finally, the pixel locations of both new blood vessels and OD are compared for classification. If the pixel locations of new blood vessels fall inside the OD, they are labelled as NV on OD, else they are labelled as NV elsewhere. 
The proposed algorithm has achieved an accuracy of 99.5%, specificity of 97.5%, sensitivity of 98.9%, and area under the curve of 94.2% when tested on 155 non-NV and 115 NV images.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"11 1","pages":"1518-1529"},"PeriodicalIF":0.0,"publicationDate":"2021-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88475184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
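The exclusive-disjunction step and the final NV on OD / NV elsewhere labelling described in the abstract above can be sketched with small binary maps. The choice of which two maps are combined (here a filter-response map and a thresholded vascular map), the helper names, and the per-pixel labelling are illustrative assumptions; the abstract does not specify these details.

```python
def xor_maps(map_a, map_b):
    """Pixel-wise exclusive disjunction: keep pixels set in exactly one map."""
    return [[a ^ b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(map_a, map_b)]


def pixel_locations(binary_map):
    """Return the set of (row, column) positions whose value is 1."""
    return {
        (r, c)
        for r, row in enumerate(binary_map)
        for c, value in enumerate(row)
        if value
    }


def label_new_vessels(new_vessel_pixels, od_pixels):
    """Label each new-vessel pixel by whether it falls inside the optic disc."""
    return {
        pixel: "NV on OD" if pixel in od_pixels else "NV elsewhere"
        for pixel in new_vessel_pixels
    }


# Hypothetical 2x3 maps, for illustration only.
filter_response = [[0, 1, 1], [0, 1, 0]]
thresholded_vascular_map = [[0, 1, 0], [0, 0, 0]]
optic_disc = {(1, 1), (1, 2)}

candidates = pixel_locations(xor_maps(filter_response, thresholded_vascular_map))
labels = label_new_vessels(candidates, optic_disc)
```

In this toy case the pixel present in both maps is cancelled by the XOR, leaving two candidate pixels, one inside and one outside the optic disc.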
Recently, deep convolutional neural networks have been successfully used for image denoising due to their favourable performance. This paper applies the error feedback mechanism to image denoising and proposes an error feedback denoising network. Specifically, we use a down-and-up projection sequence to estimate the noise feature. Through the residual connection, the clean structures are removed from the noise features. The essential difference between the proposed network and other existing feedback networks is the projection sequence: our error feedback projection sequence is down-and-up, which is more suitable for image denoising than the existing up-and-down order. Moreover, we design a compression block to improve the expression ability of the general 1 × 1 convolutional compression layer. The advantage of our down-and-up block is that it uses fewer network parameters than other feedback networks while enlarging the receptive field. We have applied our error feedback denoising network to denoising and JPEG image deblocking. Extensive experiments verify the effectiveness of our down-and-up block and demonstrate that our error feedback denoising network is comparable with the state of the art. The source code for reproducing the results is available at: https://github.com/Houruizhi/EFDN.
{"title":"Error feedback denoising network","authors":"R. Hou, Fang Li","doi":"10.1049/ipr2.12121","DOIUrl":"https://doi.org/10.1049/ipr2.12121","url":null,"abstract":"Recently, deep convolutional neural networks have been successfully used for image denoising due to their favourable performance. This paper examines the error feedback mechanism to image denoising and propose an error feedback denoising network. Specifically, we use the down-and-up projection sequence to estimate the noise feature. By the residual connection, the clean structures are removed from the noise features. The essential difference between the proposed network and other existing feedback networks is the projection sequence. Our error feedback projection sequence is down-and-up, which is more suitable for image denoising than the existing up-and-down order. Moreover, we design a compression block to improve the expression ability of the general 1 × 1 convolutional compression layer. The advantage of our well-designed down-and-up block is that the network parameters are fewer than other feedback networks and the receptive field is enlarged. We have implemented our error feedback denoising network on denoising and JPEG image deblocking. Extensive experiments verify the effectiveness of our down-and-up block and demonstrate that our error feedback denoising network is comparable with the state-of-the-art. The code will be open source. 
The source codes for reproducing the results can be found at: https://github.com/Houruizhi/EFDN.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"1 1","pages":"1508-1517"},"PeriodicalIF":0.0,"publicationDate":"2021-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87792341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}