Pub Date : 2023-09-09DOI: 10.1142/s0219467825500275
MD Azam Pasha, M. Narayana
Due to the combination of genetic diseases as well as a variety of biomedical abnormalities, the fatal disease named cancer is caused. Colon and lung cancer are regarded as the two leading diseases for disability and death. The most significant component for demonstrating the best course of action is the histopathological identification of such malignancies. So, in order to minimize the mortality rate caused by cancer, there is a need for early detection of the aliment on both fronts accordingly. In this case, both the deep and machine learning techniques have been utilized to speed up the detection process of cancer which may also help the researchers to study a huge amount of patients over a short period and less loss. Hence, it is highly essential to design a new lung and colon detection model based on deep learning approaches. Initially, a different set of histopathological images is collected from benchmark resources to perform effective analysis. Then, to attain the first set of features, the collected image is offered to the dilated net for attaining deep image features with the help of the Visual Geometry Group (VGG16) and Residual Neural Network (ResNet). Further, the second set of features is attained by the below process. Here, the collected image is given to pre-processing phase and the image is pre–pre-processed with the help of Contrast-limited Adaptive Histogram Equalization (CLAHE) and filter technique. Then, the pre-processed image is offered to the segmentation phase with the help of adaptive binary thresholding and offered to a dilated network that holds VGG16 and ResNet and attained the second set of features. The parameters of adaptive binary thresholding are tuned with the help of a developed hybrid approach called Sand Cat swarm JAya Optimization (SC-JAO) via Sand Cat swarm Optimization (SCO) and JAYA (SC-JAO). Finally, the third set of features is attained by offering the image to pre-processing phase. Then, the pre-processed image is offered to the segmentation phase and the image is a segmented phase and features are tuned by developed SC-JAO. Further, the segmented features are offered to attain the textural features like Gray-Level Co-Occurrence Matrix (GLCM) and Local Weber Pattern (LWP) and attained the third set of features. Then, the attained three different sets of features are given to the optimal weighted feature phase, where the parameters are optimized by the SC-JAO algorithm and then given to the disease prediction phase. Here, disease prediction is made with the help of Attention-based Adaptive Weighted Recurrent Neural Networks (AAW-RNN), and their parameters are tuned by developed SC-JAO. Thus, the developed model achieved an effective lung and colon detection rate over conventional approaches over multiple experimental analyses.
{"title":"Development of Trio Optimal Feature Extraction Model for Attention-Based Adaptive Weighted RNN-Based Lung and Colon Cancer Detection Framework Using Histopathological Images","authors":"MD Azam Pasha, M. Narayana","doi":"10.1142/s0219467825500275","DOIUrl":"https://doi.org/10.1142/s0219467825500275","url":null,"abstract":"Due to the combination of genetic diseases as well as a variety of biomedical abnormalities, the fatal disease named cancer is caused. Colon and lung cancer are regarded as the two leading diseases for disability and death. The most significant component for demonstrating the best course of action is the histopathological identification of such malignancies. So, in order to minimize the mortality rate caused by cancer, there is a need for early detection of the aliment on both fronts accordingly. In this case, both the deep and machine learning techniques have been utilized to speed up the detection process of cancer which may also help the researchers to study a huge amount of patients over a short period and less loss. Hence, it is highly essential to design a new lung and colon detection model based on deep learning approaches. Initially, a different set of histopathological images is collected from benchmark resources to perform effective analysis. Then, to attain the first set of features, the collected image is offered to the dilated net for attaining deep image features with the help of the Visual Geometry Group (VGG16) and Residual Neural Network (ResNet). Further, the second set of features is attained by the below process. Here, the collected image is given to pre-processing phase and the image is pre–pre-processed with the help of Contrast-limited Adaptive Histogram Equalization (CLAHE) and filter technique. Then, the pre-processed image is offered to the segmentation phase with the help of adaptive binary thresholding and offered to a dilated network that holds VGG16 and ResNet and attained the second set of features. The parameters of adaptive binary thresholding are tuned with the help of a developed hybrid approach called Sand Cat swarm JAya Optimization (SC-JAO) via Sand Cat swarm Optimization (SCO) and JAYA (SC-JAO). Finally, the third set of features is attained by offering the image to pre-processing phase. Then, the pre-processed image is offered to the segmentation phase and the image is a segmented phase and features are tuned by developed SC-JAO. Further, the segmented features are offered to attain the textural features like Gray-Level Co-Occurrence Matrix (GLCM) and Local Weber Pattern (LWP) and attained the third set of features. Then, the attained three different sets of features are given to the optimal weighted feature phase, where the parameters are optimized by the SC-JAO algorithm and then given to the disease prediction phase. Here, disease prediction is made with the help of Attention-based Adaptive Weighted Recurrent Neural Networks (AAW-RNN), and their parameters are tuned by developed SC-JAO. Thus, the developed model achieved an effective lung and colon detection rate over conventional approaches over multiple experimental analyses.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136193149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-07DOI: 10.1142/s0219467825500342
Madhavarapu Chandan, S. G. Santhi, T. Srinivasa Rao
Due to the major operating restrictions, ensuring security is the fundamental problem of Wireless Sensor Networks (WSNs). Because of their inadequate security mechanisms, WSNs are indeed a simple point for malware (worms, viruses, malicious code, etc.). According to the epidemic nature of worm propagation, it is critical to develop a worm defense mechanism in the network. This concept aims to establish novel malware detection in WSN that consists of several phases: “(i) Preprocessing, (ii) feature extraction, as well as (iii) detection”. At first, the input data is subjected for preprocessing phase. Then, the feature extraction takes place, in which principal component analysis (PCA), improved linear discriminant analysis (LDA), and autoencoder-based characteristics are retrieved. Moreover, the retrieved characteristics are subjected to the detection phase. The detection is performed employing combined shallow learning and DL. Further, the shallow learning includes decision tree (DT), logistic regression (LR), and Naive Bayes (NB); the deep learning (DL) includes deep neural network (DNN), convolutional neural network (CNN), and recurrent neural network (RNN). Here, the DT output is given to the DNN, LR output is subjected to CNN, and the NB output is given to the RNN, respectively. Eventually, the DNN, CNN, and RNN outputs are averaged to generate a successful outcome. The combination can be thought of as an Ensemble classifier. The weight of the RNN is optimally tuned through the Self Improved Shark Smell Optimization with Opposition Learning (SISSOOL) model to improve detection precision and accuracy. Lastly, the outcomes of the suggested approach are computed in terms of different measures.
{"title":"Combined Shallow and Deep Learning Models for Malware Detection in Wsn","authors":"Madhavarapu Chandan, S. G. Santhi, T. Srinivasa Rao","doi":"10.1142/s0219467825500342","DOIUrl":"https://doi.org/10.1142/s0219467825500342","url":null,"abstract":"Due to the major operating restrictions, ensuring security is the fundamental problem of Wireless Sensor Networks (WSNs). Because of their inadequate security mechanisms, WSNs are indeed a simple point for malware (worms, viruses, malicious code, etc.). According to the epidemic nature of worm propagation, it is critical to develop a worm defense mechanism in the network. This concept aims to establish novel malware detection in WSN that consists of several phases: “(i) Preprocessing, (ii) feature extraction, as well as (iii) detection”. At first, the input data is subjected for preprocessing phase. Then, the feature extraction takes place, in which principal component analysis (PCA), improved linear discriminant analysis (LDA), and autoencoder-based characteristics are retrieved. Moreover, the retrieved characteristics are subjected to the detection phase. The detection is performed employing combined shallow learning and DL. Further, the shallow learning includes decision tree (DT), logistic regression (LR), and Naive Bayes (NB); the deep learning (DL) includes deep neural network (DNN), convolutional neural network (CNN), and recurrent neural network (RNN). Here, the DT output is given to the DNN, LR output is subjected to CNN, and the NB output is given to the RNN, respectively. Eventually, the DNN, CNN, and RNN outputs are averaged to generate a successful outcome. The combination can be thought of as an Ensemble classifier. The weight of the RNN is optimally tuned through the Self Improved Shark Smell Optimization with Opposition Learning (SISSOOL) model to improve detection precision and accuracy. Lastly, the outcomes of the suggested approach are computed in terms of different measures.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43201003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-05DOI: 10.1142/s021946782550024x
Sivaramakrishna Yechuri, Sunny Dayal Vanabathina
Speech enhancement methods differ depending on the degree of degradation and noise in the speech signal, so research in the field is still difficult, especially when dealing with residual and background noise, which is highly transient. Numerous deep learning networks have been developed that provide promising results for improving the perceptual quality and intelligibility of noisy speech. Innovation and research in speech enhancement have been opened up by the power of deep learning techniques with implications across a wide range of real time applications. By reviewing the important datasets, feature extraction methods, deep learning models, training algorithms and evaluation metrics for speech enhancement, this paper provides a comprehensive overview. We begin by tracing the evolution of speech enhancement research, from early approaches to recent advances in deep learning architectures. By analyzing and comparing the approaches to solving speech enhancement challenges, we categorize them according to their strengths and weaknesses. Moreover, we discuss the challenges and future directions of deep learning in speech enhancement, including the demand for parameter-efficient models for speech enhancement. The purpose of this paper is to examine the development of the field, compare and contrast different approaches, and highlight future directions as well as challenges for further research.
{"title":"Speech Enhancement: A Review of Different Deep Learning Methods","authors":"Sivaramakrishna Yechuri, Sunny Dayal Vanabathina","doi":"10.1142/s021946782550024x","DOIUrl":"https://doi.org/10.1142/s021946782550024x","url":null,"abstract":"Speech enhancement methods differ depending on the degree of degradation and noise in the speech signal, so research in the field is still difficult, especially when dealing with residual and background noise, which is highly transient. Numerous deep learning networks have been developed that provide promising results for improving the perceptual quality and intelligibility of noisy speech. Innovation and research in speech enhancement have been opened up by the power of deep learning techniques with implications across a wide range of real time applications. By reviewing the important datasets, feature extraction methods, deep learning models, training algorithms and evaluation metrics for speech enhancement, this paper provides a comprehensive overview. We begin by tracing the evolution of speech enhancement research, from early approaches to recent advances in deep learning architectures. By analyzing and comparing the approaches to solving speech enhancement challenges, we categorize them according to their strengths and weaknesses. Moreover, we discuss the challenges and future directions of deep learning in speech enhancement, including the demand for parameter-efficient models for speech enhancement. The purpose of this paper is to examine the development of the field, compare and contrast different approaches, and highlight future directions as well as challenges for further research.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45839286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.1142/s0219467825500093
Xin Wang, Xiaogang Dong
The blurring of texture edges often occurs during image data transmission and acquisition. To ensure the detailed clarity of the drag-time images, we propose a time image de-noising method based on sparse regularization. First, the image pixel sparsity index is set, and then an image de-noising model is established based on sparse regularization processing to obtain the neighborhood weights of similar image blocks. Second, a time image de-noising algorithm is designed to determine whether the coding coefficient reaches the standard value, and a new image de-noising method is obtained. Finally, the images of electronic clocks and mechanical clocks are used as two kinds of time images to compare different image de-noising methods, respectively. The results show that the sparsity regularization method has the highest peak signal-to-noise ratio among the six compared methods for different noise standard deviations and two time images. The image structure similarity is always above which shows that the proposed method is better than the other five image de-noising methods.
{"title":"Time Image De-Noising Method Based on Sparse Regularization","authors":"Xin Wang, Xiaogang Dong","doi":"10.1142/s0219467825500093","DOIUrl":"https://doi.org/10.1142/s0219467825500093","url":null,"abstract":"The blurring of texture edges often occurs during image data transmission and acquisition. To ensure the detailed clarity of the drag-time images, we propose a time image de-noising method based on sparse regularization. First, the image pixel sparsity index is set, and then an image de-noising model is established based on sparse regularization processing to obtain the neighborhood weights of similar image blocks. Second, a time image de-noising algorithm is designed to determine whether the coding coefficient reaches the standard value, and a new image de-noising method is obtained. Finally, the images of electronic clocks and mechanical clocks are used as two kinds of time images to compare different image de-noising methods, respectively. The results show that the sparsity regularization method has the highest peak signal-to-noise ratio among the six compared methods for different noise standard deviations and two time images. The image structure similarity is always above which shows that the proposed method is better than the other five image de-noising methods.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135688305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-31DOI: 10.1142/s0219467825500226
Rasmiranjan Mohakud, Rajashree Dash
For dermatoscopic skin lesion images, deep learning-based algorithms, particularly convolutional neural networks (CNN), have demonstrated good classification and segmentation capabilities. The impact of utilizing lesion segmentation data on classification performance, however, is still up for being subject to discussion. Being driven in this direction, in this work we propose a hybrid deep learning-based model to classify the skin cancer using segmented images. In the first stage, a fully convolutional encoder–decoder network (FCEDN) is employed to segment the skin cancer image and then in the second phase, a CNN is applied on the segmented images for classification. As the model’s success depends on the hyper-parameters it uses and fine-tuning these hyper-parameters by hand is time-consuming, so in this study the hyper-parameters of the hybrid model are optimized by utilizing an exponential neighborhood gray wolf optimization (ENGWO) technique. Extensive experiments are carried out using the International Skin Imaging Collaboration (ISIC) 2016 and ISIC 2017 datasets to show the efficacy of the model. The suggested model has been evaluated on both balanced and unbalanced datasets. With the balanced dataset, the proposed hybrid model achieves training accuracy up to 99.98%, validation accuracy up to 92.13% and testing accuracy up to 89.75%. It is evident from the findings that the proposed hybrid model outperforms previous known models in a competitive manner over balanced data.
{"title":"A Hybrid Model for Classification of Skin Cancer Images After Segmentation","authors":"Rasmiranjan Mohakud, Rajashree Dash","doi":"10.1142/s0219467825500226","DOIUrl":"https://doi.org/10.1142/s0219467825500226","url":null,"abstract":"For dermatoscopic skin lesion images, deep learning-based algorithms, particularly convolutional neural networks (CNN), have demonstrated good classification and segmentation capabilities. The impact of utilizing lesion segmentation data on classification performance, however, is still up for being subject to discussion. Being driven in this direction, in this work we propose a hybrid deep learning-based model to classify the skin cancer using segmented images. In the first stage, a fully convolutional encoder–decoder network (FCEDN) is employed to segment the skin cancer image and then in the second phase, a CNN is applied on the segmented images for classification. As the model’s success depends on the hyper-parameters it uses and fine-tuning these hyper-parameters by hand is time-consuming, so in this study the hyper-parameters of the hybrid model are optimized by utilizing an exponential neighborhood gray wolf optimization (ENGWO) technique. Extensive experiments are carried out using the International Skin Imaging Collaboration (ISIC) 2016 and ISIC 2017 datasets to show the efficacy of the model. The suggested model has been evaluated on both balanced and unbalanced datasets. With the balanced dataset, the proposed hybrid model achieves training accuracy up to 99.98%, validation accuracy up to 92.13% and testing accuracy up to 89.75%. It is evident from the findings that the proposed hybrid model outperforms previous known models in a competitive manner over balanced data.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44995806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-31DOI: 10.1142/s0219467825500238
Sumit Chhabra, Khushboo Bansal
Human brain tumors are now the most serious and horrible diseases for people, causing certain deaths. The patient’s life also becomes more complicated over time as a result of the brain tumor. Thus, it is essential to find tumors early to safeguard and extend the patient’s life. Hence, new improvements are highly essential in the techniques of brain tumor detection in medical areas. To address this, research has introduced automatic brain tumor prediction using Pteropus unicinctus optimization on deep neural networks (PUO-deep NNs). Initially, the data are gathered from the BraTS MICCAI brain tumor dataset and preprocessing and ROI extraction are performed to remove the noise from the data. Then the extracted RoI is forwarded to the fuzzy c-means (FCM) clustering to segment the brain image. The parameters of the FCM tune the PUO algorithm so the image is segmented into the tumor region and the non-tumor region. Then the feature extraction takes place on ResNet. Finally, the deep NN classifier successfully predicted the brain tumor by utilizing the PUO method, which improved the classifier performance and produced extremely accurate results. For dataset 1, the PUO-deep NN achieved values of 87.69% accuracy, 93.81% sensitivity, and 99.01% specificity. The suggested PUO-deep NN also attained the values for dataset 2 of 98.49%, 98.55%, and 95.60%, which is significantly more effective than the current approaches.
{"title":"An Efficient Brain Tumor Prediction Using Pteropus Unicinctus Optimization on Deep Neural Network","authors":"Sumit Chhabra, Khushboo Bansal","doi":"10.1142/s0219467825500238","DOIUrl":"https://doi.org/10.1142/s0219467825500238","url":null,"abstract":"Human brain tumors are now the most serious and horrible diseases for people, causing certain deaths. The patient’s life also becomes more complicated over time as a result of the brain tumor. Thus, it is essential to find tumors early to safeguard and extend the patient’s life. Hence, new improvements are highly essential in the techniques of brain tumor detection in medical areas. To address this, research has introduced automatic brain tumor prediction using Pteropus unicinctus optimization on deep neural networks (PUO-deep NNs). Initially, the data are gathered from the BraTS MICCAI brain tumor dataset and preprocessing and ROI extraction are performed to remove the noise from the data. Then the extracted RoI is forwarded to the fuzzy c-means (FCM) clustering to segment the brain image. The parameters of the FCM tune the PUO algorithm so the image is segmented into the tumor region and the non-tumor region. Then the feature extraction takes place on ResNet. Finally, the deep NN classifier successfully predicted the brain tumor by utilizing the PUO method, which improved the classifier performance and produced extremely accurate results. For dataset 1, the PUO-deep NN achieved values of 87.69% accuracy, 93.81% sensitivity, and 99.01% specificity. The suggested PUO-deep NN also attained the values for dataset 2 of 98.49%, 98.55%, and 95.60%, which is significantly more effective than the current approaches.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48086675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-30DOI: 10.1142/s0219467825500299
Xueying Duan
Recognizing abnormal behavior recognition (ABR) is an important part of social security work. To ensure social harmony and stability, it is of great significance to study the identification methods of abnormal human motion behavior. Aiming at the low accuracy of human motion ABR method, ABR method for human motion based on improved deep reinforcement learning (DRL) is proposed. First, the background image is processed in combination with the Gaussian model; second, the background features and human motion trajectory features are extracted, respectively; finally, the improved DRL model is constructed, and the feature information is input into the improvement model to further extract the abnormal behavior features, and the ABR of human motion is realized through the interaction between the agent and the environment. The different methods were examined based on UCF101 data set and HiEve data set. The results show that the accuracy of human motion key point acquisition and posture estimation accuracy is high, the proposed method sensitivity is good, and the recognition accuracy of human motion abnormal behavior is as high as 95.5%. It can realize the ABR for human motion and lay a foundation for the further development of follow-up social security management.
{"title":"Abnormal Behavior Recognition for Human Motion Based on Improved Deep Reinforcement Learning","authors":"Xueying Duan","doi":"10.1142/s0219467825500299","DOIUrl":"https://doi.org/10.1142/s0219467825500299","url":null,"abstract":"Recognizing abnormal behavior recognition (ABR) is an important part of social security work. To ensure social harmony and stability, it is of great significance to study the identification methods of abnormal human motion behavior. Aiming at the low accuracy of human motion ABR method, ABR method for human motion based on improved deep reinforcement learning (DRL) is proposed. First, the background image is processed in combination with the Gaussian model; second, the background features and human motion trajectory features are extracted, respectively; finally, the improved DRL model is constructed, and the feature information is input into the improvement model to further extract the abnormal behavior features, and the ABR of human motion is realized through the interaction between the agent and the environment. The different methods were examined based on UCF101 data set and HiEve data set. The results show that the accuracy of human motion key point acquisition and posture estimation accuracy is high, the proposed method sensitivity is good, and the recognition accuracy of human motion abnormal behavior is as high as 95.5%. It can realize the ABR for human motion and lay a foundation for the further development of follow-up social security management.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41533888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-29DOI: 10.1142/s0219467825500263
Manochandar Thenralmanoharan, P. Kumaraguru Diderot
Accurate and rapid detection of Alzheimer’s disease (AD) using magnetic resonance imaging (MRI) gained considerable attention among research workers because of an increased number of current researches being driven by deep learning (DL) methods that have accomplished outstanding outcomes in variety of domains involving medical image analysis. Especially, convolution neural network (CNN) is primarily applied for the analyses of image datasets according to the capability of handling massive unstructured datasets and automatically extracting significant features. Earlier detection is dominant to the success and development interferences, and neuroimaging characterizes the potential regions for earlier diagnosis of AD. The study presents and develops a novel Deep Learning-based Magnetic Resonance Image Segmentation and Classification for AD Diagnosis (DLMRISC-ADD) model. The presented DLMRISC-ADD model mainly focuses on the segmentation of MRI images to detect AD. To accomplish this, the presented DLMRISC-ADD model follows a two-stage process, namely, skull stripping and image segmentation. At the preliminary stage, the presented DLMRISC-ADD model employs U-Net-based skull stripping approach to remove skull regions from the input MRIs. Next, in the second stage, the DLMRISC-ADD model applies QuickNAT model for MRI image segmentation, which identifies distinct parts such as white matter, gray matter, hippocampus, amygdala, and ventricles. Moreover, densely connected network (DenseNet201) feature extractor with sparse autoencoder (SAE) classifier is used for AD detection process. A brief set of simulations is implemented on ADNI dataset to demonstrate the improved performance of the DLMRISC-ADD method, and the outcomes are examined extensively. The experimental results exhibit the effectual segmentation results of the DLMRISC-ADD technique.
{"title":"Deep Learning-Based Magnetic Resonance Image Segmentation and Classification for Alzheimer’s Disease Diagnosis","authors":"Manochandar Thenralmanoharan, P. Kumaraguru Diderot","doi":"10.1142/s0219467825500263","DOIUrl":"https://doi.org/10.1142/s0219467825500263","url":null,"abstract":"Accurate and rapid detection of Alzheimer’s disease (AD) using magnetic resonance imaging (MRI) gained considerable attention among research workers because of an increased number of current researches being driven by deep learning (DL) methods that have accomplished outstanding outcomes in variety of domains involving medical image analysis. Especially, convolution neural network (CNN) is primarily applied for the analyses of image datasets according to the capability of handling massive unstructured datasets and automatically extracting significant features. Earlier detection is dominant to the success and development interferences, and neuroimaging characterizes the potential regions for earlier diagnosis of AD. The study presents and develops a novel Deep Learning-based Magnetic Resonance Image Segmentation and Classification for AD Diagnosis (DLMRISC-ADD) model. The presented DLMRISC-ADD model mainly focuses on the segmentation of MRI images to detect AD. To accomplish this, the presented DLMRISC-ADD model follows a two-stage process, namely, skull stripping and image segmentation. At the preliminary stage, the presented DLMRISC-ADD model employs U-Net-based skull stripping approach to remove skull regions from the input MRIs. Next, in the second stage, the DLMRISC-ADD model applies QuickNAT model for MRI image segmentation, which identifies distinct parts such as white matter, gray matter, hippocampus, amygdala, and ventricles. Moreover, densely connected network (DenseNet201) feature extractor with sparse autoencoder (SAE) classifier is used for AD detection process. A brief set of simulations is implemented on ADNI dataset to demonstrate the improved performance of the DLMRISC-ADD method, and the outcomes are examined extensively. The experimental results exhibit the effectual segmentation results of the DLMRISC-ADD technique.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45697084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-28DOI: 10.1142/s0219467825500251
Shabana Rai, Arif Ullah, Wong Lai Kuan, Rifat Mustafa
When it comes to filtering and compressing data before sending it to a cloud server, fog computing is a rummage sale. Fog computing enables an alternate method to reduce the complexity of medical image processing and steadily improve its dependability. Medical images are produced by imaging processing modalities using X-rays, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, and ultrasound (US). These medical images are large and have a huge amount of storage. This problem is being solved by making use of compression. In this area, lots of work is done. However, before adding more techniques to Fog, getting a high compression ratio (CR) in a shorter time is required, therefore consuming less network traffic. Le Gall5/3 integer wavelet transform (IWT) and a set partitioning in hierarchical trees (SPIHT) encoder were used in this study’s implementation of an image compression technique. MRI is used in the experiments. The suggested technique uses a modified CR and less compression time (CT) to compress the medical image. The proposed approach results in an average CR of 84.8895%. A 40.92% peak signal-to-noise ratio (PSNR) PNSR value is present. Using the Huffman coding, the proposed approach reduces the CT by 36.7434 s compared to the IWT. Regarding CR, the suggested technique outperforms IWTs with Huffman coding by 12%. The current approach has a 72.36% CR. The suggested work’s shortcoming is that the high CR caused a decline in the quality of the medical images. PSNR values can be raised, and more effort can be made to compress colored medical images and 3-dimensional medical images.
{"title":"An Enhanced Compression Method for Medical Images Using SPIHT Encoder for Fog Computing","authors":"Shabana Rai, Arif Ullah, Wong Lai Kuan, Rifat Mustafa","doi":"10.1142/s0219467825500251","DOIUrl":"https://doi.org/10.1142/s0219467825500251","url":null,"abstract":"When it comes to filtering and compressing data before sending it to a cloud server, fog computing is a rummage sale. Fog computing enables an alternate method to reduce the complexity of medical image processing and steadily improve its dependability. Medical images are produced by imaging processing modalities using X-rays, computed tomography (CT) scans, magnetic resonance imaging (MRI) scans, and ultrasound (US). These medical images are large and have a huge amount of storage. This problem is being solved by making use of compression. In this area, lots of work is done. However, before adding more techniques to Fog, getting a high compression ratio (CR) in a shorter time is required, therefore consuming less network traffic. Le Gall5/3 integer wavelet transform (IWT) and a set partitioning in hierarchical trees (SPIHT) encoder were used in this study’s implementation of an image compression technique. MRI is used in the experiments. The suggested technique uses a modified CR and less compression time (CT) to compress the medical image. The proposed approach results in an average CR of 84.8895%. A 40.92% peak signal-to-noise ratio (PSNR) PNSR value is present. Using the Huffman coding, the proposed approach reduces the CT by 36.7434 s compared to the IWT. Regarding CR, the suggested technique outperforms IWTs with Huffman coding by 12%. The current approach has a 72.36% CR. The suggested work’s shortcoming is that the high CR caused a decline in the quality of the medical images. PSNR values can be raised, and more effort can be made to compress colored medical images and 3-dimensional medical images.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48965801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-08-22DOI: 10.1142/s0219467825500330
Neha Hajare, A. Rajawat
Black gram crop belongs to the Fabaceae family and its scientific name is Vigna Mungo.It has high nutritional content, improves the fertility of the soil, and provides atmospheric nitrogen fixation in the soil. The quality of the black gram crop is degraded by diseases such as Yellow mosaic, Anthracnose, Powdery Mildew, and Leaf Crinkle which causes economic loss to farmers and degraded production. The agriculture sector needs to classify plant nutrient deficiencies in order to increase crop quality and yield. In order to handle a variety of difficult challenges, computer vision and deep learning technologies play a crucial role in the agricultural and biological sectors. The typical diagnostic procedure involves a pathologist visiting the site and inspecting each plant. However, manually crop disease assessment is limited due to lesser accuracy and limited access of personnel. To address these problems, it is necessary to develop automated methods that can quickly identify and classify a wide range of plant diseases. In this paper, black gram disease classifications are done through a deep ensemble model with optimal training and the procedure of this technique is as follows: Initially, the input dataset is processed to increase its size via data augmentation. Here, the processes like shifting, rotation, and shearing take place. Then, the model starts with the noise removal of images using median filtering. Subsequent to the preprocessing, segmentation takes place via the proposed deep joint segmentation model to determine the ROI and non-ROI regions. The next process is the extraction of the feature set that includes the features like improved multi-texton-based features, shape-based features, color-based features, and local Gabor X-OR pattern features. The model combines the classifiers like Deep Belief Networks, Recurrent Neural Networks, and Convolutional Neural Networks. For tuning the optimal weights of the model, a new algorithm termed swarm intelligence-based Self-Improved Dwarf Mongoose Optimization algorithm (SIDMO) is introduced. Over the past two decades, nature-based metaheuristic algorithms have gained more popularity because of their ability to solve various global optimization problems with optimal solutions. This training model ensures the enhancement of classification accuracy. The accuracy of the SIDMO, which is around 94.82%, is substantially higher than that of the existing models, which are FPA[Formula: see text]88.86%, SSOA[Formula: see text]88.99%, GOA[Formula: see text]85.84%, SMA[Formula: see text]85.11%, SRSR[Formula: see text]85.32%, and DMOA[Formula: see text]88.99%, respectively.
{"title":"Black Gram Disease Classification via Deep Ensemble Model with Optimal Training","authors":"Neha Hajare, A. Rajawat","doi":"10.1142/s0219467825500330","DOIUrl":"https://doi.org/10.1142/s0219467825500330","url":null,"abstract":"Black gram crop belongs to the Fabaceae family and its scientific name is Vigna Mungo.It has high nutritional content, improves the fertility of the soil, and provides atmospheric nitrogen fixation in the soil. The quality of the black gram crop is degraded by diseases such as Yellow mosaic, Anthracnose, Powdery Mildew, and Leaf Crinkle which causes economic loss to farmers and degraded production. The agriculture sector needs to classify plant nutrient deficiencies in order to increase crop quality and yield. In order to handle a variety of difficult challenges, computer vision and deep learning technologies play a crucial role in the agricultural and biological sectors. The typical diagnostic procedure involves a pathologist visiting the site and inspecting each plant. However, manually crop disease assessment is limited due to lesser accuracy and limited access of personnel. To address these problems, it is necessary to develop automated methods that can quickly identify and classify a wide range of plant diseases. In this paper, black gram disease classifications are done through a deep ensemble model with optimal training and the procedure of this technique is as follows: Initially, the input dataset is processed to increase its size via data augmentation. Here, the processes like shifting, rotation, and shearing take place. Then, the model starts with the noise removal of images using median filtering. Subsequent to the preprocessing, segmentation takes place via the proposed deep joint segmentation model to determine the ROI and non-ROI regions. The next process is the extraction of the feature set that includes the features like improved multi-texton-based features, shape-based features, color-based features, and local Gabor X-OR pattern features. The model combines the classifiers like Deep Belief Networks, Recurrent Neural Networks, and Convolutional Neural Networks. For tuning the optimal weights of the model, a new algorithm termed swarm intelligence-based Self-Improved Dwarf Mongoose Optimization algorithm (SIDMO) is introduced. Over the past two decades, nature-based metaheuristic algorithms have gained more popularity because of their ability to solve various global optimization problems with optimal solutions. This training model ensures the enhancement of classification accuracy. The accuracy of the SIDMO, which is around 94.82%, is substantially higher than that of the existing models, which are FPA[Formula: see text]88.86%, SSOA[Formula: see text]88.99%, GOA[Formula: see text]85.84%, SMA[Formula: see text]85.11%, SRSR[Formula: see text]85.32%, and DMOA[Formula: see text]88.99%, respectively.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43941856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}