Pub Date : 2023-08-03DOI: 10.1142/s0219467825500123
Padmanayana Bhat, B. Anoop
The eye-related problem of diabetes is called diabetic retinopathy (DR), which is the main factor contributing to visual loss. This research develops an enhanced deep model for DR classification. Here, deep convolutional neural network (Deep CNN) is trained with the improved invasive weed social ski-driver optimization (IISSDO), which is generated by fusing improved invasive weed optimization (IIWO) and social ski-driver (SSD). The IISSDO-based Deep CNN classifies DR severity into normal, mild, non-proliferative DR (NPDR), moderate NPDR, severe NPDR, and proliferative. Initially, a type 2 fuzzy and cuckoo search (T2FCS) filter performs pre-processing and the quality of the data is improved by data augmentation. The lesion is then divided using DeepJoint segmentation. Then, the Deep CNN determines the DR. The analysis uses the Indian DR image database. The IISSDO-based Deep CNN has the highest accuracy, sensitivity, and specificity of 96.566%, 96.773%, and 96.517%, respectively.
{"title":"Improved Invasive Weed Social Ski-Driver Optimization-Based Deep Convolution Neural Network for Diabetic Retinopathy Classification","authors":"Padmanayana Bhat, B. Anoop","doi":"10.1142/s0219467825500123","DOIUrl":"https://doi.org/10.1142/s0219467825500123","url":null,"abstract":"The eye-related problem of diabetes is called diabetic retinopathy (DR), which is the main factor contributing to visual loss. This research develops an enhanced deep model for DR classification. Here, deep convolutional neural network (Deep CNN) is trained with the improved invasive weed social ski-driver optimization (IISSDO), which is generated by fusing improved invasive weed optimization (IIWO) and social ski-driver (SSD). The IISSDO-based Deep CNN classifies DR severity into normal, mild, non-proliferative DR (NPDR), moderate NPDR, severe NPDR, and proliferative. Initially, a type 2 fuzzy and cuckoo search (T2FCS) filter performs pre-processing and the quality of the data is improved by data augmentation. The lesion is then divided using DeepJoint segmentation. Then, the Deep CNN determines the DR. The analysis uses the Indian DR image database. The IISSDO-based Deep CNN has the highest accuracy, sensitivity, and specificity of 96.566%, 96.773%, and 96.517%, respectively.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41622319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-28DOI: 10.1142/s0219467825500172
Pramod M. Bachiphale, N. Zulpe
Problem: Digital devices are becoming increasingly powerful and smart, which is improving quality of life, but presents new challenges to privacy protection. Visual cryptographic schemes provide data sharing privacy, but have drawbacks such as extra storage space, lossy secret images, and the need to store permutation keys. Aim: This paper proposes a light-weight visual sign-cryptography scheme based on optimal key generation to address the disadvantages of existing visual cryptographic schemes and improve the security, sharing quality, and time consumption of multisecret images. Methods: The proposed light-weight visual sign-cryptography (LW-VSC) scheme consists of three processes: band separation, shares generation, and signcryption/un-signcryption. The process of separation and shares generation is done by an existing method. The multiple shares of the secret images are then encrypted/decrypted using light-weight sign-cryptography. The proposed scheme uses a novel harpy eagle search optimization (HESO) algorithm to generate optimal keys for both the encrypt/decrypt processes. Results: Simulation results and comparative analysis showed the proposed scheme is more secure and requires less storage space, with faster encryption/decryption and improved key generation quality. Conclusion: The proposed light-weight visual sign-cryptography scheme based on optimal key generation is a promising approach to enhance security and improve data sharing quality. The HESO algorithm shows promise in improving the quality of key generation, providing better privacy protection in the face of increasingly powerful digital devices.
{"title":"Optimal Multisecret Image Sharing Using Lightweight Visual Sign-Cryptography Scheme With Optimal Key Generation for Gray/Color Images","authors":"Pramod M. Bachiphale, N. Zulpe","doi":"10.1142/s0219467825500172","DOIUrl":"https://doi.org/10.1142/s0219467825500172","url":null,"abstract":"Problem: Digital devices are becoming increasingly powerful and smart, which is improving quality of life, but presents new challenges to privacy protection. Visual cryptographic schemes provide data sharing privacy, but have drawbacks such as extra storage space, lossy secret images, and the need to store permutation keys. Aim: This paper proposes a light-weight visual sign-cryptography scheme based on optimal key generation to address the disadvantages of existing visual cryptographic schemes and improve the security, sharing quality, and time consumption of multisecret images. Methods: The proposed light-weight visual sign-cryptography (LW-VSC) scheme consists of three processes: band separation, shares generation, and signcryption/un-signcryption. The process of separation and shares generation is done by an existing method. The multiple shares of the secret images are then encrypted/decrypted using light-weight sign-cryptography. The proposed scheme uses a novel harpy eagle search optimization (HESO) algorithm to generate optimal keys for both the encrypt/decrypt processes. Results: Simulation results and comparative analysis showed the proposed scheme is more secure and requires less storage space, with faster encryption/decryption and improved key generation quality. Conclusion: The proposed light-weight visual sign-cryptography scheme based on optimal key generation is a promising approach to enhance security and improve data sharing quality. The HESO algorithm shows promise in improving the quality of key generation, providing better privacy protection in the face of increasingly powerful digital devices.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45471841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-27DOI: 10.1142/s0219467825500160
Li Fang, Wang Xianghai
In recent years, image denoising methods based on total variational regularization have attracted extensive attention. However, the traditional total variational regularization method is an approximate solution based on convex method, and does not consider the particularity of the region with rich details. In this paper, the adaptive total-variation and nonconvex low-rank model for image denoising is proposed, which is a hybrid regularization model. First, the image is decomposed into sparse terms and low rank terms, and then the total variational regularization is used to denoise. At the same time, an adaptive coefficient based on gradient is constructed to adaptively judge the flat area and detail texture area, slow down the denoising intensity of detail area, and then play the role of preserving detail information. Finally, by constructing a nonconvex function, the optimal solution of the function is obtained by using the alternating minimization method. This method not only effectively removes the image noise, but also retains the detailed information of the image. The experimental results show the effectiveness of the proposed model, and SNR and SSIM of the denoised image are improved.
{"title":"Adaptive Total-Variation and Nonconvex Low-Rank Model for Image Denoising","authors":"Li Fang, Wang Xianghai","doi":"10.1142/s0219467825500160","DOIUrl":"https://doi.org/10.1142/s0219467825500160","url":null,"abstract":"In recent years, image denoising methods based on total variational regularization have attracted extensive attention. However, the traditional total variational regularization method is an approximate solution based on convex method, and does not consider the particularity of the region with rich details. In this paper, the adaptive total-variation and nonconvex low-rank model for image denoising is proposed, which is a hybrid regularization model. First, the image is decomposed into sparse terms and low rank terms, and then the total variational regularization is used to denoise. At the same time, an adaptive coefficient based on gradient is constructed to adaptively judge the flat area and detail texture area, slow down the denoising intensity of detail area, and then play the role of preserving detail information. Finally, by constructing a nonconvex function, the optimal solution of the function is obtained by using the alternating minimization method. This method not only effectively removes the image noise, but also retains the detailed information of the image. The experimental results show the effectiveness of the proposed model, and SNR and SSIM of the denoised image are improved.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44839070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-25DOI: 10.1142/s021946782450061x
Dharaj. Sangani, R. Thakker, S. Panchal, Rajesh Gogineni
In remote sensing, owing to existing sensors’ limitations and the tradeoff between signal-to-noise ratio (SNR) and instantaneous field of view (IFOV), it is difficult to obtain a single image with good spectral and spatial resolution. Pansharpening (PS) is the technique for sharpening multispectral (MS) images by extracting structural and edge information of panchromatic (PAN) image. Multiscale decomposition methods are used for decomposing image in sub-bands but are affected by ringing artifacts, therefore the resultant image seems to be blurred and misregistered. The proposed method overcomes this drawback by decomposing PAN and four band MS image into cartoon and texture components with total variation (TV) Hilbert[Formula: see text] model. The particle swarm optimization (PSO) algorithm is used for finding the optimum weight for fusing texture and cartoon details of PAN and MS images. The proposed method is practically validated on both full-scale and reduced-scale. Robustness of our proposed approach is tested on different geographical areas such as hilly, urban, and vegetation areas. From the visual analysis and qualitative parameters, the proposed method is proved effective compared with other traditional approaches.
{"title":"Remote Sensing Pansharpening with TV-H−1 Decomposition and PSO-Based Adaptive Weighting Method","authors":"Dharaj. Sangani, R. Thakker, S. Panchal, Rajesh Gogineni","doi":"10.1142/s021946782450061x","DOIUrl":"https://doi.org/10.1142/s021946782450061x","url":null,"abstract":"In remote sensing, owing to existing sensors’ limitations and the tradeoff between signal-to-noise ratio (SNR) and instantaneous field of view (IFOV), it is difficult to obtain a single image with good spectral and spatial resolution. Pansharpening (PS) is the technique for sharpening multispectral (MS) images by extracting structural and edge information of panchromatic (PAN) image. Multiscale decomposition methods are used for decomposing image in sub-bands but are affected by ringing artifacts, therefore the resultant image seems to be blurred and misregistered. The proposed method overcomes this drawback by decomposing PAN and four band MS image into cartoon and texture components with total variation (TV) Hilbert[Formula: see text] model. The particle swarm optimization (PSO) algorithm is used for finding the optimum weight for fusing texture and cartoon details of PAN and MS images. The proposed method is practically validated on both full-scale and reduced-scale. Robustness of our proposed approach is tested on different geographical areas such as hilly, urban, and vegetation areas. From the visual analysis and qualitative parameters, the proposed method is proved effective compared with other traditional approaches.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43317066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-22DOI: 10.1142/s0219467824500566
R. Bania, A. Halder
Mammography imaging is one of the most successful techniques for breast cancer screening and detecting breast lesions. Detection of the Region of Interest (ROI) (where the possible abnormalities could be present) is the backbone for the success of any Computer-Aided Detection or Diagnosis (CADx) system. In this paper, to assist the CADx system, one computational model is proposed to detect breast mass lesions from mammogram images. At the beginning of the process, pectoral muscles from the mammograms are removed as a pre-processing step. Then by applying an automatic thresholding scheme with the required image processing techniques, different regions of breast tissues are ranked to detect the possible suspected region to refine the further segmentation task. One seeded region growing approach is proposed with an automatic seed selection criterion to detect the suspected region to segment the ROI. The proposed model has very less user intervention as maximum of the parameters are computed automatically. To evaluate the performance of the proposed model, it is compared with four different methods with six different evaluation metrics viz., Jaccard & Dice co-efficient, relative error, segmentation accuracy, error and Fowlkes–Mallows index (FMI). On the proposed model, 57 mammogram images are tested, consisting of four different cases that are collected from the publicly available benchmark database. The qualitative and quantitative analyses are performed to evaluate the proposed model. The best dice co-efficient, Jaccard co-efficient, accuracy, error and FMI values observed are 0.9506, 0.9471, 95.62%, 4.38% and 0.932, respectively. The superiority of the model over six state-of-the-art compared methods is well evident from the experimental results.
{"title":"Automatic Breast Mass Lesion Detection in Mammogram Image","authors":"R. Bania, A. Halder","doi":"10.1142/s0219467824500566","DOIUrl":"https://doi.org/10.1142/s0219467824500566","url":null,"abstract":"Mammography imaging is one of the most successful techniques for breast cancer screening and detecting breast lesions. Detection of the Region of Interest (ROI) (where the possible abnormalities could be present) is the backbone for the success of any Computer-Aided Detection or Diagnosis (CADx) system. In this paper, to assist the CADx system, one computational model is proposed to detect breast mass lesions from mammogram images. At the beginning of the process, pectoral muscles from the mammograms are removed as a pre-processing step. Then by applying an automatic thresholding scheme with the required image processing techniques, different regions of breast tissues are ranked to detect the possible suspected region to refine the further segmentation task. One seeded region growing approach is proposed with an automatic seed selection criterion to detect the suspected region to segment the ROI. The proposed model has very less user intervention as maximum of the parameters are computed automatically. To evaluate the performance of the proposed model, it is compared with four different methods with six different evaluation metrics viz., Jaccard & Dice co-efficient, relative error, segmentation accuracy, error and Fowlkes–Mallows index (FMI). On the proposed model, 57 mammogram images are tested, consisting of four different cases that are collected from the publicly available benchmark database. The qualitative and quantitative analyses are performed to evaluate the proposed model. The best dice co-efficient, Jaccard co-efficient, accuracy, error and FMI values observed are 0.9506, 0.9471, 95.62%, 4.38% and 0.932, respectively. The superiority of the model over six state-of-the-art compared methods is well evident from the experimental results.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43630262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-22DOI: 10.1142/s0219467825500044
Hemlata Sahu, R. Kashyap
Medical image classification is one of the most significant tasks in computer-aided diagnosis. In the era of modern healthcare, the progress of digitalized medical images has led to a crucial role in analyzing medical image analysis. Recently, accurate disease recognition from medical Computed Tomography (CT) images remains a challenging scenario which is important in rendering effective treatment to patients. The infectious COVID-19 disease is highly contagious and leads to a rapid increase in infected individuals. Some drawbacks noticed with RT-PCR kits are high false negative rate (FNR) and a shortage in the number of test kits. Hence, a Chest CT scan is introduced instead of RT-PCR which plays an important role in diagnosing and screening COVID-19 infections. However, manual examination of CT scans performed by radiologists can be time-consuming, and a manual review of each individual CT image may not be feasible in emergencies. Therefore, there is a need to perform automated COVID-19 detection with the advances in AI-based models. This work presents effective and automatic Deep Learning (DL)-based COVID-19 detection using Chest CT images. Initially, the data is gathered and pre-processed through Spatial Weighted Bilateral Filter (SWBF) to eradicate unwanted distortions. The extraction of deep features is processed using Fine_Dense Convolutional Network (Fine_DenseNet). For classification, the Softmax layer of Fine_DenseNet is replaced using Improved Generative Adversarial Network_Artificial Hummingbird (IGAN_AHb) model in order to train the data on the labeled and unlabeled dataset. The loss in the network model is optimized using Artificial Hummingbird (AHb) optimization algorithm. Here, the proposed DL model (Fine_DenseIGANet) is used to perform automated multi-class classification of COVID-19 using CT scan images and attained a superior classification accuracy of 95.73% over other DL models.
{"title":"Fine_Denseiganet: Automatic Medical Image Classification in Chest CT Scan Using Hybrid Deep Learning Framework","authors":"Hemlata Sahu, R. Kashyap","doi":"10.1142/s0219467825500044","DOIUrl":"https://doi.org/10.1142/s0219467825500044","url":null,"abstract":"Medical image classification is one of the most significant tasks in computer-aided diagnosis. In the era of modern healthcare, the progress of digitalized medical images has led to a crucial role in analyzing medical image analysis. Recently, accurate disease recognition from medical Computed Tomography (CT) images remains a challenging scenario which is important in rendering effective treatment to patients. The infectious COVID-19 disease is highly contagious and leads to a rapid increase in infected individuals. Some drawbacks noticed with RT-PCR kits are high false negative rate (FNR) and a shortage in the number of test kits. Hence, a Chest CT scan is introduced instead of RT-PCR which plays an important role in diagnosing and screening COVID-19 infections. However, manual examination of CT scans performed by radiologists can be time-consuming, and a manual review of each individual CT image may not be feasible in emergencies. Therefore, there is a need to perform automated COVID-19 detection with the advances in AI-based models. This work presents effective and automatic Deep Learning (DL)-based COVID-19 detection using Chest CT images. Initially, the data is gathered and pre-processed through Spatial Weighted Bilateral Filter (SWBF) to eradicate unwanted distortions. The extraction of deep features is processed using Fine_Dense Convolutional Network (Fine_DenseNet). For classification, the Softmax layer of Fine_DenseNet is replaced using Improved Generative Adversarial Network_Artificial Hummingbird (IGAN_AHb) model in order to train the data on the labeled and unlabeled dataset. The loss in the network model is optimized using Artificial Hummingbird (AHb) optimization algorithm. Here, the proposed DL model (Fine_DenseIGANet) is used to perform automated multi-class classification of COVID-19 using CT scan images and attained a superior classification accuracy of 95.73% over other DL models.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42842277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-22DOI: 10.1142/s0219467825500056
Santoshi Kumari, T. P. Pushphavathi
The sentiment analysis relying on the aspect of online reviews is utilized for identifying the polarity of the given review. Nowadays, many methods are introduced for aspect-based sentiment analysis (ABSA) using neural networks, and many methods failed to consider contextual information exploitation to make the performance more accurate. Hence, this research proposed an optimized deep learning method for the detection of the aspect and to identify the polarity. Hence, in this research, an optimized deep learning technique for the ABSA is introduced by considering the online reviews, in which the deep learning classifiers are trained with the proposed Fabricius ringlet optimization (FRO) algorithm to reduce the loss that helps to enhance the accuracy of sentiment polarity prediction. The proposed FRO is developed by the hybridization of the behavioral nature of the Fabricius and the ringlet in feeding for the determination of the global best solution. The tuning of the weights and biases of the classifier enhance the performance of the classifier. The objective behind the tuning is to minimize the loss function while training and to enhance the accuracy of aspect extraction and polarity prediction of sentiment. Based on a study of the existing approach, the suggested FRO-based hybrid deep learning method is significantly improved; its accuracy, sensitivity, and specificity are 87.06%, 90.83%, and 79.37%, respectively, with a training percentage of 40%. The accuracy, sensitivity, and specificity of the existing technique have also been enhanced for aspect restaurant values, which are 87.53%, 96.06%, and 79.88% with a 60% training percentage. Similar to that, Twitter values for accuracy, sensitivity, and specificity are reported to be 89.08%, 99.35%, and 79.70%, respectively, with an 80% training percentage. The proposed method obtained the 90.13%, 99.35%, and 81.10% accuracy, sensitivity, and specificity from the assessment of the FRO-based hybrid deep learning.
{"title":"Aspect-Based Sentiment Analysis Using Fabricius Ringlet-Based Hybrid Deep Learning for Online Reviews","authors":"Santoshi Kumari, T. P. Pushphavathi","doi":"10.1142/s0219467825500056","DOIUrl":"https://doi.org/10.1142/s0219467825500056","url":null,"abstract":"The sentiment analysis relying on the aspect of online reviews is utilized for identifying the polarity of the given review. Nowadays, many methods are introduced for aspect-based sentiment analysis (ABSA) using neural networks, and many methods failed to consider contextual information exploitation to make the performance more accurate. Hence, this research proposed an optimized deep learning method for the detection of the aspect and to identify the polarity. Hence, in this research, an optimized deep learning technique for the ABSA is introduced by considering the online reviews, in which the deep learning classifiers are trained with the proposed Fabricius ringlet optimization (FRO) algorithm to reduce the loss that helps to enhance the accuracy of sentiment polarity prediction. The proposed FRO is developed by the hybridization of the behavioral nature of the Fabricius and the ringlet in feeding for the determination of the global best solution. The tuning of the weights and biases of the classifier enhance the performance of the classifier. The objective behind the tuning is to minimize the loss function while training and to enhance the accuracy of aspect extraction and polarity prediction of sentiment. Based on a study of the existing approach, the suggested FRO-based hybrid deep learning method is significantly improved; its accuracy, sensitivity, and specificity are 87.06%, 90.83%, and 79.37%, respectively, with a training percentage of 40%. The accuracy, sensitivity, and specificity of the existing technique have also been enhanced for aspect restaurant values, which are 87.53%, 96.06%, and 79.88% with a 60% training percentage. Similar to that, Twitter values for accuracy, sensitivity, and specificity are reported to be 89.08%, 99.35%, and 79.70%, respectively, with an 80% training percentage. The proposed method obtained the 90.13%, 99.35%, and 81.10% accuracy, sensitivity, and specificity from the assessment of the FRO-based hybrid deep learning.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43349570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-22DOI: 10.1142/s021946782550007x
Abhilash Kayyidavazhiyil
Although IoT sectors seem more popular and pervasively, they struggle with hazards. The botnet is one of the largest security dangers associated with IoT. It enables malicious software to administer and attack private network equipment collectively without the owners’ knowledge. Although many studies have used ML to detect botnets, these are either not very effective or only work with specific types of botnets or devices. As a result, the detection model for deep learning ideas is the focus of this research. It entails three key processes: (a) preprocessing, (b) feature extraction, and (c) classification. The input data are initially preprocessed using an improved data normalization approach. The preprocessed data are used to extract a number of features, including Tanimoto coefficient features, improved differential holoentropy-based features, Pearson r correlation-based features, and others. The detection process will be completed by an ensemble classification model that randomly shuffles models like the Deep Belief Network (DBN) model, Bidirectional Gated Recurrent Unit (Bi-GRU), and Long Short-Term Memory (LSTM). Bi-GRU, DBN, and LSTM will be averaged to provide the ensemble results. Bi-GRU is trained using the Self Improved Blue Monkey Optimization (SIBMO) Algorithm by selecting the optimal weights, which increases the detection accuracy. The overall performance of the suggested work is then evaluated in relation to other existing models using various methodologies. In comparison to existing methods, the created ensemble classifier [Formula: see text] SIBMO scheme obtains the highest accuracy (93%) at a learning percentage of 90%.
{"title":"Combined Tri-Classifiers for IoT Botnet Detection with Tuned Training Weights","authors":"Abhilash Kayyidavazhiyil","doi":"10.1142/s021946782550007x","DOIUrl":"https://doi.org/10.1142/s021946782550007x","url":null,"abstract":"Although IoT sectors seem more popular and pervasively, they struggle with hazards. The botnet is one of the largest security dangers associated with IoT. It enables malicious software to administer and attack private network equipment collectively without the owners’ knowledge. Although many studies have used ML to detect botnets, these are either not very effective or only work with specific types of botnets or devices. As a result, the detection model for deep learning ideas is the focus of this research. It entails three key processes: (a) preprocessing, (b) feature extraction, and (c) classification. The input data are initially preprocessed using an improved data normalization approach. The preprocessed data are used to extract a number of features, including Tanimoto coefficient features, improved differential holoentropy-based features, Pearson r correlation-based features, and others. The detection process will be completed by an ensemble classification model that randomly shuffles models like the Deep Belief Network (DBN) model, Bidirectional Gated Recurrent Unit (Bi-GRU), and Long Short-Term Memory (LSTM). Bi-GRU, DBN, and LSTM will be averaged to provide the ensemble results. Bi-GRU is trained using the Self Improved Blue Monkey Optimization (SIBMO) Algorithm by selecting the optimal weights, which increases the detection accuracy. The overall performance of the suggested work is then evaluated in relation to other existing models using various methodologies. In comparison to existing methods, the created ensemble classifier [Formula: see text] SIBMO scheme obtains the highest accuracy (93%) at a learning percentage of 90%.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45568914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-22DOI: 10.1142/s0219467825500019
Chaitanya Jannu, S. Vanambathina
Recent years have seen a significant amount of studies in the area of speech enhancement. This review looks at several speech improvement methods as well as Deep Neural Network (DNN) functions in speech enhancement. Speech transmissions are frequently distorted by ambient noise, background noise, and reverberations. There are processing methods, such as Short-time Fourier Transform, Short-time Autocorrelation, and Short-time Energy (STE), that can be used to enhance speech. To reduce speech noise, features such as the Mel-Frequency Cepstral Coefficients (MFCCs), Logarithmic Power Spectrum (LPS), and Gammatone Frequency Cepstral Coefficients (GFCCs) can be retrieved and input to a DNN. DNN is essential to speech improvement since it builds models using a lot of training data and evaluates the efficacy of the enhanced speech using certain performance metrics. Since the beginning of deep learning publications in 1993, a variety of speech enhancement methods have been examined in this study. This review provides a thorough examination of the several neural network topologies, training algorithms, activation functions, training targets, acoustic features, and databases that were employed for the job of speech enhancement and were gathered from various articles published between 1993 and 2022.
近年来,在语音增强领域进行了大量的研究。本文综述了几种语音增强方法以及深度神经网络(DNN)在语音增强中的作用。语音传输经常受到环境噪声、背景噪声和混响的干扰。有短时傅里叶变换、短时自相关和短时能量(STE)等处理方法可用于增强语音。为了降低语音噪声,可以检索Mel-Frequency Cepstral系数(MFCCs),对数功率谱(LPS)和gamma - one Frequency Cepstral系数(GFCCs)等特征并将其输入到DNN中。深度神经网络对语音改进至关重要,因为它使用大量训练数据构建模型,并使用某些性能指标评估增强语音的效果。自1993年深度学习出版物开始以来,本研究对各种语音增强方法进行了研究。本文对1993年至2022年间发表的各种文章中用于语音增强工作的几种神经网络拓扑、训练算法、激活函数、训练目标、声学特征和数据库进行了全面的研究。
{"title":"An Overview of Speech Enhancement Based on Deep Learning Techniques","authors":"Chaitanya Jannu, S. Vanambathina","doi":"10.1142/s0219467825500019","DOIUrl":"https://doi.org/10.1142/s0219467825500019","url":null,"abstract":"Recent years have seen a significant amount of studies in the area of speech enhancement. This review looks at several speech improvement methods as well as Deep Neural Network (DNN) functions in speech enhancement. Speech transmissions are frequently distorted by ambient noise, background noise, and reverberations. There are processing methods, such as Short-time Fourier Transform, Short-time Autocorrelation, and Short-time Energy (STE), that can be used to enhance speech. To reduce speech noise, features such as the Mel-Frequency Cepstral Coefficients (MFCCs), Logarithmic Power Spectrum (LPS), and Gammatone Frequency Cepstral Coefficients (GFCCs) can be retrieved and input to a DNN. DNN is essential to speech improvement since it builds models using a lot of training data and evaluates the efficacy of the enhanced speech using certain performance metrics. Since the beginning of deep learning publications in 1993, a variety of speech enhancement methods have been examined in this study. This review provides a thorough examination of the several neural network topologies, training algorithms, activation functions, training targets, acoustic features, and databases that were employed for the job of speech enhancement and were gathered from various articles published between 1993 and 2022.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46800367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-07-22DOI: 10.1142/s0219467825500081
Mamvinder Sharma, Sudhakara Reddy Saripalli, A. Gupta, Pankaj Palta, D. Pandey
Visualization of material composition across numerous grains and complicated networks of grain boundaries using image processing techniques can reveal fresh insights into the material’s structural evolution and upcoming functional capabilities for a variety of applications. Three-dimensional integrated circuits (3D IC) are the most practical technology for increasing transistor density in future semiconductor applications. One of the key benefits of 3D IC is heterogeneous integration, which results in shorter interconnections due to vertical stacking. However, one of the most significant challenges in building higher-density microelectronics devices is the stress generated by material mismatches in the coefficient of thermal expansion (CTE). The purpose of this study is to analyze grain boundary migration caused by variations in strain energy density using image processing methods for 3D grain continuum modeling. Temperature changes in polycrystalline structures generate stresses and strain energy densities, which may be calculated using FEM software. Single crystal Cu’s anisotropic elastic properties are twisted to suit grain orientation in space and each grain is treated as a single crystal. Grain boundary speeds are calculated using a simple model that relates grain boundary mobility to variations in strain energy density on both sides of grain boundaries. Using the grain continuum model, researchers will be able to investigate the effect of thermally generated stresses on grain boundary motion caused by atomic flux driven by strain energy. Using finite-element modeling of the grain structure in a Through Silicon Via, the stress effect on grain boundaries caused by grain rotation due to CTE mismatch was investigated (TSV). The structure must be modeled using a scanning electron microscopes Electron Backscatter Diffraction (EBSD) image (SEM). Grain growth and subsequent grain boundary rotation can be performed using the appropriate extrapolation method to measure their influence on stress and, as a result, the TSV’s overall reliability.
{"title":"Image Processing-Based Method of Evaluation of Stress from Grain Structures of Through Silicon Via (TSV)","authors":"Mamvinder Sharma, Sudhakara Reddy Saripalli, A. Gupta, Pankaj Palta, D. Pandey","doi":"10.1142/s0219467825500081","DOIUrl":"https://doi.org/10.1142/s0219467825500081","url":null,"abstract":"Visualization of material composition across numerous grains and complicated networks of grain boundaries using image processing techniques can reveal fresh insights into the material’s structural evolution and upcoming functional capabilities for a variety of applications. Three-dimensional integrated circuits (3D IC) are the most practical technology for increasing transistor density in future semiconductor applications. One of the key benefits of 3D IC is heterogeneous integration, which results in shorter interconnections due to vertical stacking. However, one of the most significant challenges in building higher-density microelectronics devices is the stress generated by material mismatches in the coefficient of thermal expansion (CTE). The purpose of this study is to analyze grain boundary migration caused by variations in strain energy density using image processing methods for 3D grain continuum modeling. Temperature changes in polycrystalline structures generate stresses and strain energy densities, which may be calculated using FEM software. Single crystal Cu’s anisotropic elastic properties are twisted to suit grain orientation in space and each grain is treated as a single crystal. Grain boundary speeds are calculated using a simple model that relates grain boundary mobility to variations in strain energy density on both sides of grain boundaries. Using the grain continuum model, researchers will be able to investigate the effect of thermally generated stresses on grain boundary motion caused by atomic flux driven by strain energy. Using finite-element modeling of the grain structure in a Through Silicon Via, the stress effect on grain boundaries caused by grain rotation due to CTE mismatch was investigated (TSV). The structure must be modeled using a scanning electron microscopes Electron Backscatter Diffraction (EBSD) image (SEM). Grain growth and subsequent grain boundary rotation can be performed using the appropriate extrapolation method to measure their influence on stress and, as a result, the TSV’s overall reliability.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2023-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45018506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}