Pub Date : 2023-01-28DOI: 10.1142/s0219467824500359
K. M. Muraleedharan, K. T. B. Kumar, Sunil John, R. K. S. Kumar
Automatic voice pathology detection enables an objective assessment of pathologies that influence the voice production strategy. By utilizing the conventional pipeline model as well as the modern deep learning-centric end-to-end methodology, numerous pathological voice analyzing techniques have been developed. The conventional methodology is still a valid choice owing to the lack of enormous amounts of training data in the study region of pathological voice. In the meantime, obtaining higher precision, higher accuracy, and stability is still a complicated task. Therefore, by amalgamating the nonlinear measure, the pathological voices are analyzed to abate such risks. The viability of six nonlinear discriminating measures derived from the phase space realm, involving healthy and pathological voice signals, is studied in this work. The analyzed parameters are Singularity spectrum coefficients ([Formula: see text], [Formula: see text] and [Formula: see text]). Correlation entropy at optimum embedding dimension ([Formula: see text]) and correlation dimension at optimum embedding dimension ([Formula: see text]). Analyzing the pathological voices with better accuracy rates is the major objective of the proposed methodology. Here, the Support Vector Machine (SVM) was utilized as the classifier. Experimentations were performed on VOiceICarfEDerico (VOICED) databases subsuming 208 healthy, as well as pathological voices, amongst these 50 samples, were utilized. Here, the model obtained 97% of accuracy with 99% as of the classifier with Gaussian kernel function. Therefore, to differentiate normal as well as pathological subjects, the six proposed characteristics are highly beneficial; in addition, they will be supportive in pathology diagnosis.
{"title":"Combined Use of Nonlinear Measures for Analyzing Pathological Voices","authors":"K. M. Muraleedharan, K. T. B. Kumar, Sunil John, R. K. S. Kumar","doi":"10.1142/s0219467824500359","DOIUrl":"https://doi.org/10.1142/s0219467824500359","url":null,"abstract":"Automatic voice pathology detection enables an objective assessment of pathologies that influence the voice production strategy. By utilizing the conventional pipeline model as well as the modern deep learning-centric end-to-end methodology, numerous pathological voice analyzing techniques have been developed. The conventional methodology is still a valid choice owing to the lack of enormous amounts of training data in the study region of pathological voice. In the meantime, obtaining higher precision, higher accuracy, and stability is still a complicated task. Therefore, by amalgamating the nonlinear measure, the pathological voices are analyzed to abate such risks. The viability of six nonlinear discriminating measures derived from the phase space realm, involving healthy and pathological voice signals, is studied in this work. The analyzed parameters are Singularity spectrum coefficients ([Formula: see text], [Formula: see text] and [Formula: see text]). Correlation entropy at optimum embedding dimension ([Formula: see text]) and correlation dimension at optimum embedding dimension ([Formula: see text]). Analyzing the pathological voices with better accuracy rates is the major objective of the proposed methodology. Here, the Support Vector Machine (SVM) was utilized as the classifier. Experimentations were performed on VOiceICarfEDerico (VOICED) databases subsuming 208 healthy, as well as pathological voices, amongst these 50 samples, were utilized. Here, the model obtained 97% of accuracy with 99% as of the classifier with Gaussian kernel function. Therefore, to differentiate normal as well as pathological subjects, the six proposed characteristics are highly beneficial; in addition, they will be supportive in pathology diagnosis.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":"1 4","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41247211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-25DOI: 10.1142/s0219467824500293
Jovi Joseph, S. Sreela
Diabetic Retinopathy (DR) and Glaucoma are two of the most common causes of vision loss world-wide. However, it can be averted if therapy is begun early enough. In biomedical applications, the use of digital image processing has assisted in the automated identification of some ailments at an earlier stage. To make this prediction generally neural network classifier models were previously used, but these models have the drawback of being unable to detect multiple illnesses that occur in the eye at the same time and require a big database for successful classifier training. As a result, a model is needed to reliably distinguish DR and Glaucoma in diabetic individuals more accurately and with minimum dataset images. In this view, this study introduced Mayfly Optimized Deep Convolutional Network (MODCN) model for automated disease detection in the fundus retina images. In the MODCN model, the images are initially preprocessed, segmented at generator in the GAN model then a discriminator readily gives synthesis of real images of the fundus retina, thus a wide database has been created and considered as training images for the MODCN classifier. MODCN classifier has a modified high-density layer as a transition layer to avoid overfitting and the errors are minimized by tuning the hyperparameters using Mayfly Optimization Algorithm. After feature mapping, the classes normal, DR and Glaucoma are labeled and stored. At the testing stage, images are preprocessed, feature mapped and classified in the MODCN model. Thus, the proposed MODCN model detects multiple illness such as Diabetic Retinopathy and Glaucoma at the same time even with a small amount of database that performs a successful classifier training. This model is then evaluated and gives an accuracy of 99% that was higher compared to previous models.
{"title":"MODCN: Fine-Tuned Deep Convolutional Neural Network with GAN Deployed to Forecast Diabetic Eye Damage in Fundus Retinal Images","authors":"Jovi Joseph, S. Sreela","doi":"10.1142/s0219467824500293","DOIUrl":"https://doi.org/10.1142/s0219467824500293","url":null,"abstract":"Diabetic Retinopathy (DR) and Glaucoma are two of the most common causes of vision loss world-wide. However, it can be averted if therapy is begun early enough. In biomedical applications, the use of digital image processing has assisted in the automated identification of some ailments at an earlier stage. To make this prediction generally neural network classifier models were previously used, but these models have the drawback of being unable to detect multiple illnesses that occur in the eye at the same time and require a big database for successful classifier training. As a result, a model is needed to reliably distinguish DR and Glaucoma in diabetic individuals more accurately and with minimum dataset images. In this view, this study introduced Mayfly Optimized Deep Convolutional Network (MODCN) model for automated disease detection in the fundus retina images. In the MODCN model, the images are initially preprocessed, segmented at generator in the GAN model then a discriminator readily gives synthesis of real images of the fundus retina, thus a wide database has been created and considered as training images for the MODCN classifier. MODCN classifier has a modified high-density layer as a transition layer to avoid overfitting and the errors are minimized by tuning the hyperparameters using Mayfly Optimization Algorithm. After feature mapping, the classes normal, DR and Glaucoma are labeled and stored. At the testing stage, images are preprocessed, feature mapped and classified in the MODCN model. Thus, the proposed MODCN model detects multiple illness such as Diabetic Retinopathy and Glaucoma at the same time even with a small amount of database that performs a successful classifier training. This model is then evaluated and gives an accuracy of 99% that was higher compared to previous models.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43526405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-20DOI: 10.1142/s021946782450013x
K. V. Rao, B. V. Ramana Reddy
Stock market forecasting is a significant task, and investing in the stock marketplace is a significant part of monetary research due to its high risk. Therefore, accurate forecasting of stock market analysis is still a challenge. Due to stable and volatile data, stock market forecasting remains a major challenge for investors. Recent machine learning (ML) models have been able to reduce the risk of stock market forecasting. However, diversity remains a key challenge in developing better erudition models and extracts more intellectually priceless qualities to auxiliary advanced predictability. In this paper, we propose an efficient strategy optimization using a hybrid ML model for stock market prediction (HM-SMP). The first contribution of the proposed HM-SMP model is to introduce chaos-enhanced firefly bowerbird optimization (CEFBO) algorithm for optimal feature selection among multiple features which reduce the data dimensionality. Second, we develop a hybrid multi-objective capuchin with a recurrent neural network (HC-RNN) for the prediction of the stock market which enhances the prediction accuracy. We use supervised RNN to predict the closing price. Finally, to estimate the presence of the proposed HM-SMP model through the benchmark, stock market datasets and the performance can be compared with the existing state-of-the-art models in terms of accuracy, precision, recall, and [Formula: see text]-measure.
{"title":"HM-SMF: An Efficient Strategy Optimization using a Hybrid Machine Learning Model for Stock Market Prediction","authors":"K. V. Rao, B. V. Ramana Reddy","doi":"10.1142/s021946782450013x","DOIUrl":"https://doi.org/10.1142/s021946782450013x","url":null,"abstract":"Stock market forecasting is a significant task, and investing in the stock marketplace is a significant part of monetary research due to its high risk. Therefore, accurate forecasting of stock market analysis is still a challenge. Due to stable and volatile data, stock market forecasting remains a major challenge for investors. Recent machine learning (ML) models have been able to reduce the risk of stock market forecasting. However, diversity remains a key challenge in developing better erudition models and extracts more intellectually priceless qualities to auxiliary advanced predictability. In this paper, we propose an efficient strategy optimization using a hybrid ML model for stock market prediction (HM-SMP). The first contribution of the proposed HM-SMP model is to introduce chaos-enhanced firefly bowerbird optimization (CEFBO) algorithm for optimal feature selection among multiple features which reduce the data dimensionality. Second, we develop a hybrid multi-objective capuchin with a recurrent neural network (HC-RNN) for the prediction of the stock market which enhances the prediction accuracy. We use supervised RNN to predict the closing price. Finally, to estimate the presence of the proposed HM-SMP model through the benchmark, stock market datasets and the performance can be compared with the existing state-of-the-art models in terms of accuracy, precision, recall, and [Formula: see text]-measure.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47874406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-20DOI: 10.1142/s0219467824500104
Swetha Gadde, J. Amutharaj, S. Usha
Currently, the size of multimedia data is rising gradually from gigabytes to petabytes, due to the progression of a larger quantity of realistic data. The majority of big data is conveyed via the internet and they were accumulated on cloud servers. Since cloud computing offers internet-oriented services, there were a lot of attackers and malevolent users. They always attempt to deploy the private data of users without any right access. At certain times, they substitute the real data by any counterfeit data. As a result, data protection has turned out to be a noteworthy concern in recent times. This paper aims to establish an optimization-based privacy preservation model for preserving multimedia data by selecting the optimal secret key. Here, the encryption and decryption process is carried out by Improved Blowfish cryptographic technique, where the sensitive data in cloud server is preserved using the optimal key. Optimal key generation is the significant procedure to ensure the objectives of integrity and confidentiality. Likewise, data restoration is the inverse process of sanitization (decryption). In both the cases, key generation remains a major aspect, which is optimally chosen by a novel hybrid algorithm termed as “Clan based Crow Search with Adaptive Awareness probability (CCS-AAP). Finally, an analysis is carried out to validate the improvement of the proposed method.
{"title":"Cloud Multimedia Data Security by Optimization-Assisted Cryptographic Technique","authors":"Swetha Gadde, J. Amutharaj, S. Usha","doi":"10.1142/s0219467824500104","DOIUrl":"https://doi.org/10.1142/s0219467824500104","url":null,"abstract":"Currently, the size of multimedia data is rising gradually from gigabytes to petabytes, due to the progression of a larger quantity of realistic data. The majority of big data is conveyed via the internet and they were accumulated on cloud servers. Since cloud computing offers internet-oriented services, there were a lot of attackers and malevolent users. They always attempt to deploy the private data of users without any right access. At certain times, they substitute the real data by any counterfeit data. As a result, data protection has turned out to be a noteworthy concern in recent times. This paper aims to establish an optimization-based privacy preservation model for preserving multimedia data by selecting the optimal secret key. Here, the encryption and decryption process is carried out by Improved Blowfish cryptographic technique, where the sensitive data in cloud server is preserved using the optimal key. Optimal key generation is the significant procedure to ensure the objectives of integrity and confidentiality. Likewise, data restoration is the inverse process of sanitization (decryption). In both the cases, key generation remains a major aspect, which is optimally chosen by a novel hybrid algorithm termed as “Clan based Crow Search with Adaptive Awareness probability (CCS-AAP). Finally, an analysis is carried out to validate the improvement of the proposed method.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45497973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-07DOI: 10.1142/s0219467824500311
A. Begum, Prabha Selvaraj
Alzheimer’s disease (AD) is a popular neurological disorder affecting a critical part of the world’s population. Its early diagnosis is extremely imperative for enhancing the quality of patients’ lives. Recently, improved technologies like image processing, artificial intelligence involving machine learning, deep learning, and transfer learning have been introduced for detecting AD. This review describes the contribution of image processing, feature extraction, optimization, and classification approach in AD recognition. It deeply investigates different methods adopted for multiclass diagnosis of AD. The paper further presents a brief comparison of existing AD studies in terms of techniques adopted, performance measures, classification accuracy, publication year, and datasets. It then summarizes the important technical barriers in reviewed works. This paper allows the readers to gain profound knowledge regarding AD diagnosis for promoting extensive research in this field.
{"title":"Multiclass Diagnosis of Alzheimer’s Disease Analysis Using Machine Learning and Deep Learning Techniques","authors":"A. Begum, Prabha Selvaraj","doi":"10.1142/s0219467824500311","DOIUrl":"https://doi.org/10.1142/s0219467824500311","url":null,"abstract":"Alzheimer’s disease (AD) is a popular neurological disorder affecting a critical part of the world’s population. Its early diagnosis is extremely imperative for enhancing the quality of patients’ lives. Recently, improved technologies like image processing, artificial intelligence involving machine learning, deep learning, and transfer learning have been introduced for detecting AD. This review describes the contribution of image processing, feature extraction, optimization, and classification approach in AD recognition. It deeply investigates different methods adopted for multiclass diagnosis of AD. The paper further presents a brief comparison of existing AD studies in terms of techniques adopted, performance measures, classification accuracy, publication year, and datasets. It then summarizes the important technical barriers in reviewed works. This paper allows the readers to gain profound knowledge regarding AD diagnosis for promoting extensive research in this field.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44673990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-05DOI: 10.1142/s0219467823400119
P. Sasikala, L. Mary Immaculate Sheela
Globally, people’s health and wealth are affected by the outbreak of the corona virus. It is a virus, which infects from common fever to severe acute respiratory syndrome. It has the potency to transmit from one person to another. It is established that this virus spread is augmenting speedily devoid of any symptoms. Therefore, the prediction of this outbreak situation with mathematical modelling is highly significant along with necessary. To produce informed decisions along with to adopt pertinent control measures, a number of outbreak prediction methodologies for COVID-19 are being utilized by officials worldwide. An effectual COVID-19 outbreaks’ prediction by employing Squirrel Search Optimization Algorithm centric Tanh Multi-Layer Perceptron Neural Network (MLPNN) (SSOA-TMLPNN) along with Auto-Regressive Integrated Moving Average (ARIMA) methodologies is proposed here. Initially, from the openly accessible sources, the input time series COVID-19 data are amassed. Then, pre-processing is performed for better classification outcomes after collecting the data. Next, by utilizing Sine-centered Empirical Mode Decomposition (S-EMD) methodology, the data decomposition is executed. Subsequently, the data are input to the Brownian motion Intense (BI) - SSOA-TMLPNN classifier. In this, the diseased, recovered, and death cases in the country are classified. After that, regarding the time-series data, the corona-virus’s future outbreak is predicted by employing ARIMA. Afterwards, data visualization is conducted. Lastly, to evaluate the proposed model’s efficacy, its outcomes are analogized with certain prevailing methodologies. The obtained outcomes revealed that the proposed methodology surpassed the other existing methodologies.
{"title":"An Efficient COVID-19 Disease Outbreak Prediction Using BI-SSOA-TMLPNN and ARIMA","authors":"P. Sasikala, L. Mary Immaculate Sheela","doi":"10.1142/s0219467823400119","DOIUrl":"https://doi.org/10.1142/s0219467823400119","url":null,"abstract":"Globally, people’s health and wealth are affected by the outbreak of the corona virus. It is a virus, which infects from common fever to severe acute respiratory syndrome. It has the potency to transmit from one person to another. It is established that this virus spread is augmenting speedily devoid of any symptoms. Therefore, the prediction of this outbreak situation with mathematical modelling is highly significant along with necessary. To produce informed decisions along with to adopt pertinent control measures, a number of outbreak prediction methodologies for COVID-19 are being utilized by officials worldwide. An effectual COVID-19 outbreaks’ prediction by employing Squirrel Search Optimization Algorithm centric Tanh Multi-Layer Perceptron Neural Network (MLPNN) (SSOA-TMLPNN) along with Auto-Regressive Integrated Moving Average (ARIMA) methodologies is proposed here. Initially, from the openly accessible sources, the input time series COVID-19 data are amassed. Then, pre-processing is performed for better classification outcomes after collecting the data. Next, by utilizing Sine-centered Empirical Mode Decomposition (S-EMD) methodology, the data decomposition is executed. Subsequently, the data are input to the Brownian motion Intense (BI) - SSOA-TMLPNN classifier. In this, the diseased, recovered, and death cases in the country are classified. After that, regarding the time-series data, the corona-virus’s future outbreak is predicted by employing ARIMA. Afterwards, data visualization is conducted. Lastly, to evaluate the proposed model’s efficacy, its outcomes are analogized with certain prevailing methodologies. The obtained outcomes revealed that the proposed methodology surpassed the other existing methodologies.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44853542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.1142/S0219467822400095
K. P. Kumar, C. Venkata Narasimhulu, K. Prasad
{"title":"2D Wavelet Tree Ordering Based Localized Total Variation Model for Efficient Image Restoration","authors":"K. P. Kumar, C. Venkata Narasimhulu, K. Prasad","doi":"10.1142/S0219467822400095","DOIUrl":"https://doi.org/10.1142/S0219467822400095","url":null,"abstract":"","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":"23 1","pages":"2240009:1-2240009:16"},"PeriodicalIF":1.6,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64448468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.1142/S0219467823500092
Marcos J. C. E. Azevedo, C. Mello
{"title":"Edge Detection in Natural Scenes Inspired by the Speed Drawing Challenge","authors":"Marcos J. C. E. Azevedo, C. Mello","doi":"10.1142/S0219467823500092","DOIUrl":"https://doi.org/10.1142/S0219467823500092","url":null,"abstract":"","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":"23 1","pages":"2350009:1-2350009:20"},"PeriodicalIF":1.6,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"64448625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-12-15DOI: 10.1142/s0219467824500116
Hailong Yang, Yinghao Liu, Tian Xia
In this paper, a new detection method of pin defects based on image segmentation and ResNe-50 is proposed, which realizes the defect detection of faulty pins in many aviation connectors. In this paper, a new dataset image segmentation method is used to segment many aviation connectors in a single image to generate a dataset, which reduces the tedious work of manually labeling the dataset. In the defect detection model, based on ResNet-50, a ResNet-B residual structure is introduced to reduce the loss of features during information extraction; a continuously differentiable CELU is used as the activation function to reduce the neuron death problem of ReLU; a new deformable convolution network (DCN v2) is introduced as the convolution kernel structure of the model to improve the recognition of aviation connectors with prominent geometric deformation pin recognition. The improved model achieved 97.2% and 94.4% accuracy for skewed and missing pins, respectively, in the experiments. The detection accuracy improved by 1.91% to 96.62% compared to the conventional ResNet-50. Compared with the traditional model, the improved model has better generalization ability.
{"title":"Defect Detection Scheme of Pins for Aviation Connectors Based on Image Segmentation and Improved RESNET-50","authors":"Hailong Yang, Yinghao Liu, Tian Xia","doi":"10.1142/s0219467824500116","DOIUrl":"https://doi.org/10.1142/s0219467824500116","url":null,"abstract":"In this paper, a new detection method of pin defects based on image segmentation and ResNe-50 is proposed, which realizes the defect detection of faulty pins in many aviation connectors. In this paper, a new dataset image segmentation method is used to segment many aviation connectors in a single image to generate a dataset, which reduces the tedious work of manually labeling the dataset. In the defect detection model, based on ResNet-50, a ResNet-B residual structure is introduced to reduce the loss of features during information extraction; a continuously differentiable CELU is used as the activation function to reduce the neuron death problem of ReLU; a new deformable convolution network (DCN v2) is introduced as the convolution kernel structure of the model to improve the recognition of aviation connectors with prominent geometric deformation pin recognition. The improved model achieved 97.2% and 94.4% accuracy for skewed and missing pins, respectively, in the experiments. The detection accuracy improved by 1.91% to 96.62% compared to the conventional ResNet-50. Compared with the traditional model, the improved model has better generalization ability.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42383753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Endocardial and epicardial border identification has been of extensive interest in cardiac Magnetic Resonance Images (MRIs). It is a difficult job to segment the epicardium and endocardium accurately and automatically from cardiac MRI owing to the cardiac tissues’ complexity even though the prevailing Deep Learning (DL) methodologies had attained significant success in medical imaging segmentation. Hence, by employing effectual ResNeXT-50-centric Inverse Edge Indicator Level Set (IEILS) and anisotropic sigmoid diffusion algorithms, this system has proposed cardiac MRI segmentation. The work has endured some function for an effectual partition of epicardium and endocardium. Initially, by employing the Truncated Kernel Function (TK)-Trilateral Filter, the noise removal function is executed on the input cardiac MRI. Next, by wielding the ResNeXT-50 IEILS, the Left and Right Ventricular (LV/RV) regions are segmented. The epicardium and endocardium are segmented by the ASD algorithm once the LV/RV is separated from the Left Ventricle (LV) region. Here, the openly accessible Sunnybrook and the Right Ventricle (RV) datasets are wielded. Then, the prevailing state-of-art algorithms are analogized to the outcomes achieved by the proposed framework. Regarding accuracy, sensitivity, and specificity, the proposed methodology executed the cardiac MRI segmentation process precisely along with the other surpassed state-of-the-art methodologies.
{"title":"Cardiac MRI Segmentation Using Efficient ResNeXT-50-Based IEI Level Set and Anisotropic Sigmoid Diffusion Algorithms","authors":"Anupama Bhan, Parthasarathi Mangipudi, Ayush Goyal","doi":"10.1142/s0219467823400144","DOIUrl":"https://doi.org/10.1142/s0219467823400144","url":null,"abstract":"Endocardial and epicardial border identification has been of extensive interest in cardiac Magnetic Resonance Images (MRIs). It is a difficult job to segment the epicardium and endocardium accurately and automatically from cardiac MRI owing to the cardiac tissues’ complexity even though the prevailing Deep Learning (DL) methodologies had attained significant success in medical imaging segmentation. Hence, by employing effectual ResNeXT-50-centric Inverse Edge Indicator Level Set (IEILS) and anisotropic sigmoid diffusion algorithms, this system has proposed cardiac MRI segmentation. The work has endured some function for an effectual partition of epicardium and endocardium. Initially, by employing the Truncated Kernel Function (TK)-Trilateral Filter, the noise removal function is executed on the input cardiac MRI. Next, by wielding the ResNeXT-50 IEILS, the Left and Right Ventricular (LV/RV) regions are segmented. The epicardium and endocardium are segmented by the ASD algorithm once the LV/RV is separated from the Left Ventricle (LV) region. Here, the openly accessible Sunnybrook and the Right Ventricle (RV) datasets are wielded. Then, the prevailing state-of-art algorithms are analogized to the outcomes achieved by the proposed framework. Regarding accuracy, sensitivity, and specificity, the proposed methodology executed the cardiac MRI segmentation process precisely along with the other surpassed state-of-the-art methodologies.","PeriodicalId":44688,"journal":{"name":"International Journal of Image and Graphics","volume":" ","pages":""},"PeriodicalIF":1.6,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42866800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}