首页 > 最新文献

Turkish Journal of Electrical Engineering and Computer Sciences最新文献

英文 中文
A comparative study of blind source separation methods 盲源分离方法比较研究
IF 1.1 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-11-30 DOI: 10.55730/1300-0632.4047
Burak Baysal, Mehmet Önder Efe
{"title":"A comparative study of blind source separation methods","authors":"Burak Baysal, Mehmet Önder Efe","doi":"10.55730/1300-0632.4047","DOIUrl":"https://doi.org/10.55730/1300-0632.4047","url":null,"abstract":"","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"30 6","pages":""},"PeriodicalIF":1.1,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139205358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature distillation from vision-language model for semisupervised action classification 基于半监督动作分类的视觉语言模型特征提取
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4038
ASLI ÇELİK, AYHAN KÜÇÜKMANİSA, OĞUZHAN URHAN
: The training of supervised machine learning approaches is critically dependent on annotating large-scale datasets. Semisupervised learning approaches aim to achieve compatible performance with supervised methods using relatively less annotation without sacrificing good generalization capacity. In line with this objective, ways of leveraging unlabeled data have been the subject of intense research. However, semisupervised video action recognition has received relatively less attention compared to image domain implementations. Existing semisupervised video action recognition methods trained from scratch rely heavily on augmentation techniques, complex architectures, and/or the use of other modalities while distillation-based methods use models that have only been trained for 2D computer vision tasks. In another line of work, pretrained vision-language models have shown very promising results for generating general-purpose visual features with reports of high zero-shot performance for many downstream tasks. In this work, we exploit a language-supervised visual encoder for learning video representations for video action classification tasks. We propose a teacher-student learning paradigm through feature distillation and pseudo-labeling. Our experimental results are a proof-of-concept revealing that multimodal feature extractors can be utilized for spatiotemporal feature extraction in a semisupervised learning context and show compatible performance with SOTA methods, especially in a low-label regime.
{"title":"Feature distillation from vision-language model for semisupervised action classification","authors":"ASLI ÇELİK, AYHAN KÜÇÜKMANİSA, OĞUZHAN URHAN","doi":"10.55730/1300-0632.4038","DOIUrl":"https://doi.org/10.55730/1300-0632.4038","url":null,"abstract":": The training of supervised machine learning approaches is critically dependent on annotating large-scale datasets. Semisupervised learning approaches aim to achieve compatible performance with supervised methods using relatively less annotation without sacrificing good generalization capacity. In line with this objective, ways of leveraging unlabeled data have been the subject of intense research. However, semisupervised video action recognition has received relatively less attention compared to image domain implementations. Existing semisupervised video action recognition methods trained from scratch rely heavily on augmentation techniques, complex architectures, and/or the use of other modalities while distillation-based methods use models that have only been trained for 2D computer vision tasks. In another line of work, pretrained vision-language models have shown very promising results for generating general-purpose visual features with reports of high zero-shot performance for many downstream tasks. In this work, we exploit a language-supervised visual encoder for learning video representations for video action classification tasks. We propose a teacher-student learning paradigm through feature distillation and pseudo-labeling. Our experimental results are a proof-of-concept revealing that multimodal feature extractors can be utilized for spatiotemporal feature extraction in a semisupervised learning context and show compatible performance with SOTA methods, especially in a low-label regime.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A unique hybrid domain hand-crafted feature to classify colorectal tissue histopathological images using multiheaded CNN 一种独特的混合域手工特征,用于使用多头CNN对结直肠组织病理图像进行分类
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4030
ANURODH KUMAR, AMIT VISHWAKARMA, VARUN BAJAJ
{"title":"A unique hybrid domain hand-crafted feature to classify colorectal tissue histopathological images using multiheaded CNN","authors":"ANURODH KUMAR, AMIT VISHWAKARMA, VARUN BAJAJ","doi":"10.55730/1300-0632.4030","DOIUrl":"https://doi.org/10.55730/1300-0632.4030","url":null,"abstract":"","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TRCaptionNet: A novel and accurate deep Turkish image captioning model with vision transformer based image encoders and deep linguistic text decoders TRCaptionNet:一种新颖而准确的深度土耳其语图像字幕模型,该模型使用基于视觉转换器的图像编码器和深度语言文本解码器
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4035
SERDAR YILDIZ, ABBAS MEMİŞ, SONGÜL VARLI
{"title":"TRCaptionNet: A novel and accurate deep Turkish image captioning model with vision transformer based image encoders and deep linguistic text decoders","authors":"SERDAR YILDIZ, ABBAS MEMİŞ, SONGÜL VARLI","doi":"10.55730/1300-0632.4035","DOIUrl":"https://doi.org/10.55730/1300-0632.4035","url":null,"abstract":"","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Infrared imaging segmentation employing an explainable deep neural network 红外成像分割采用可解释的深度神经网络
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4032
XINFEI LIAO, DAN WANG, ZAIRAN LI, NILANJAN DEY, RS SIMON, FUQIAN SHI
Explainable AI (XAI) improved by a deep neural network (DNN) of a residual neural network (ResNet) and long short-term memory networks (LSTMs), termed XAIRL, is proposed for segmenting foot infrared imaging datasets. First, an infrared sensor imaging dataset is acquired by a foot infrared sensor imaging device and preprocessed. The infrared sensor image features are then defined and extracted with XAIRL being applied to segment the dataset. This paper compares and discusses our results with XAIRL. Evaluation indices are applied to perform various measurements for foot infrared image segmentation including accuracy, precision, recall, F1 score, intersection over union (IoU), Dice similarity coefficient, mean intersection of union, boundary displacement error (BDE), Hausdorff distance, and receiver operating characteristic (ROC). Compared to results from the literature, XAIRL shows the highest overall performance, achieving accuracy of 0.93, precision of 0.91, recall of 0.95, and F1 score of 0.93. XAIRL also displays the highest IoU, Dice similarity coefficient, and ROC curve and the lowest BDE and Hausdorff distance. Although U-Net performs well for most metrics, Mask R-CNN shows slightly worse performance but still outperforms the random forest and support vector machine algorithms. By building a high-quality foot infrared imaging dataset, machine learning-based algorithms can accurately analyze foot temperature and pressure distribution. These models can then be used to customize shoes for individual wearers, improving their comfort and reducing the risk of foot injuries, particularly for those with high blood pressure.
提出了一种基于残差神经网络(ResNet)和长短期记忆网络(LSTMs)的深度神经网络(DNN)改进的可解释人工智能(XAI),称为XAIRL,用于足部红外成像数据集的分割。首先,利用足部红外传感器成像装置获取红外传感器成像数据集并进行预处理;然后定义和提取红外传感器图像特征,并应用XAIRL对数据集进行分割。本文将我们的结果与XAIRL进行了比较和讨论。采用评价指标对足部红外图像分割进行了准确度、精密度、召回率、F1评分、交汇交汇(IoU)、Dice相似系数、交汇交汇平均、边界位移误差(BDE)、豪斯多夫距离和接收机工作特征(ROC)等评价。与文献结果相比,XAIRL的综合性能最高,准确率为0.93,精密度为0.91,召回率为0.95,F1得分为0.93。XAIRL的IoU、Dice相似系数和ROC曲线最高,BDE和Hausdorff距离最低。虽然U-Net在大多数指标上表现良好,但Mask R-CNN的表现略差,但仍然优于随机森林和支持向量机算法。通过建立高质量的足部红外成像数据集,基于机器学习的算法可以准确分析足部温度和压力分布。然后,这些模型可以用于为个人穿着者定制鞋子,提高舒适度,降低足部受伤的风险,特别是对那些高血压患者。
{"title":"Infrared imaging segmentation employing an explainable deep neural network","authors":"XINFEI LIAO, DAN WANG, ZAIRAN LI, NILANJAN DEY, RS SIMON, FUQIAN SHI","doi":"10.55730/1300-0632.4032","DOIUrl":"https://doi.org/10.55730/1300-0632.4032","url":null,"abstract":"Explainable AI (XAI) improved by a deep neural network (DNN) of a residual neural network (ResNet) and long short-term memory networks (LSTMs), termed XAIRL, is proposed for segmenting foot infrared imaging datasets. First, an infrared sensor imaging dataset is acquired by a foot infrared sensor imaging device and preprocessed. The infrared sensor image features are then defined and extracted with XAIRL being applied to segment the dataset. This paper compares and discusses our results with XAIRL. Evaluation indices are applied to perform various measurements for foot infrared image segmentation including accuracy, precision, recall, F1 score, intersection over union (IoU), Dice similarity coefficient, mean intersection of union, boundary displacement error (BDE), Hausdorff distance, and receiver operating characteristic (ROC). Compared to results from the literature, XAIRL shows the highest overall performance, achieving accuracy of 0.93, precision of 0.91, recall of 0.95, and F1 score of 0.93. XAIRL also displays the highest IoU, Dice similarity coefficient, and ROC curve and the lowest BDE and Hausdorff distance. Although U-Net performs well for most metrics, Mask R-CNN shows slightly worse performance but still outperforms the random forest and support vector machine algorithms. By building a high-quality foot infrared imaging dataset, machine learning-based algorithms can accurately analyze foot temperature and pressure distribution. These models can then be used to customize shoes for individual wearers, improving their comfort and reducing the risk of foot injuries, particularly for those with high blood pressure.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cognitive digital modelling for hyperspectral image classification using transfer learning model 基于迁移学习模型的高光谱图像分类认知数字建模
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4033
MOHAMMAD SHABAZ, MUKESH SONI
: Deep convolutional neural networks can fully use the intrinsic relationship between features and improve the separability of hyperspectral images, which has received extensive in recent years. However, the need for a large number of labelled samples to train deep network models limits the application of such methods. The idea of transfer learning is introduced into remote sensing image classification to reduce the need for the number of labelled samples. In particular, the situation in which each class in the target picture only has one labelled sample is investigated. In the target domain, the number of training samples is enlarged by the homogenous region obtained by segmenting the target image. On this basis, the deep Siamese convolutional neural network is used to reduce the distribution difference between the source domain image and the target domain image to achieve the final result of the target hyperspectral image classification. The experimental results show that the combination of homogenous region and Siamese convolutional network can improve the classification effect of semisupervised transfer learning and better solve cross-regional hyperspectral image classification.
{"title":"Cognitive digital modelling for hyperspectral image classification using transfer learning model","authors":"MOHAMMAD SHABAZ, MUKESH SONI","doi":"10.55730/1300-0632.4033","DOIUrl":"https://doi.org/10.55730/1300-0632.4033","url":null,"abstract":": Deep convolutional neural networks can fully use the intrinsic relationship between features and improve the separability of hyperspectral images, which has received extensive in recent years. However, the need for a large number of labelled samples to train deep network models limits the application of such methods. The idea of transfer learning is introduced into remote sensing image classification to reduce the need for the number of labelled samples. In particular, the situation in which each class in the target picture only has one labelled sample is investigated. In the target domain, the number of training samples is enlarged by the homogenous region obtained by segmenting the target image. On this basis, the deep Siamese convolutional neural network is used to reduce the distribution difference between the source domain image and the target domain image to achieve the final result of the target hyperspectral image classification. The experimental results show that the combination of homogenous region and Siamese convolutional network can improve the classification effect of semisupervised transfer learning and better solve cross-regional hyperspectral image classification.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-view brain tumor segmentation (MVBTS): An ensemble of planar and triplanar attention UNets 多视图脑肿瘤分割(MVBTS):一种平面和三平面注意力单元的集合
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4026
SNEHAL RAJPUT, RUPAL KAPDI, MEHUL RAVAL, MOHENDRA ROY
{"title":"Multi-view brain tumor segmentation (MVBTS): An ensemble of planar and triplanar attention UNets","authors":"SNEHAL RAJPUT, RUPAL KAPDI, MEHUL RAVAL, MOHENDRA ROY","doi":"10.55730/1300-0632.4026","DOIUrl":"https://doi.org/10.55730/1300-0632.4026","url":null,"abstract":"","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"298 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid machine learning model to predict chronic kidney diseases using handcrafted features for early health rehabilitation 混合机器学习模型预测慢性肾脏疾病,使用手工制作的早期健康康复特征
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4028
AMJAD REHMAN, TANZILA SABA, HAIDER ALI, NARMINE ELHAKIM, NOOR AYESHA
Chronic kidney diseases proliferate due to hypertension, diabetes, anemia, obesity, smoking etc. Patients with such conditions are sometimes unaware of first symptoms, complicating disease diagnosis. This paper presents chronic kidney disease (CKD) prediction model to classify CKD patients from NCKD (Non-CKD). The proposed study has two main stages. First, we found the odds ratio through logistic regression and comparison test to identify early risk factors from kidneys? MRI and differentiate CKD from NCKD subjects. In stage 2, LR, LDA, MLP classifiers were applied to predict CKD and NCKD by extracting features from MRI. The odds ratio of blood glucose random and serum creatinine was found higher, and levels of sodium, Potassium, packed cell volume, white blood cell count, and red blood cell count were found lesser in CKD. The comparison results show increase levels in blood glucose random, serum creatinine and decreased levels found in sodium, potassium, packed cell volume, White blood cell and red blood cell count respectively in CKD patients than NCKD subjects. The accuracies of LR were 98.5% and 97.5% for train & test datasets. While LDA accuracy was 96.07% and 96.6% for train and test datasets. Likewise, MLP attained were 95% and 94.1% accuracy for train and test datasets. Finally, we used 5-fold CV approach on the LR model. The mean accuracies of LR were 0.954 and 0.942 for training and testing data respectively. According to LR the serum creatinine, Albumin, Diabetes mellitus, red blood cells count, pus cell and hypertension were found to be the most significant features to discriminate the CKD patients from NCKD. The proposed strategy is best suited for practical implementation for reducing the disease's prevalence.
慢性肾脏疾病因高血压、糖尿病、贫血、肥胖、吸烟等而增加。患有这种疾病的患者有时不知道最初的症状,使疾病诊断复杂化。本文提出慢性肾脏疾病(CKD)预测模型,将CKD患者与NCKD(非CKD)进行分类。拟议中的研究分为两个主要阶段。首先,我们通过logistic回归和比较检验发现比值比可以识别肾脏早期危险因素。MRI和区分CKD和NCKD受试者。在第2阶段,LR、LDA、MLP分类器通过提取MRI特征来预测CKD和NCKD。随机血糖和血清肌酐的比值比较高,而钠、钾水平、堆积细胞体积、白细胞计数和红细胞计数在CKD中较低。对比结果显示,CKD患者血糖随机升高,血清肌酐水平升高,钠、钾、堆积细胞体积、白细胞和红细胞计数下降。对于训练和测试数据集,LR的准确率分别为98.5%和97.5%。训练集和测试集的LDA准确率分别为96.07%和96.6%。同样,MLP在训练和测试数据集上的准确率分别为95%和94.1%。最后,我们对LR模型使用了5倍CV方法。训练和测试数据的LR平均准确率分别为0.954和0.942。根据LR,血清肌酐、白蛋白、糖尿病、红细胞计数、脓细胞和高血压是区分CKD和NCKD的最重要特征。拟议的战略最适合于实际实施,以减少该疾病的流行。
{"title":"Hybrid machine learning model to predict chronic kidney diseases using handcrafted features for early health rehabilitation","authors":"AMJAD REHMAN, TANZILA SABA, HAIDER ALI, NARMINE ELHAKIM, NOOR AYESHA","doi":"10.55730/1300-0632.4028","DOIUrl":"https://doi.org/10.55730/1300-0632.4028","url":null,"abstract":"Chronic kidney diseases proliferate due to hypertension, diabetes, anemia, obesity, smoking etc. Patients with such conditions are sometimes unaware of first symptoms, complicating disease diagnosis. This paper presents chronic kidney disease (CKD) prediction model to classify CKD patients from NCKD (Non-CKD). The proposed study has two main stages. First, we found the odds ratio through logistic regression and comparison test to identify early risk factors from kidneys? MRI and differentiate CKD from NCKD subjects. In stage 2, LR, LDA, MLP classifiers were applied to predict CKD and NCKD by extracting features from MRI. The odds ratio of blood glucose random and serum creatinine was found higher, and levels of sodium, Potassium, packed cell volume, white blood cell count, and red blood cell count were found lesser in CKD. The comparison results show increase levels in blood glucose random, serum creatinine and decreased levels found in sodium, potassium, packed cell volume, White blood cell and red blood cell count respectively in CKD patients than NCKD subjects. The accuracies of LR were 98.5% and 97.5% for train & test datasets. While LDA accuracy was 96.07% and 96.6% for train and test datasets. Likewise, MLP attained were 95% and 94.1% accuracy for train and test datasets. Finally, we used 5-fold CV approach on the LR model. The mean accuracies of LR were 0.954 and 0.942 for training and testing data respectively. According to LR the serum creatinine, Albumin, Diabetes mellitus, red blood cells count, pus cell and hypertension were found to be the most significant features to discriminate the CKD patients from NCKD. The proposed strategy is best suited for practical implementation for reducing the disease's prevalence.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing exploration-exploitation in harmony search for airborne hyperspectral imaging band selection (E3HS) 加强机载高光谱成像波段选择(E3HS)和声搜索中的勘探开发
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4029
MOHAMMED ABDULMAJEED MOHARRAM, DIVYA MEENA SUNDARAM
: Hyperspectral imaging has emerged as a prominent area of research in the field of remote sensing science. However, hyperspectral images (HSIs) pose a notable challenge due to the presence of numerous irrelevant and redundant spectral bands exhibiting high correlation. Therefore, it is necessary to enhance the classification performance for HSI processing by selecting the most relevant discriminative spectral bands. To this end, this paper introduces a metaheuristic search method called enhancing exploration-exploitation in harmony search (E3HS). The standard harmony search suffers from many weaknesses, such as premature convergence and falling easily into the local optimum. Consequently, E3HS was proposed to evade falling into the local optimum by creating a balance between exploration and exploitation strategies to accelerate convergence toward the global optimum solution. Finally, two machine learning classifiers (k-nearest neighbor and support vector machine) were employed for hyperspectral image classification at the pixel level. Moreover, the proposed method was compared with the bat algorithm, Archimedes optimization algorithm, particle swarm optimization, standard harmony search, genetic algorithm, and krill herd algorithm. The experimental results demonstrated significant improvement with overall accuracy equal to 87.49%, 94.85%, and 94.41% for the Indian Pines, Pavia University, and Salinas datasets, respectively.
{"title":"Enhancing exploration-exploitation in harmony search for airborne hyperspectral imaging band selection (E3HS)","authors":"MOHAMMED ABDULMAJEED MOHARRAM, DIVYA MEENA SUNDARAM","doi":"10.55730/1300-0632.4029","DOIUrl":"https://doi.org/10.55730/1300-0632.4029","url":null,"abstract":": Hyperspectral imaging has emerged as a prominent area of research in the field of remote sensing science. However, hyperspectral images (HSIs) pose a notable challenge due to the presence of numerous irrelevant and redundant spectral bands exhibiting high correlation. Therefore, it is necessary to enhance the classification performance for HSI processing by selecting the most relevant discriminative spectral bands. To this end, this paper introduces a metaheuristic search method called enhancing exploration-exploitation in harmony search (E3HS). The standard harmony search suffers from many weaknesses, such as premature convergence and falling easily into the local optimum. Consequently, E3HS was proposed to evade falling into the local optimum by creating a balance between exploration and exploitation strategies to accelerate convergence toward the global optimum solution. Finally, two machine learning classifiers (k-nearest neighbor and support vector machine) were employed for hyperspectral image classification at the pixel level. Moreover, the proposed method was compared with the bat algorithm, Archimedes optimization algorithm, particle swarm optimization, standard harmony search, genetic algorithm, and krill herd algorithm. The experimental results demonstrated significant improvement with overall accuracy equal to 87.49%, 94.85%, and 94.41% for the Indian Pines, Pavia University, and Salinas datasets, respectively.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
YOLO and LSH-based video stream analytics landscape for short-term traffic density surveillance at road networks 基于YOLO和lsh的视频流分析景观,用于道路网络的短期交通密度监控
4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-10-07 DOI: 10.55730/1300-0632.4036
LAVANYA K, STUTI TIWARI, RAHUL ANAND, JUDE HEMANTH
The duty of monitoring traffic during rush hour is difficult due to the fact that modern roadways are getting more crowded every day. The automated solutions that have already been created in this area are ineffective at processing enormous amounts of data in a short amount of time, leading to ineffectiveness and inconsistent results. The YOLO (you only look once) and LSH (locality sensitive hashing) algorithms are combined with the Kafka architecture in this study to create a method for assessing traffic density in real-time scenarios. Our concept, which is specifically designed for vehicular networks, predicts the traffic density in a given location by gathering live stream data from traffic surveillance cameras and transforming it into frames (at a rate of 11 per minute) using the YOLOv3 algorithm, which is a crucial parameter for performing effective traffic diversion by suggesting alternate routes and avoiding traffic congestion. The predicted density is then projected onto Google Maps for the convenience of local clients. The comparative study?s results demonstrate that our strategy consistently and accurately predicts vehicular density, with an accuracy of more than 90 percent under all conditions. It also shows a significant improvement in both precision and recall, with a 4.08 percent improvement.
由于现代道路每天都变得越来越拥挤,在高峰时段监控交通的任务很困难。在此领域中已经创建的自动化解决方案在短时间内处理大量数据时效率低下,导致效率低下和结果不一致。在本研究中,YOLO(你只看一次)和LSH(位置敏感散列)算法与Kafka架构相结合,创建了一种在实时场景中评估流量密度的方法。我们的概念是专门为车辆网络设计的,通过收集来自交通监控摄像头的实时流数据并使用YOLOv3算法将其转换为帧(每分钟11帧)来预测给定位置的交通密度,这是通过建议替代路线和避免交通拥堵来执行有效交通分流的关键参数。然后,预测的密度被投影到谷歌地图上,以方便当地客户。比较研究?S的结果表明,我们的策略始终如一地准确预测车辆密度,在所有条件下的准确率都超过90%。它在准确率和召回率方面也有显著提高,提高了4.08%。
{"title":"YOLO and LSH-based video stream analytics landscape for short-term traffic density surveillance at road networks","authors":"LAVANYA K, STUTI TIWARI, RAHUL ANAND, JUDE HEMANTH","doi":"10.55730/1300-0632.4036","DOIUrl":"https://doi.org/10.55730/1300-0632.4036","url":null,"abstract":"The duty of monitoring traffic during rush hour is difficult due to the fact that modern roadways are getting more crowded every day. The automated solutions that have already been created in this area are ineffective at processing enormous amounts of data in a short amount of time, leading to ineffectiveness and inconsistent results. The YOLO (you only look once) and LSH (locality sensitive hashing) algorithms are combined with the Kafka architecture in this study to create a method for assessing traffic density in real-time scenarios. Our concept, which is specifically designed for vehicular networks, predicts the traffic density in a given location by gathering live stream data from traffic surveillance cameras and transforming it into frames (at a rate of 11 per minute) using the YOLOv3 algorithm, which is a crucial parameter for performing effective traffic diversion by suggesting alternate routes and avoiding traffic congestion. The predicted density is then projected onto Google Maps for the convenience of local clients. The comparative study?s results demonstrate that our strategy consistently and accurately predicts vehicular density, with an accuracy of more than 90 percent under all conditions. It also shows a significant improvement in both precision and recall, with a 4.08 percent improvement.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Turkish Journal of Electrical Engineering and Computer Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1