Segmentation of lung regions is of key importance for the automatic analysis of chest X-ray (CXR) images, which play a vital role in the detection of various pulmonary diseases. Precise identification of lung regions is the basic prerequisite for disease diagnosis and treatment planning. However, achieving precise lung segmentation poses significant challenges due to factors such as variations in anatomical shape and size, the presence of strong edges at the rib cage and clavicle, and overlapping anatomical structures resulting from diverse diseases. Although commonly considered the de facto standard in medical image segmentation, the convolutional UNet architecture and its variants fall short in addressing these challenges, primarily due to their limited ability to model long-range dependencies between image features. While vision transformers equipped with self-attention mechanisms excel at capturing long-range relationships, segmentation of high-resolution images typically adopts either coarse-grained global self-attention or fine-grained local self-attention to alleviate the quadratic computational cost, at the expense of performance. This paper introduces a focal modulation UNet model (FMN-UNet) to enhance segmentation performance by effectively aggregating fine-grained local and coarse-grained global relations at a reasonable computational cost. FMN-UNet first encodes CXR images via a convolutional encoder to suppress background regions and extract latent feature maps at a relatively modest resolution. It then leverages global and local attention mechanisms to model contextual relationships across the images. These contextual feature maps are convolutionally decoded to produce segmentation masks. The segmentation performance of FMN-UNet is compared against state-of-the-art methods on three public CXR datasets (JSRT, Montgomery, and Shenzhen).
Experiments on each dataset demonstrate the superior performance of FMN-UNet over the baselines.
{"title":"Focal modulation network for lung segmentation in chest X-ray images","authors":"ŞABAN ÖZTÜRK, TOLGA ÇUKUR","doi":"10.55730/1300-0632.4031","DOIUrl":"https://doi.org/10.55730/1300-0632.4031","url":null,"abstract":": Segmentation of lung regions is of key importance for the automatic analysis of Chest X-Ray (CXR) images, which have a vital role in the detection of various pulmonary diseases. Precise identification of lung regions is the basic prerequisite for disease diagnosis and treatment planning. However, achieving precise lung segmentation poses significant challenges due to factors such as variations in anatomical shape and size, the presence of strong edges at the rib cage and clavicle, and overlapping anatomical structures resulting from diverse diseases. Although commonly considered as the de-facto standard in medical image segmentation, the convolutional UNet architecture and its variants fall short in addressing these challenges, primarily due to the limited ability to model long-range dependencies between image features. While vision transformers equipped with self-attention mechanisms excel at capturing long-range relationships, either a coarse-grained global self-attention or a fine-grained local self-attention is typically adopted for segmentation tasks on high-resolution images to alleviate quadratic computational cost at the expense of performance loss. This paper introduces a focal modulation UNet model (FMN-UNet) to enhance segmentation performance by effectively aggregating fine-grained local and coarse-grained global relations at a reasonable computational cost. FMN-UNet first encodes CXR images via a convolutional encoder to suppress background regions and extract latent feature maps at a relatively modest resolution. FMN-UNet then leverages global and local attention mechanisms to model contextual relationships across the images. These contextual feature maps are convolutionally decoded to produce segmentation masks. 
The segmentation performance of FMN-UNet is compared against state-of-the-art methods on three public CXR datasets (JSRT, Montgomery, and Shenzhen). Experiments in each dataset demonstrate the superior performance of FMN-UNet against baselines.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"206 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
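For orientation, focal modulation (the mechanism FMN-UNet builds on) replaces token-to-token self-attention with a single modulator aggregated from multi-level context summaries, which then multiplies a query projection. A scalar toy sketch of that aggregation, illustrative only and not the paper's implementation:

```python
def focal_modulation(query, contexts, gates):
    """Toy scalar focal modulation: a gated sum of multi-level context
    summaries forms one modulator, which multiplies the query value.
    In the real mechanism, these are per-position feature vectors."""
    modulator = sum(g * c for g, c in zip(gates, contexts))
    return query * modulator

# one spatial position, two focal levels (local and global context summaries)
print(focal_modulation(2.0, contexts=[1.0, 3.0], gates=[0.5, 0.5]))  # 4.0
```

Because every position interacts with one aggregated modulator rather than with all other positions, the cost grows linearly with the number of positions instead of quadratically.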
The conventional approach to creating 3D surfaces from 2D medical images is the marching cubes algorithm, but it often results in rough surfaces. B-spline curves and nonuniform rational B-splines (NURBSs) offer a smoother alternative for 3D surface reconstruction. However, NURBSs use control points (CTPs) to define the object shape, and corners play an important role in defining the boundary shape as well. Thus, to fill the research gap in applying corner detection (CD) methods to generate the most favorable CTPs, in this paper corner points are identified to predict organ shape. However, CTPs must be in ordered coordinate pairs. This ordering problem is resolved using curve reconstruction (CR) or chain code (CC) algorithms. Existing CR methods lead to issues like holes, while some chain codes have junction-induced errors that require preprocessing. To address these issues, a new graph neural network (GNN)-based approach named curvature and chain code-based corner detection (CCCD) is introduced, which not only orders the CTPs but also removes junction errors. The goal is to improve accuracy and reliability in generating smooth surfaces. The paper fuses well-known CD methods with a curve generation technique and compares these fused methods with CCCD. CCCD is also compared against other curve reconstruction techniques to establish its superiority. For validation, CCCD's accuracy in predicting boundaries is compared with deep learning models like Polar U-Net, KiU-Net 3D, and HdenseUnet, achieving an impressive Dice score of 98.49%, even with only 39.13% boundary
{"title":"CCCD: Corner detection and curve reconstruction for improved 3D surface reconstruction from 2D medical images","authors":"MRIGANKA SARMAH, ARAMBAM NEELIMA","doi":"10.55730/1300-0632.4027","DOIUrl":"https://doi.org/10.55730/1300-0632.4027","url":null,"abstract":": The conventional approach to creating 3D surfaces from 2D medical images is the marching cube algorithm, but it often results in rough surfaces. On the other hand, B-spline curves and nonuniform rational B-splines (NURBSs) offer a smoother alternative for 3D surface reconstruction. However, NURBSs use control points (CTPs) to define the object shape and corners play an important role in defining the boundary shape as well. Thus, in order to fill the research gap in applying corner detection (CD) methods to generate the most favorable CTPs, in this paper corner points are identified to predict organ shape. However, CTPs must be in ordered coordinate pairs. This ordering problem is resolved using curve reconstruction (CR) or chain code (CC) algorithms. Existing CR methods lead to issues like holes, while some chain codes have junction-induced errors that need preprocessing. To address the above issues, a new graph neural network (GNN)-based approach named curvature and chain code-based corner detection (CCCD) is introduced that not only orders the CTPs but also removes junction errors. The goal is to improve accuracy and reliability in generating smooth surfaces. The paper fuses well-known CD methods with a curve generation technique and compares these alternative fused methods with CCCD. CCCD is also compared against other curve reconstruction techniques to establish its superiority. 
For validation, CCCD’s accuracy in predicting boundaries is compared with deep learning models like Polar U-Net, KiU-Net 3D, and HdenseUnet, achieving an impressive Dice score of 98.49%, even with only 39.13% boundary","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
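The Dice score used for validation above is the standard overlap metric between a predicted and a reference mask. A minimal sketch over sets of pixel indices (names are illustrative, not the authors' code):

```python
def dice_score(pred, truth):
    """Dice similarity coefficient between two sets of pixel indices:
    twice the intersection size over the sum of the set sizes."""
    pred, truth = set(pred), set(truth)
    if not pred and not truth:
        return 1.0  # two empty masks agree perfectly by convention
    return 2 * len(pred & truth) / (len(pred) + len(truth))

# toy masks given as flattened pixel indices
print(dice_score({1, 2, 3, 4}, {2, 3, 4, 5}))  # 0.75
```

A score of 1.0 means perfect overlap; the 98.49% reported above corresponds to a Dice value of 0.9849.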
AHMET HİDAYET KİRAZ, FATIME OUMAR DJIBRILLAH, MEHMET EMİN YÜKSEL
{"title":"Deep feature extraction, dimensionality reduction, and classification of medical images using combined deep learning architectures, autoencoder, and multiple machine learning models","authors":"AHMET HİDAYET KİRAZ, FATIME OUMAR DJIBRILLAH, MEHMET EMİN YÜKSEL","doi":"10.55730/1300-0632.4037","DOIUrl":"https://doi.org/10.55730/1300-0632.4037","url":null,"abstract":"","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Millions of people throughout the world suffer from the complicated and crippling condition of chronic pain. It can be brought on by several underlying disorders or injuries and is defined by pain that persists for more than three months. To better understand the brain processes behind pain and to create prediction models for pain-related outcomes, machine learning is a potent technology that may be applied in functional magnetic resonance imaging (fMRI) chronic pain research. Data (fMRI and T1-weighted images) from 76 participants (30 chronic pain patients and 46 healthy controls) were included. The raw data were preprocessed using fMRIPrep and then parcellated using five atlases: MSDL, Yeo 17, Harvard, Schaefer, and Pauli. The functional connectivity between the parcellated regions of interest (ROIs), computed from blood oxygenation level dependent (BOLD) signals, was then taken as the feature set for the machine learning classifiers. To distinguish between those with chronic pain and healthy controls, this study used support vector machine (SVM), boosting, bagging, convolutional neural network (CNN), XGBoost, and stochastic gradient descent (SGD) classifiers. The classification models use stratified shuffle split sampling to partition the training and testing datasets across iterations. Hyperparameter tuning was used to find the best classifier model across several combinations of parameters. The best parameters for the classifier were selected based on the accuracy, sensitivity, and specificity of the model. Finally, the top ROIs involved in chronic pain were identified using the probability-based feature importance method. The results show that the Pauli (subcortical) and MSDL (cortical) atlases worked well for this chronic pain fMRI data. The boosting algorithm classified chronic pain patients and healthy controls with 94.35% accuracy on the data parcellated with the Pauli atlas. The top four regions contributing to this classifier model were the extended amygdala, the subthalamic nucleus, the hypothalamus, and the caudate nucleus. The fMRI data parcellated using the cortical MSDL atlas were classified using the XGBoost model with an accuracy of 87.5%. The left frontal pole, medial default mode network, right pars opercularis, dorsal anterior cingulate cortex (dACC), and frontal default mode network are the top five regions that contributed to classifying the participants. These findings demonstrate that patterns of brain activity in areas associated with pain processing can be used to reliably categorize individuals as chronic pain patients or healthy controls. These discoveries may help with the identification and management of chronic pain and may pave the way for the creation of more effective tailored treatments for those who suffer from it.
{"title":"Classification of chronic pain using fMRI data: Unveiling brain activity patterns for diagnosis","authors":"REJULA V, ANITHA J, BELFIN ROBINSON","doi":"10.55730/1300-0632.4034","DOIUrl":"https://doi.org/10.55730/1300-0632.4034","url":null,"abstract":"Millions of people throughout the world suffer from the complicated and crippling condition of chronic pain. It can be brought on by several underlying disorders or injuries and is defined by chronic pain that lasts for a period exceeding three months. To better understand the brain processes behind pain and create prediction models for pain-related outcomes, machine learning is a potent technology that may be applied in Functional magnetic resonance imaging (fMRI) chronic pain research. Data (fMRI and T1-weighted images) from 76 participants has been included (30 chronic pain and 46 healthy controls). The raw data were preprocessed using fMRIprep and then parcellated using five various atlases such as MSDL, Yeo?17, Harvard, Schaefer, and Pauli. Then the functional connectivity between the parcellated Region of Interests (ROIs) has been taken as features for the machine learning classifier models using the Blood Oxygenation Level Dependent (BOLD) signals. To distinguish between those with chronic pain and healthy controls, this study used Support Vector Machines (SVM), Boosting, Bagging, convolutional neural network (CNN), XGboost, and Stochastic Gradient Descent (SDG) classifiers. The classification models use stratified shuffle split sampling to fragment the training and testing dataset during various iterations. Hyperparameter tuning was used to get the best classifier model across several combinations of parameters. The best parameters for the classifier were measured by the accuracy, sensitivity, and specificity of the model. Finally, to identify the top ROIs involved in chronic pain was unveiled by the probability-based feature importance method. 
The result shows that Pauli (subcortical atlas) and MSDL (cortical atlas) worked well for this chronic pain fMRI data. Boosting algorithm classified chronic pain and healthy controls with 94.35% accuracy on the data parcellated with the Pauli atlas. The top four regions contributing to this classifier model were the extended Amygdala, the Subthalamic nucleus, the Hypothalamus, and the Caudate Nucleus. Also, the fMRI data parcellated using a cortical MSDL atlas was classified using the XGboost model with an accuracy of 87.5%. Left Frontal Pole, Medial Default mode Network, right pars opercularis, dorsal anterior cingulate cortex (dACC), and Front Default mode network are the top five regions that contributed to classify the participants. These findings demonstrate that patterns of brain activity in areas associated with pain processing can be used to categorize individuals as chronic pain patients or healthy controls reliably. These discoveries may help with the identification and management of chronic pain and may pave the way for the creation of more potent tailored medicines for those who suffer from it.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135302712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
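The accuracy, sensitivity, and specificity used above to select the best classifier all derive from the confusion-matrix counts. A minimal sketch of that computation (labels and variable names are illustrative):

```python
def confusion_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity (recall on the positive class), and
    specificity (recall on the negative class) from label lists."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    tn = sum(t != positive and p != positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    accuracy = (tp + tn) / len(y_true)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return accuracy, sensitivity, specificity

# 1 = chronic pain patient, 0 = healthy control (toy labels)
acc, sens, spec = confusion_metrics([1, 1, 0, 0, 1], [1, 0, 0, 0, 1])
```

Reporting sensitivity and specificity alongside accuracy matters here because the classes are imbalanced (30 patients vs. 46 controls).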
TUĞBERK TAŞ, MUHAMMED ABDULLAH BÜLBÜL, ABAS HAŞİMOĞLU, YAVUZ MERAL, YASİN ÇALIŞKAN, GUNAY BUDAGOVA, MÜCAHİD KUTLU
Dyslexia is a learning disorder characterized by impairment in the ability to read, spell, and decode letters. It is vital to detect dyslexia at earlier stages to reduce its effects. However, diagnosing dyslexia is a time-consuming and costly process. In this paper, we propose a machine-learning model that predicts whether a Turkish-speaking child has dyslexia using his/her audio records. Our model can therefore be easily used on smartphones and work as a warning system, such that children who are likely to be dyslexic according to our model can seek an examination by experts. To train and evaluate the model, we first create a unique dataset that includes audio recordings of 12 dyslexic children and 13 nondyslexic children over an 8-month period. We explore various machine learning algorithms such as KNN and SVM and use the following features: Mel-frequency cepstral coefficients, reading rate, reading accuracy, the ratio of missing words, and confidence scores of the speech-to-text process. In our experiments, we show that children with dyslexia can be detected with 95.63% accuracy even though we use audio records only a single sentence long. In addition, we show that the prediction performance of our model is similar to that of humans. This paper provides a preliminary study showing that detecting children with dyslexia based on their audio records is possible.
{"title":"A machine learning approach for dyslexia detection using Turkish audio records","authors":"TUĞBERK TAŞ, MUHAMMED ABDULLAH BÜLBÜL, ABAS HAŞİMOĞLU, YAVUZ MERAL, YASİN ÇALIŞKAN, GUNAY BUDAGOVA, MÜCAHİD KUTLU","doi":"10.55730/1300-0632.4024","DOIUrl":"https://doi.org/10.55730/1300-0632.4024","url":null,"abstract":"Dyslexia is a learning disorder, characterized by impairment in the ability to read, spell, and decode letters. It is vital to detect dyslexia in earlier stages to reduce its effects. However, diagnosing dyslexia is a time-consuming and costly process. In this paper, we propose a machine-learning model that predicts whether a Turkish-speaking child has dyslexia using his/her audio records. Therefore, our model can be easily used by smart phones and work as a warning system such that children who are likely to be dyslexic according to our model can seek an examination by experts. In order to train and evaluate, we first create a unique dataset that includes audio recordings of 12 dyslexic children and 13 nondyslexic children in an 8-month period. We explore various machine learning algorithms such as KNN and SVM and use the following features: Mel-frequency cepstral coefficients, reading rate, reading accuracy, the ratio of missing words, and confidence scores of the speech-to-text process. In our experiments, we show that children with dyslexia can be detected with 95.63% accuracy even though we use single-sentence long audio records. In addition, we show that the prediction performance of our model is similar to that of the humans?. In this paper, we provide a preliminary study showing that detecting children with dyslexia based on their audio records is possible. 
Once the mobile application version of our model is developed, parents can easily check whether their children are likely to be dyslexic or not, and seek professional help accordingly.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135246527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
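Two of the features listed above, reading accuracy and the ratio of missing words, can be derived by aligning the speech-to-text transcript against the reference sentence. A hypothetical sketch of such feature extraction (the paper does not publish its exact formulas; the alignment via `difflib` and both names are assumptions):

```python
import difflib

def reading_features(reference, transcript):
    """Hypothetical feature extraction: word-level reading accuracy and
    missing-word ratio from a reference sentence and an ASR transcript."""
    ref, hyp = reference.lower().split(), transcript.lower().split()
    matcher = difflib.SequenceMatcher(a=ref, b=hyp)
    # total number of reference words matched in order by the transcript
    matched = sum(block.size for block in matcher.get_matching_blocks())
    accuracy = matched / len(ref)
    missing_ratio = 1 - accuracy  # unmatched reference words were missed/misread
    return accuracy, missing_ratio

acc, miss = reading_features("the quick brown fox jumps", "the brown fox jumps")
```

Here the child skipped one of five reference words, giving an accuracy of 0.8 and a missing-word ratio of 0.2.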
Although the standard k-nearest neighbor (KNN) algorithm has been widely used for classification in many different fields, it suffers from various limitations that abate its classification ability, such as being influenced by the distribution of instances, ignoring distances between the test instance and its neighbors during classification, and building a single/weak learner. This paper proposes a novel algorithm, called stepwise dynamic nearest neighbor (SDNN), which can effectively handle these problems. Instead of using a fixed parameter k like KNN, it uses a dynamic neighborhood strategy according to the data distribution and implements a new voting mechanism, called stepwise voting. Experiments were conducted on 50 benchmark datasets. The results showed that the proposed SDNN method outperformed the KNN method, KNN variants, and the state-of-the-art methods in terms of accuracy.
{"title":"Stepwise dynamic nearest neighbor (SDNN): a new algorithm for classification","authors":"DENİZ KARABAŞ, DERYA BİRANT, PELİN YILDIRIM TAŞER","doi":"10.55730/1300-0632.4016","DOIUrl":"https://doi.org/10.55730/1300-0632.4016","url":null,"abstract":"Although the standard k-nearest neighbor (KNN) algorithm has been used widely for classification in many different fields, it suffers from various limitations that abate its classification ability, such as being influenced by the distribution of instances, ignoring distances between the test instance and its neighbors during classification, and building a single/weak learner. This paper proposes a novel algorithm, called stepwise dynamic nearest neighbor (SDNN), which can effectively handle these problems. Instead of using a fixed parameter k like KNN, it uses a dynamic neighborhood strategy according to the data distribution and implements a new voting mechanism, called stepwise voting. Experimental results were conducted on 50 benchmark datasets. The results showed that the proposed SDNN method outperformed the KNN method, KNN variants, and the state-of-the-art methods in terms of accuracy.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135246528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MEIXIA ZOU, XIUWEN LI, JINZHENG FANG, HONG WEN, WEIWEI FANG
Deep neural networks (DNNs) have recently made remarkable achievements in computer vision applications. However, the high computational requirements needed to achieve accurate inference results can be a significant barrier to deploying DNNs on resource-constrained computing devices, such as those found in the Internet of Things. In this work, we propose a fresh approach called adaptive channel skipping (ACS) that prioritizes the identification of the most suitable channels for skipping and implements an efficient skipping mechanism during inference. We begin with the development of a new gating network model, ACS-GN, which employs fine-grained channel-wise skipping to enable input-dependent inference and achieve a desirable balance between accuracy and resource consumption. To further enhance the efficiency of channel skipping, we propose a dynamic grouping convolutional computing approach, ACS-DG, which helps to reduce the computational cost of ACS-GN. The results of our experiments indicate that ACS-GN and ACS-DG exhibit superior performance compared to existing gating network designs and convolutional computing mechanisms, respectively.
{"title":"Dynamic deep neural network inference via adaptive channel skipping","authors":"MEIXIA ZOU, XIUWEN LI, JINZHENG FANG, HONG WEN, WEIWEI FANG","doi":"10.55730/1300-0632.4020","DOIUrl":"https://doi.org/10.55730/1300-0632.4020","url":null,"abstract":"Deep neural networks have recently made remarkable achievements in computer vision applications. However, the high computational requirements needed to achieve accurate inference results can be a significant barrier to deploying DNNs on resource-constrained computing devices, such as those found in the Internet-of-things. In this work, we propose a fresh approach called adaptive channel skipping (ACS) that prioritizes the identification of the most suitable channels for skipping and implements an efficient skipping mechanism during inference. We begin with the development of a new gating network model, ACS-GN, which employs fine-grained channel-wise skipping to enable input-dependent inference and achieve a desirable balance between accuracy and resource consumption. To further enhance the efficiency of channel skipping, we propose a dynamic grouping convolutional computing approach, ACS-DG, which helps to reduce the computational cost of ACS-GN. The results of our experiment indicate that ACS-GN and ACS-DG exhibit superior performance compared to existing gating network designs and convolutional computing mechanisms, respectively. 
When they are combined, the ACS framework results in a significant reduction of computational expenses and a remarkable improvement in the accuracy of inferences","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135246529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
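The core idea of channel skipping can be illustrated with a toy gating step: a gating network scores each channel for the current input, and only channels clearing a threshold are computed. This is a loose sketch; the threshold and all names are illustrative, not the ACS-GN design:

```python
def skip_channels(feature_maps, gate_scores, threshold=0.5):
    """Toy channel-skipping step: keep only channels whose gate score
    clears the threshold; the rest are skipped for this input."""
    kept = [c for c, (fmap, score) in enumerate(zip(feature_maps, gate_scores))
            if score >= threshold]
    return kept  # indices of channels that will actually be computed

# 4 channels, per-channel gate scores from a (hypothetical) gating network
print(skip_channels(["f0", "f1", "f2", "f3"], [0.9, 0.2, 0.7, 0.1]))  # [0, 2]
```

Because the gate scores depend on the input, different inputs activate different channel subsets, which is what makes the inference input-dependent.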
In this paper, a spiking neural network (SNN) has been proposed for recognizing the digits written on the LCD screen of an experimental setup. The convergence of the learning algorithm has been ensured by using sliding mode control (SMC) theory and the Lyapunov stability method for the adaptation of the network parameters. The spike response model (SRM) has been utilized in the design of the SNN. The performance of the proposed learning scheme has been evaluated both on the experimental data and on the MNIST dataset. The simulated and experimental results of the SNN structure have been compared with the responses of a conventional artificial neural network (ANN), for which the weight update rules have also been derived using SMC theory. The simulations and experimental studies reveal that convergence can be ensured for the proposed learning scheme and that the SNN yields higher recognition accuracy than a conventional ANN.
{"title":"Recognizing handwritten digits using spiking neural networks with learning algorithms based on sliding mode control theory","authors":"YEŞİM ÖNİZ, MEHMET AYYILDIZ","doi":"10.55730/1300-0632.4022","DOIUrl":"https://doi.org/10.55730/1300-0632.4022","url":null,"abstract":"In this paper, a spiking neural network (SNN) has been proposed for recognizing the digits written on the LCD screen of an experimental setup. The convergence of the learning algorithm has been ensured by using sliding mode control (SMC) theory and the Lyapunov stability method for the adaptation of the network parameters. The spike response model (SRM) has been utilized in the design of the SNN. The performance of the proposed learning scheme has been evaluated both on the experimental data and on the MNIST dataset. The simulated and experimental results of the SNN structure have been compared with the responses of a conventional neural network (ANN) for which the weight update rules have been also derived using SMC theory. The conducted simulations and experimental studies reveal that convergence can be ensured for the proposed learning scheme and the SNN yields higher recognition accuracy compared to a conventional ANN.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135246666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this work, the use of t-distributed stochastic neighbor embedding (t-SNE) is proposed to embed 3D point clouds of plants into 2D space for plant characterization. It is demonstrated that t-SNE operates as a practical tool to flatten and visualize a complete 3D plant model in 2D space. The perplexity parameter of t-SNE allows 2D rendering of plant structures at various organizational levels. Aside from its promise as a visualization tool for plant scientists, t-SNE also provides a gateway for processing 3D point clouds of plants using their embedded counterparts in 2D. In this paper, simple methods are proposed to perform semantic segmentation and instance segmentation by grouping the embedded 2D points. The evaluation of these methods on a public 3D plant dataset conveys the potential of t-SNE for enabling 2D implementation of various steps involved in automatic 3D phenotyping pipelines.
{"title":"Using t-distributed stochastic neighbor embedding for visualization and segmentation of 3D point clouds of plants","authors":"HELİN DUTAĞACI","doi":"10.55730/1300-0632.4018","DOIUrl":"https://doi.org/10.55730/1300-0632.4018","url":null,"abstract":"In this work, the use of t-SNE is proposed to embed 3D point clouds of plants into 2D space for plant characterization. It is demonstrated that t-SNE operates as a practical tool to flatten and visualize a complete 3D plant model in 2D space. The perplexity parameter of t-SNE allows 2D rendering of plant structures at various organizational levels. Aside from the promise of serving as a visualization tool for plant scientists, t-SNE also provides a gateway for processing 3D point clouds of plants using their embedded counterparts in 2D. In this paper, simple methods were proposed to perform semantic segmentation and instance segmentation via grouping the embedded 2D points. The evaluation of these methods on a public 3D plant data set conveys the potential of t-SNE for enabling 2D implementation of various steps involved in automatic 3D phenotyping pipelines.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135246521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
VEDAT DELICAN, BEHÇET UĞUR TÖREYİN, EGE ÇETİN, AYLİN YALÇIN SARIBEY
Fingerprints are among the most important scientific instruments of proof in solving forensic cases. Fingerprint identification operates at three levels: the flow direction of the papillary lines at the first level, minutiae points at the second level, and pores at the third level. The inadequacy of existing imaging systems in detecting fingerprints, together with the lack of pore detail at the desired level, limits the widespread use of third-level identification. Pore-based fingerprint images in unsolved-case databases are currently not subjected to any evaluation criteria and simply remain in the database, which underlines the importance of this study. In this study, unlike classical fingerprint identification methods, a direct pore-based identification system for fingerprint matching is proposed, using a dataset created with the Docucenter Nirvis device and Projectina Image Acquisition-7000 software as a hyperspectral imaging system in which pores are visualized more clearly. Although laborious from an operational perspective, the pores in the 800 fingerprints in the database were manually marked to ensure the accuracy of the results. Latent fingerprints were then matched using a scoring scheme based on the iterative closest point (ICP) algorithm. Results suggest that the more pores examined and the more accurately they were marked, the higher the hit score. Query results also showed that the scores of the other, sequentially ranked fingerprints following the matching fingerprint were markedly lower.
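The ICP-based scoring named above can be illustrated with a basic 2D rigid-alignment loop: nearest-neighbour correspondence followed by a Kabsch/SVD rigid-transform estimate, with the mean residual after alignment serving as a (lower-is-better) match score. This is a hedged sketch under that interpretation, not the authors' code; the function name `icp_score`, the iteration count, and the residual-based score are illustrative assumptions.

```python
import numpy as np

def icp_score(src, dst, iters=20):
    """Align 2D pore coordinates `src` to `dst` with a basic ICP loop.

    Returns (aligned_src, mean_residual); a smaller residual indicates
    a better pore-pattern match (illustrative scoring assumption).
    """
    src = np.asarray(src, dtype=float).copy()
    dst = np.asarray(dst, dtype=float)
    for _ in range(iters):
        # brute-force nearest-neighbour correspondences
        d = np.linalg.norm(src[:, None, :] - dst[None, :, :], axis=2)
        nn = dst[d.argmin(axis=1)]
        # best rigid transform via the Kabsch/SVD method
        mu_s, mu_n = src.mean(axis=0), nn.mean(axis=0)
        H = (src - mu_s).T @ (nn - mu_n)
        U, _, Vt = np.linalg.svd(H)
        R = Vt.T @ U.T
        if np.linalg.det(R) < 0:      # guard against reflections
            Vt[-1] *= -1
            R = Vt.T @ U.T
        t = mu_n - R @ mu_s
        src = src @ R.T + t
    d = np.linalg.norm(src[:, None, :] - dst[None, :, :], axis=2)
    return src, d.min(axis=1).mean()
```

Under this reading, the abstract's observation follows naturally: more marked pores give more correspondences, which stabilizes the estimated transform and sharpens the separation between the matching print and the next-ranked candidates.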
{"title":"Direct pore-based identification for fingerprint matching process","authors":"VEDAT DELICAN, BEHÇET UĞUR TÖREYİN, EGE ÇETİN, AYLİN YALÇIN SARIBEY","doi":"10.55730/1300-0632.4019","DOIUrl":"https://doi.org/10.55730/1300-0632.4019","url":null,"abstract":"Fingerprints are one of the most important scientific proof instruments in solving forensic cases. Identification in fingerprints consists of three levels based on the flow direction of the papillary lines at the first level, the minutiae points at the second level, and the pores at the third level. The inadequacy of existing imaging systems in detecting fingerprints and the lack of pore details at the desired level limit the widespread use of third-level identification. The fact that fingerprints with images based on pores in the unsolved database are not subjected to any evaluation criteria and remain in the database reveals the importance of the study to be carried out. In this study, different from classical fingerprint identification methods, a direct pore-based identification system for fingerprint matching is proposed with the dataset created by using the Docucenter Nirvis device and Projectina Image Acquisition-7000 software as a hyperspectral imaging system where pores were visualized more clearly. Although difficult from an operational perspective, the pores in the 800 fingerprints in the database were manually marked for the accuracy of the results. Next, by using a scoring based on iterative closest point algorithm, latent fingerprints were found. Results suggest that the higher the number of pores examined and the more accurately the pores were marked, the higher the hit score. 
At the same time, query results showed that the scores of other sequential fingerprints in the database which came after the matching fingerprint were even lower.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135246526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}