首页 > 最新文献

Int. J. Image Graph.最新文献

英文 中文
A Review on Rice Crop Disease Classification Using Computational Approach 基于计算方法的水稻病害分类研究进展
Pub Date : 2021-12-23 DOI: 10.1142/s0219467822400071
V. Malathi, M. P. Gopinath
Rice is a significant cereal crop across the world. In rice cultivation, different types of sowing methods are followed, and thus bring in issues regarding sampling collection. Climate, soil, water level, and a diversified variety of crop seeds (hybrid and traditional varieties) and the period of growth are some of the challenges. This survey mainly focuses on rice crop diseases which affect the parts namely leaves, stems, roots, and spikelet; it mainly focuses on leaf-based diseases. Existing methods for diagnosing leaf disease include statistical approaches, data mining, image processing, machine learning, and deep learning techniques. This review mainly addresses diseases of the rice crop, a framework to diagnose rice crop diseases, and computational approaches in Image Processing, Machine Learning, Deep Learning, and Convolutional Neural Networks. Based on performance indicators, interpretations were made for the following algorithms namely support vector machine (SVM), convolutional neural network (CNN), backpropagational neural network (BPNN), and feedforward neural network (FFNN).
水稻是世界上重要的谷类作物。在水稻种植中,采用不同类型的播种方法,因此带来了采样问题。气候、土壤、水位、作物种子的多样化(杂交品种和传统品种)和生长期是其中的一些挑战。本次调查主要针对水稻作物的叶、茎、根和小穗等部位的病害;它主要关注叶基疾病。现有的诊断叶片疾病的方法包括统计方法、数据挖掘、图像处理、机器学习和深度学习技术。本文主要介绍了水稻作物病害、水稻作物病害诊断框架以及图像处理、机器学习、深度学习和卷积神经网络的计算方法。基于性能指标,对支持向量机(SVM)、卷积神经网络(CNN)、反向传播神经网络(BPNN)、前馈神经网络(FFNN)等算法进行了解释。
{"title":"A Review on Rice Crop Disease Classification Using Computational Approach","authors":"V. Malathi, M. P. Gopinath","doi":"10.1142/s0219467822400071","DOIUrl":"https://doi.org/10.1142/s0219467822400071","url":null,"abstract":"Rice is a significant cereal crop across the world. In rice cultivation, different types of sowing methods are followed, and thus bring in issues regarding sampling collection. Climate, soil, water level, and a diversified variety of crop seeds (hybrid and traditional varieties) and the period of growth are some of the challenges. This survey mainly focuses on rice crop diseases which affect the parts namely leaves, stems, roots, and spikelet; it mainly focuses on leaf-based diseases. Existing methods for diagnosing leaf disease include statistical approaches, data mining, image processing, machine learning, and deep learning techniques. This review mainly addresses diseases of the rice crop, a framework to diagnose rice crop diseases, and computational approaches in Image Processing, Machine Learning, Deep Learning, and Convolutional Neural Networks. Based on performance indicators, interpretations were made for the following algorithms namely support vector machine (SVM), convolutional neural network (CNN), backpropagational neural network (BPNN), and feedforward neural network (FFNN).","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"378 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131884981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Graph Theory-Based Brain Network Connectivity Analysis and Classification of Alzheimer's Disease 基于图论的阿尔茨海默病脑网络连通性分析与分类
Pub Date : 2021-12-22 DOI: 10.1142/s021946782240006x
A. Thushara, C. UshadeviAmma, Ansamma John
Alzheimer’s Disease (AD) is basically a progressive neurodegenerative disorder associated with abnormal brain networks that affect millions of elderly people and degrades their quality of life. The abnormalities in brain networks are due to the disruption of White Matter (WM) fiber tracts that connect the brain regions. Diffusion-Weighted Imaging (DWI) captures the brain’s WM integrity. Here, the correlation betwixt the WM degeneration and also AD is investigated by utilizing graph theory as well as Machine Learning (ML) algorithms. By using the DW image obtained from Alzheimer’s Disease Neuroimaging Initiative (ADNI) database, the brain graph of each subject is constructed. The features extracted from the brain graph form the basis to differentiate between Mild Cognitive Impairment (MCI), Control Normal (CN) and AD subjects. Performance evaluation is done using binary and multiclass classification algorithms and obtained an accuracy that outperforms the current top-notch DWI-based studies.
阿尔茨海默病(AD)基本上是一种进行性神经退行性疾病,与异常的大脑网络有关,影响数百万老年人并降低他们的生活质量。大脑网络的异常是由于连接大脑区域的白质(WM)纤维束的破坏。弥散加权成像(DWI)可以捕捉到大脑WM的完整性。本文利用图论和机器学习(ML)算法研究了WM退化和AD之间的相关性。利用从阿尔茨海默病神经成像倡议(ADNI)数据库中获取的DW图像,构建每个受试者的脑图。从脑图中提取的特征构成了区分轻度认知障碍(MCI)、控制正常(CN)和AD受试者的基础。使用二元和多类分类算法进行性能评估,并获得了优于当前一流的基于dwi的研究的精度。
{"title":"Graph Theory-Based Brain Network Connectivity Analysis and Classification of Alzheimer's Disease","authors":"A. Thushara, C. UshadeviAmma, Ansamma John","doi":"10.1142/s021946782240006x","DOIUrl":"https://doi.org/10.1142/s021946782240006x","url":null,"abstract":"Alzheimer’s Disease (AD) is basically a progressive neurodegenerative disorder associated with abnormal brain networks that affect millions of elderly people and degrades their quality of life. The abnormalities in brain networks are due to the disruption of White Matter (WM) fiber tracts that connect the brain regions. Diffusion-Weighted Imaging (DWI) captures the brain’s WM integrity. Here, the correlation betwixt the WM degeneration and also AD is investigated by utilizing graph theory as well as Machine Learning (ML) algorithms. By using the DW image obtained from Alzheimer’s Disease Neuroimaging Initiative (ADNI) database, the brain graph of each subject is constructed. The features extracted from the brain graph form the basis to differentiate between Mild Cognitive Impairment (MCI), Control Normal (CN) and AD subjects. Performance evaluation is done using binary and multiclass classification algorithms and obtained an accuracy that outperforms the current top-notch DWI-based studies.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"156 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114733890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Finger Vein Recognition Model for Biometric Authentication Using Intelligent Deep Learning 基于智能深度学习的指纹静脉识别模型
Pub Date : 2021-12-22 DOI: 10.1142/s0219467822400046
M. Madhusudhan, V. U. Rani, Chetana Hegde
In recent years, biometric authentication systems have remained a hot research topic, as they can recognize or authenticate a person by comparing their data to other biometric data stored in a database. Fingerprints, palm prints, hand vein, finger vein, palm vein, and other anatomic or behavioral features have all been used to develop a variety of biometric approaches. Finger vein recognition (FVR) is a common method of examining the patterns of the finger veins for proper authentication among the various biometrics. Finger vein acquisition, preprocessing, feature extraction, and authentication are all part of the proposed intelligent deep learning-based FVR (IDL-FVR) model. Infrared imaging devices have primarily captured the use of finger veins. Furthermore, a region of interest extraction process is carried out in order to save the finger part. The shark smell optimization algorithm is used to tune the hyperparameters of the bidirectional long–short-term memory model properly. Finally, an authentication process based on Euclidean distance is performed, which compares the features of the current finger vein image to those in the database. The IDL-FVR model surpassed the earlier methods by accomplishing a maximum accuracy of 99.93%. Authentication is successful when the Euclidean distance is small and vice versa.
近年来,生物特征认证系统一直是一个热门的研究课题,因为它可以通过将一个人的数据与存储在数据库中的其他生物特征数据进行比较来识别或认证一个人。指纹、掌纹、手静脉、手指静脉、手掌静脉和其他解剖学或行为特征都被用于开发各种生物识别方法。手指静脉识别(FVR)是多种生物识别技术中检测手指静脉形态以进行身份验证的一种常用方法。手指静脉采集、预处理、特征提取和认证都是所提出的基于智能深度学习的FVR (IDL-FVR)模型的一部分。红外成像设备主要捕捉到手指静脉的使用。此外,为了保存手指部分,还进行了兴趣区域提取。利用鲨鱼气味优化算法对双向长短期记忆模型的超参数进行了适当的调整。最后,进行基于欧几里得距离的身份验证,将当前手指静脉图像的特征与数据库中的特征进行比较。IDL-FVR模型的最高准确率达到99.93%,超过了早期的方法。当欧几里得距离较小时,验证成功,反之亦然。
{"title":"Finger Vein Recognition Model for Biometric Authentication Using Intelligent Deep Learning","authors":"M. Madhusudhan, V. U. Rani, Chetana Hegde","doi":"10.1142/s0219467822400046","DOIUrl":"https://doi.org/10.1142/s0219467822400046","url":null,"abstract":"In recent years, biometric authentication systems have remained a hot research topic, as they can recognize or authenticate a person by comparing their data to other biometric data stored in a database. Fingerprints, palm prints, hand vein, finger vein, palm vein, and other anatomic or behavioral features have all been used to develop a variety of biometric approaches. Finger vein recognition (FVR) is a common method of examining the patterns of the finger veins for proper authentication among the various biometrics. Finger vein acquisition, preprocessing, feature extraction, and authentication are all part of the proposed intelligent deep learning-based FVR (IDL-FVR) model. Infrared imaging devices have primarily captured the use of finger veins. Furthermore, a region of interest extraction process is carried out in order to save the finger part. The shark smell optimization algorithm is used to tune the hyperparameters of the bidirectional long–short-term memory model properly. Finally, an authentication process based on Euclidean distance is performed, which compares the features of the current finger vein image to those in the database. The IDL-FVR model surpassed the earlier methods by accomplishing a maximum accuracy of 99.93%. Authentication is successful when the Euclidean distance is small and vice versa.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"42 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120858131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Pan-Sharpening for Spectral Details Preservation Via Convolutional Sparse Coding in Non-Subsampled Shearlet Space 基于非下采样Shearlet空间卷积稀疏编码的泛锐化光谱细节保存
Pub Date : 2021-12-22 DOI: 10.1142/s0219467823500134
Dharaj. Sangani, R. Thakker, Sumankumar D. Panchal, Rajesh Gogineni
The optical satellite sensors encounter certain constraints on producing high-resolution multispectral (HRMS) images. Pan-sharpening (PS) is a remote sensing image fusion technique, which is an effective mechanism to overcome the limitations of available imaging products. The prevalent issue in PS algorithms is the imbalance between spatial quality and spectral details preservation, thereby producing intensity variations in the fused image. In this paper, a PS method is proposed based on convolutional sparse coding (CSC) implemented in the non-subsampled shearlet transform (NSST) domain. The source images, panchromatic (PAN) and multispectral (MS) images, are decomposed using NSST. The resultant high-frequency bands are fused using adaptive weights determined from chaotic grey wolf optimization (CGWO) algorithm. The CSC-based model is employed to fuse the low-frequency bands. Further, an iterative filtering mechanism is developed to enhance the quality of fused image. Four datasets with different geographical content like urban area, vegetation, etc. and eight existing algorithms are used for evaluation of the proposed PS method. The comprehensive visual and quantitative results approve that the proposed method accomplishes considerable improvement in spatial and spectral details equivalence in the pan-sharpened image.
光学卫星传感器在产生高分辨率多光谱(HRMS)图像时受到一定的限制。泛锐化(Pan-sharpening, PS)是一种遥感图像融合技术,是克服现有成像产品局限性的有效机制。PS算法普遍存在的问题是空间质量与光谱细节保存之间的不平衡,从而在融合图像中产生强度变化。提出了一种在非下采样shearlet变换(NSST)域实现的基于卷积稀疏编码(CSC)的PS方法。对源图像全色(PAN)和多光谱(MS)图像进行NSST分解。利用混沌灰狼优化(CGWO)算法确定的自适应权值进行高频频带融合。采用基于csc的模型进行低频融合。为了提高融合图像的质量,提出了一种迭代滤波机制。利用城市面积、植被等4个地理内容不同的数据集和8种现有算法对所提出的PS方法进行了评价。综合视觉和定量结果表明,该方法在泛锐化图像的空间和光谱细节等效性方面取得了较大的改善。
{"title":"Pan-Sharpening for Spectral Details Preservation Via Convolutional Sparse Coding in Non-Subsampled Shearlet Space","authors":"Dharaj. Sangani, R. Thakker, Sumankumar D. Panchal, Rajesh Gogineni","doi":"10.1142/s0219467823500134","DOIUrl":"https://doi.org/10.1142/s0219467823500134","url":null,"abstract":"The optical satellite sensors encounter certain constraints on producing high-resolution multispectral (HRMS) images. Pan-sharpening (PS) is a remote sensing image fusion technique, which is an effective mechanism to overcome the limitations of available imaging products. The prevalent issue in PS algorithms is the imbalance between spatial quality and spectral details preservation, thereby producing intensity variations in the fused image. In this paper, a PS method is proposed based on convolutional sparse coding (CSC) implemented in the non-subsampled shearlet transform (NSST) domain. The source images, panchromatic (PAN) and multispectral (MS) images, are decomposed using NSST. The resultant high-frequency bands are fused using adaptive weights determined from chaotic grey wolf optimization (CGWO) algorithm. The CSC-based model is employed to fuse the low-frequency bands. Further, an iterative filtering mechanism is developed to enhance the quality of fused image. Four datasets with different geographical content like urban area, vegetation, etc. and eight existing algorithms are used for evaluation of the proposed PS method. The comprehensive visual and quantitative results approve that the proposed method accomplishes considerable improvement in spatial and spectral details equivalence in the pan-sharpened image.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124885086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Postal Automation System in Gurmukhi Script using Deep Learning 使用深度学习的Gurmukhi脚本邮政自动化系统
Pub Date : 2021-12-22 DOI: 10.1142/s0219467823500055
Sandhya Sharma, Sheifali Gupta, Neeraj Kumar, Tanvi Arora
Nowadays in the era of automation, the postal automation system is one of the major research areas. Developing a postal automation system for a nation like India is much troublesome than other nations because of India’s multi-script and multi-lingual behavior. This proposed work will be helpful in the postal automation of district names of Punjab (state) written in Gurmukhi script, which is the official language of the state in North India. For this, a holistic approach i.e. a segmentation-free technique has been used with the help of Convolutional Neural Network (CNN) and Deep learning (DL). For the purpose of recognition, a database of 22[Formula: see text]000 images (samples) which are handwritten in Gurmukhi script for all the 22 districts of Punjab is prepared. Each sample is written two times by 500 different writers generating 1000 samples for each district name. Two CNN models are proposed which are named as ConvNetGuru and ConvNetGuruMod for the purpose of recognition. Maximum validation accuracy achieved by ConvNetGuru is 90% and ConvNetGuruMod is 98%.
在当今自动化时代,邮政自动化系统是主要的研究领域之一。由于印度的多文字和多语言行为,为像印度这样的国家开发邮政自动化系统比其他国家麻烦得多。这项提议的工作将有助于旁遮普邦(邦)用古尔穆克语书写的地区名称的邮政自动化,古尔穆克语是印度北部邦的官方语言。为此,在卷积神经网络(CNN)和深度学习(DL)的帮助下,使用了一种整体方法,即无分割技术。为了识别的目的,旁遮普邦所有22个地区的22[公式:见文本]000张用古尔穆克语手写的图像(样本)的数据库已经准备好了。每个样本由500个不同的作者编写两次,为每个地区名称生成1000个样本。为了进行识别,提出了两个CNN模型,分别命名为ConvNetGuru和ConvNetGuruMod。ConvNetGuru实现的最大验证精度为90%,ConvNetGuruMod为98%。
{"title":"Postal Automation System in Gurmukhi Script using Deep Learning","authors":"Sandhya Sharma, Sheifali Gupta, Neeraj Kumar, Tanvi Arora","doi":"10.1142/s0219467823500055","DOIUrl":"https://doi.org/10.1142/s0219467823500055","url":null,"abstract":"Nowadays in the era of automation, the postal automation system is one of the major research areas. Developing a postal automation system for a nation like India is much troublesome than other nations because of India’s multi-script and multi-lingual behavior. This proposed work will be helpful in the postal automation of district names of Punjab (state) written in Gurmukhi script, which is the official language of the state in North India. For this, a holistic approach i.e. a segmentation-free technique has been used with the help of Convolutional Neural Network (CNN) and Deep learning (DL). For the purpose of recognition, a database of 22[Formula: see text]000 images (samples) which are handwritten in Gurmukhi script for all the 22 districts of Punjab is prepared. Each sample is written two times by 500 different writers generating 1000 samples for each district name. Two CNN models are proposed which are named as ConvNetGuru and ConvNetGuruMod for the purpose of recognition. Maximum validation accuracy achieved by ConvNetGuru is 90% and ConvNetGuruMod is 98%.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131665582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Spatio-Temporal Inference Transformer Network for Video Inpainting 用于视频喷漆的时空推理变压器网络
Pub Date : 2021-12-18 DOI: 10.1142/s0219467823500079
Gajanan Tudavekar, S. Saraf, Sanjay R. Patil
Video inpainting aims to complete in a visually pleasing way the missing regions in video frames. Video inpainting is an exciting task due to the variety of motions across different frames. The existing methods usually use attention models to inpaint videos by seeking the damaged content from other frames. Nevertheless, these methods suffer due to irregular attention weight from spatio-temporal dimensions, thus giving rise to artifacts in the inpainted video. To overcome the above problem, Spatio-Temporal Inference Transformer Network (STITN) has been proposed. The STITN aligns the frames to be inpainted and concurrently inpaints all the frames, and a spatio-temporal adversarial loss function improves the STITN. Our method performs considerably better than the existing deep learning approaches in quantitative and qualitative evaluation.
视频补绘旨在以视觉愉悦的方式完成视频帧中缺失的区域。视频绘画是一项令人兴奋的任务,因为各种各样的运动跨越不同的帧。现有的方法通常是利用注意力模型从其他帧中寻找损坏的内容来重新绘制视频。然而,这些方法由于来自时空维度的注意力权重不规则而受到影响,从而在绘制的视频中产生伪影。为了克服上述问题,提出了时空推理变压器网络(STITN)。stin对要绘制的帧进行对齐并同时绘制所有帧,并且一个时空对抗损失函数改进了stin。我们的方法在定量和定性评估方面比现有的深度学习方法要好得多。
{"title":"Spatio-Temporal Inference Transformer Network for Video Inpainting","authors":"Gajanan Tudavekar, S. Saraf, Sanjay R. Patil","doi":"10.1142/s0219467823500079","DOIUrl":"https://doi.org/10.1142/s0219467823500079","url":null,"abstract":"Video inpainting aims to complete in a visually pleasing way the missing regions in video frames. Video inpainting is an exciting task due to the variety of motions across different frames. The existing methods usually use attention models to inpaint videos by seeking the damaged content from other frames. Nevertheless, these methods suffer due to irregular attention weight from spatio-temporal dimensions, thus giving rise to artifacts in the inpainted video. To overcome the above problem, Spatio-Temporal Inference Transformer Network (STITN) has been proposed. The STITN aligns the frames to be inpainted and concurrently inpaints all the frames, and a spatio-temporal adversarial loss function improves the STITN. Our method performs considerably better than the existing deep learning approaches in quantitative and qualitative evaluation.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123242950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dimensionality Reduction and Visualization of Bharatanatyam Mudras 梵天手印的降维和可视化
Pub Date : 2021-12-18 DOI: 10.1142/s0219467823500018
R. Raj, S. Dharan, T. T. Sunil
Cultural dances are practiced all over the world. The study of various gestures of the performer using computer vision techniques can help in better understanding of these dance forms and for annotation purposes. Bharatanatyam is a classical dance that originated in South India. Bharatanatyam performer uses hand gestures (mudras), facial expressions and body movements to communicate to the audience the intended meaning. According to Natyashastra, a classical text on Indian dance, there are 28 Asamyukta Hastas (single-hand gestures) and 23 Samyukta Hastas (Double-hand gestures) in Bharatanatyam. Open datasets on Bharatanatyam dance gestures are not presently available. An exhaustive open dataset comprising of various mudras in Bharatanatyam was created. The dataset consists of 15[Formula: see text]396 distinct single-hand mudra images and 13[Formula: see text]035 distinct double-hand mudra images. In this paper, we explore the dataset using various multidimensional visualization techniques. PCA, Kernel PCA, Local Linear Embedding, Multidimensional Scaling, Isomap, t-SNE and PCA–t-SNE combination are being investigated. The best visualization for exploration of the dataset is obtained using PCA–t-SNE combination.
世界各地都有文化舞蹈。使用计算机视觉技术研究表演者的各种手势可以帮助更好地理解这些舞蹈形式并用于注释目的。Bharatanatyam是一种起源于印度南部的古典舞蹈。Bharatanatyam表演者使用手势(手印),面部表情和身体动作向观众传达预期的意思。根据印度舞蹈的经典文本Natyashastra,在Bharatanatyam中有28个Asamyukta hasas(单手手势)和23个Samyukta hasas(双手手势)。目前还没有Bharatanatyam舞蹈手势的开放数据集。创建了一个详尽的开放数据集,包括Bharatanatyam中的各种手印。该数据集由15[公式:见文本]396个不同的单手手印图像和13[公式:见文本]035个不同的双手手印图像组成。在本文中,我们使用各种多维可视化技术来探索数据集。主要研究了PCA、核PCA、局部线性嵌入、多维尺度、isommap、t-SNE和PCA - t-SNE组合。使用PCA-t-SNE组合获得了数据集探索的最佳可视化效果。
{"title":"Dimensionality Reduction and Visualization of Bharatanatyam Mudras","authors":"R. Raj, S. Dharan, T. T. Sunil","doi":"10.1142/s0219467823500018","DOIUrl":"https://doi.org/10.1142/s0219467823500018","url":null,"abstract":"Cultural dances are practiced all over the world. The study of various gestures of the performer using computer vision techniques can help in better understanding of these dance forms and for annotation purposes. Bharatanatyam is a classical dance that originated in South India. Bharatanatyam performer uses hand gestures (mudras), facial expressions and body movements to communicate to the audience the intended meaning. According to Natyashastra, a classical text on Indian dance, there are 28 Asamyukta Hastas (single-hand gestures) and 23 Samyukta Hastas (Double-hand gestures) in Bharatanatyam. Open datasets on Bharatanatyam dance gestures are not presently available. An exhaustive open dataset comprising of various mudras in Bharatanatyam was created. The dataset consists of 15[Formula: see text]396 distinct single-hand mudra images and 13[Formula: see text]035 distinct double-hand mudra images. In this paper, we explore the dataset using various multidimensional visualization techniques. PCA, Kernel PCA, Local Linear Embedding, Multidimensional Scaling, Isomap, t-SNE and PCA–t-SNE combination are being investigated. The best visualization for exploration of the dataset is obtained using PCA–t-SNE combination.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130050939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A New Method for Arabic Text Detection in Natural Scene Images 自然场景图像中阿拉伯语文本检测的新方法
Pub Date : 2021-12-17 DOI: 10.1142/s0219467823500109
Houda Gaddour, S. Kanoun, N. Vincent
Text in scene images can provide useful and vital information for content-based image analysis. Therefore, text detection and script identification in images are an important task. In this paper, we propose a new method for text detection in natural scene images, particularly for Arabic text, based on a bottom-up approach where four principal steps can be highlighted. The detection of extremely stable and homogeneous regions of interest (ROIs) is based on the Color Stability and Homogeneity Regions (CSHR) proposed technique. These regions are then labeled as textual or non-textual ROI. This identification is based on a structural approach. The textual ROIs are grouped to constitute zones according to spatial relations between them. Finally, the textual or non-textual nature of the constituted zones is refined. This last identification is based on handcrafted features and on features built from a Convolutional Neural Network (CNN) after learning. The proposed method was evaluated on the databases used for text detection in natural scene images: the competitions organized in 2017 edition of the International Conference on Document Analysis and Recognition (ICDAR2017), the Urdu-text database and our Natural Scene Image Database for Arabic Text detection (NSIDAT) database. The obtained experimental results seem to be interesting.
场景图像中的文本可以为基于内容的图像分析提供有用和重要的信息。因此,图像中的文本检测和脚本识别是一个重要的课题。在本文中,我们提出了一种新的自然场景图像文本检测方法,特别是阿拉伯语文本,基于自下而上的方法,其中四个主要步骤可以突出显示。极为稳定均匀感兴趣区域(roi)的检测是基于颜色稳定均匀区域(CSHR)提出的技术。然后将这些区域标记为文本或非文本ROI。这种识别是基于一种结构方法。根据文本roi之间的空间关系,将文本roi分组构成区域。最后,细化构成区域的文本或非文本性质。最后一种识别是基于手工制作的特征和学习后从卷积神经网络(CNN)构建的特征。该方法在用于自然场景图像文本检测的数据库上进行了评估:2017年国际文档分析与识别会议(ICDAR2017)、乌尔都语文本数据库和我们的阿拉伯语文本检测自然场景图像数据库(NSIDAT)数据库组织的竞赛。得到的实验结果似乎很有趣。
{"title":"A New Method for Arabic Text Detection in Natural Scene Images","authors":"Houda Gaddour, S. Kanoun, N. Vincent","doi":"10.1142/s0219467823500109","DOIUrl":"https://doi.org/10.1142/s0219467823500109","url":null,"abstract":"Text in scene images can provide useful and vital information for content-based image analysis. Therefore, text detection and script identification in images are an important task. In this paper, we propose a new method for text detection in natural scene images, particularly for Arabic text, based on a bottom-up approach where four principal steps can be highlighted. The detection of extremely stable and homogeneous regions of interest (ROIs) is based on the Color Stability and Homogeneity Regions (CSHR) proposed technique. These regions are then labeled as textual or non-textual ROI. This identification is based on a structural approach. The textual ROIs are grouped to constitute zones according to spatial relations between them. Finally, the textual or non-textual nature of the constituted zones is refined. This last identification is based on handcrafted features and on features built from a Convolutional Neural Network (CNN) after learning. The proposed method was evaluated on the databases used for text detection in natural scene images: the competitions organized in 2017 edition of the International Conference on Document Analysis and Recognition (ICDAR2017), the Urdu-text database and our Natural Scene Image Database for Arabic Text detection (NSIDAT) database. The obtained experimental results seem to be interesting.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"842 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123295540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Imputation by Chained Equations-K-Nearest Neighbors and Deep Neural Network Architecture for Kidney Disease Prediction 基于链方程- k近邻和深度神经网络结构的多重归算肾脏疾病预测
Pub Date : 2021-12-17 DOI: 10.1142/s0219467823500146
M. Fathima, R. Hariharan, S. Raja
Chronic kidney disease (CKD) is a health concern that affects people all over the world. Kidney dysfunction or impaired kidney functions are the causes of CKD. The machine learning-based prediction models are used to determine the risk level of CKD and assist healthcare practitioners in delaying and preventing the disease’s progression. The researchers proposed many prediction models for determining the CKD risk level. Although these models performed well, their precision is limited since they do not handle missing values in the clinical dataset adequately. The missing values of a clinical dataset can degrade the training outcomes that leads to false predictions. Thus, imputing missing values increases the prediction model performance. This proposed work developed a novel imputation technique by combining Multiple Imputation by Chained Equations and [Formula: see text]-Nearest Neighbors (MICE–KNN) for imputing the missing values. The experimental results show that MICE–KNN accurately predicts the missing values, and the Deep Neural Network (DNN) improves the prediction performance of the CKD model. Various metrics like mean absolute error, accuracy, specificity, Matthews correlation coefficient, the area under the curve, [Formula: see text]-score, sensitivity, and precision have been used to evaluate the proposed CKD model performance. The performance analysis exhibits that MICE–KNN with deep learning outperforms other classifiers. According to our experimental study, the MICE–KNN imputation algorithm with DNN is more appropriate for predicting the kidney disease.
慢性肾脏疾病(CKD)是影响全世界人民的健康问题。肾功能不全或肾功能受损是CKD的病因。基于机器学习的预测模型用于确定CKD的风险水平,并协助医疗从业者延迟和预防疾病的进展。研究人员提出了许多预测模型来确定CKD的风险水平。尽管这些模型表现良好,但它们的精度有限,因为它们不能充分处理临床数据集中的缺失值。临床数据集的缺失值会降低训练结果,从而导致错误的预测。因此,输入缺失值可以提高预测模型的性能。这项工作提出了一种新的imputation技术,将多重imputation与[公式:见文本]-Nearest Neighbors (MICE-KNN)相结合,用于imputation缺失值。实验结果表明,MICE-KNN能够准确预测缺失值,深度神经网络(Deep Neural Network, DNN)提高了CKD模型的预测性能。各种指标,如平均绝对误差、准确性、特异性、马修斯相关系数、曲线下面积、[公式:见文本]评分、灵敏度和精度已被用于评估所提出的CKD模型的性能。性能分析表明,结合深度学习的MICE-KNN优于其他分类器。根据我们的实验研究,结合DNN的MICE-KNN imputation算法更适合预测肾脏疾病。
{"title":"Multiple Imputation by Chained Equations-K-Nearest Neighbors and Deep Neural Network Architecture for Kidney Disease Prediction","authors":"M. Fathima, R. Hariharan, S. Raja","doi":"10.1142/s0219467823500146","DOIUrl":"https://doi.org/10.1142/s0219467823500146","url":null,"abstract":"Chronic kidney disease (CKD) is a health concern that affects people all over the world. Kidney dysfunction or impaired kidney functions are the causes of CKD. The machine learning-based prediction models are used to determine the risk level of CKD and assist healthcare practitioners in delaying and preventing the disease’s progression. The researchers proposed many prediction models for determining the CKD risk level. Although these models performed well, their precision is limited since they do not handle missing values in the clinical dataset adequately. The missing values of a clinical dataset can degrade the training outcomes that leads to false predictions. Thus, imputing missing values increases the prediction model performance. This proposed work developed a novel imputation technique by combining Multiple Imputation by Chained Equations and [Formula: see text]-Nearest Neighbors (MICE–KNN) for imputing the missing values. The experimental results show that MICE–KNN accurately predicts the missing values, and the Deep Neural Network (DNN) improves the prediction performance of the CKD model. Various metrics like mean absolute error, accuracy, specificity, Matthews correlation coefficient, the area under the curve, [Formula: see text]-score, sensitivity, and precision have been used to evaluate the proposed CKD model performance. The performance analysis exhibits that MICE–KNN with deep learning outperforms other classifiers. According to our experimental study, the MICE–KNN imputation algorithm with DNN is more appropriate for predicting the kidney disease.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123292001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Survey on Various Deep Learning Algorithms for an Efficient Facial Expression Recognition System 面向高效面部表情识别系统的各种深度学习算法综述
Pub Date : 2021-12-15 DOI: 10.1142/s0219467822400058
Rudranath Banerjee, S. De, Shouvik Dey
Facial Expression (FE) encompasses information concerning the emotional together with the physical state of a human. In the last few years, FE Recognition (FER) has turned out to be a propitious research field. FER is the chief processing technique for non-verbal intentions, and also it is a significant and propitious computer vision together with the artificial intelligence field. As a novel machine learning theory, Deep Learning (DL) not only highlights the depth of the learning model but also emphasizes the significance of Feature Learning (FL) for the network model, and it has made several research achievements in FER. Here, the present research states are examined typically from the latest FE extraction algorithm as well as the FER centered on DL. The research on classifiers gathered from recent papers discloses a more powerful as well as reliable comprehending of the peculiar traits of classifiers for research fellows. At the ending of the survey, few problems in addition to chances that are required to be tackled in the upcoming future are presented.
面部表情(FE)包含了人的情绪和身体状态的信息。在过去的几年里,FE识别(FER)已经成为一个有利的研究领域。FER是非言语意图的主要处理技术,是计算机视觉与人工智能领域的重要发展方向。作为一种新颖的机器学习理论,深度学习(Deep learning, DL)不仅突出了学习模型的深度,而且强调了特征学习(Feature learning, FL)对网络模型的重要性,并在特征学习方面取得了一些研究成果。本文主要从最新的FE提取算法和以DL为中心的FE提取算法两方面考察了目前的研究现状。从最近的论文中收集的关于分类器的研究为研究人员提供了对分类器特有特征的更有力和可靠的理解。在调查的最后,除了在即将到来的未来需要解决的机会之外,几乎没有什么问题。
{"title":"A Survey on Various Deep Learning Algorithms for an Efficient Facial Expression Recognition System","authors":"Rudranath Banerjee, S. De, Shouvik Dey","doi":"10.1142/s0219467822400058","DOIUrl":"https://doi.org/10.1142/s0219467822400058","url":null,"abstract":"Facial Expression (FE) encompasses information concerning the emotional together with the physical state of a human. In the last few years, FE Recognition (FER) has turned out to be a propitious research field. FER is the chief processing technique for non-verbal intentions, and also it is a significant and propitious computer vision together with the artificial intelligence field. As a novel machine learning theory, Deep Learning (DL) not only highlights the depth of the learning model but also emphasizes the significance of Feature Learning (FL) for the network model, and it has made several research achievements in FER. Here, the present research states are examined typically from the latest FE extraction algorithm as well as the FER centered on DL. The research on classifiers gathered from recent papers discloses a more powerful as well as reliable comprehending of the peculiar traits of classifiers for research fellows. At the ending of the survey, few problems in addition to chances that are required to be tackled in the upcoming future are presented.","PeriodicalId":177479,"journal":{"name":"Int. J. Image Graph.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128389024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Int. J. Image Graph.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1