首页 > 最新文献

2020 International Conference on Machine Vision and Image Processing (MVIP)最新文献

英文 中文
Improving Persian Digit Recognition by Combining Deep Neural Networks and SVM and Using PCA 结合深度神经网络和支持向量机及PCA改进波斯语数字识别
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116893
Amir. M Mousavi. H, A. Bossaghzadeh
One of the machine vision tasks is optical character recognition (OCR) that researchers in this field are trying to achieve a high performance and accuracy in the classification task. In this paper, we have used a fine tuned deep Neural networks for Hoda dataset, which is the largest dataset for Persian handwritten digit classification, to extract valuable discriminative features. then, these features are fed to a linear support vector machine (SVM) for classification part. In the next experiment, In order to improve the accuracy and computational load, we applied the Principal component analysis (PCA) to reduce the extracted features dimensions then we fed it to SVM. To the best of our knowledge the proposed method was better than other methods in terms of accuracy measure
光学字符识别(OCR)是机器视觉任务之一,该领域的研究人员正在努力在分类任务中实现高性能和准确性。在本文中,我们对Hoda数据集(这是波斯语手写数字分类的最大数据集)使用微调深度神经网络来提取有价值的判别特征。然后,将这些特征输入到线性支持向量机(SVM)中进行分类。在接下来的实验中,为了提高准确率和计算量,我们使用主成分分析(PCA)对提取的特征进行降维,然后将其输入支持向量机。据我们所知,所提出的方法在精度测量方面优于其他方法
{"title":"Improving Persian Digit Recognition by Combining Deep Neural Networks and SVM and Using PCA","authors":"Amir. M Mousavi. H, A. Bossaghzadeh","doi":"10.1109/MVIP49855.2020.9116893","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116893","url":null,"abstract":"One of the machine vision tasks is optical character recognition (OCR) that researchers in this field are trying to achieve a high performance and accuracy in the classification task. In this paper, we have used a fine tuned deep Neural networks for Hoda dataset, which is the largest dataset for Persian handwritten digit classification, to extract valuable discriminative features. then, these features are fed to a linear support vector machine (SVM) for classification part. In the next experiment, In order to improve the accuracy and computational load, we applied the Principal component analysis (PCA) to reduce the extracted features dimensions then we fed it to SVM. To the best of our knowledge the proposed method was better than other methods in terms of accuracy measure","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"62 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132123540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Performance Improvement of Gaussian Filter using SIMD Technology 利用SIMD技术改进高斯滤波器的性能
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116883
Maryam Moradifar, A. Shahbahrami
Denoising is an important process before applying other post-processing techniques on medical images. To obtain better quality images many denoising approaches have been introduced. Gaussian filter is a spatial domain filter, which is proper to blur and to remove noise from images. Since the Gaussian filter modifies the input signal by convolution with a Gaussian function it is a computationally intensive algorithm. Hence to enhance the performance of the algorithm, it is better to perform two 1-D convolution operations instead of one 2-D convolution operation and then parallelize it. In this paper in order to increase the performance of 1-D convolution operation, we exploit both Data- and Thread-Level Parallelism using parallel programming models such as Intrinsic Programming Model, Compiler's Automatic Vectorization and Open Multi-Processing. The experimental results were shown that the performance of our implementations is much higher than other approaches Performance improvements of Multi-threaded version of all implementations are significantly improved compared to single-core implementations, and a speedup of 52.33x obtained over the optimal scalar implementation.
去噪是医学图像其他后处理技术应用之前的重要步骤。为了获得更好的图像质量,引入了许多去噪方法。高斯滤波器是一种空间域滤波器,适合于图像的模糊处理和去噪。由于高斯滤波器通过与高斯函数的卷积来修改输入信号,因此它是一种计算量很大的算法。因此,为了提高算法的性能,最好执行两次一维卷积操作,而不是一次二维卷积操作,然后并行化。为了提高一维卷积运算的性能,我们利用并行编程模型如内在编程模型、编译器自动向量化和开放多处理来开发数据级和线程级并行性。实验结果表明,所有实现的多线程版本的性能都比单核实现有了明显的提高,比最优标量实现的速度提高了52.33倍。
{"title":"Performance Improvement of Gaussian Filter using SIMD Technology","authors":"Maryam Moradifar, A. Shahbahrami","doi":"10.1109/MVIP49855.2020.9116883","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116883","url":null,"abstract":"Denoising is an important process before applying other post-processing techniques on medical images. To obtain better quality images many denoising approaches have been introduced. Gaussian filter is a spatial domain filter, which is proper to blur and to remove noise from images. Since the Gaussian filter modifies the input signal by convolution with a Gaussian function it is a computationally intensive algorithm. Hence to enhance the performance of the algorithm, it is better to perform two 1-D convolution operations instead of one 2-D convolution operation and then parallelize it. In this paper in order to increase the performance of 1-D convolution operation, we exploit both Data- and Thread-Level Parallelism using parallel programming models such as Intrinsic Programming Model, Compiler's Automatic Vectorization and Open Multi-Processing. The experimental results were shown that the performance of our implementations is much higher than other approaches Performance improvements of Multi-threaded version of all implementations are significantly improved compared to single-core implementations, and a speedup of 52.33x obtained over the optimal scalar implementation.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115932878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Gaussian Soft Margin Angular Loss for Face Recognition 人脸识别的高斯软边缘角损失
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116917
Bahman Rouhani, Alireza Samadzadeh, M. Rahmati, A. Nickabadi
Advances in deep learning has lead to drastic improvements in face recognition. A key part of these deep models is their loss function. Consequently developing an efficient and suitable loss function has been an important topic in face recognition in the recent years. Angular-margin-based losses achieve an acceptable performance and inter-class separability. However they are held back by their enforcement of hard margins on all the samples of the training dataset, regardless of whether these samples actually differ from all the other classes enough to enforce a margin. It can be argued that in a large enough dataset with many different settings and age gaps, some faces will look similar to the faces of other classes. In an intuitive and expressive embedding, we expect some faces to be embedded near similar classes with a small margin. Thus we propose a loss function that while maximizing the inter-class distance and intra-class compactness, allows for the samples which naturally reside further from class center to have a smaller margin. We implement an extremely light and fast to train model using MobileNets and achieve accuracy comparable to state of the art method.
深度学习的进步导致了人脸识别的巨大进步。这些深度模型的一个关键部分是它们的损失函数。因此,开发一种高效、合适的损失函数已成为近年来人脸识别领域的一个重要课题。基于角边缘的损失获得了可接受的性能和类间可分性。然而,他们在训练数据集的所有样本上强制执行硬边际,而不管这些样本是否与所有其他类别的差异足以强制执行边际,他们都受到阻碍。可以认为,在具有许多不同设置和年龄差距的足够大的数据集中,有些面孔看起来与其他类别的面孔相似。在直观和富有表现力的嵌入中,我们期望一些面孔被嵌入在相似的类附近,并具有较小的边距。因此,我们提出了一个损失函数,它在最大化类间距离和类内紧凑性的同时,允许离类中心较远的样本具有较小的裕度。我们使用MobileNets实现了一个非常轻和快速的训练模型,并实现了与最先进方法相当的精度。
{"title":"Gaussian Soft Margin Angular Loss for Face Recognition","authors":"Bahman Rouhani, Alireza Samadzadeh, M. Rahmati, A. Nickabadi","doi":"10.1109/MVIP49855.2020.9116917","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116917","url":null,"abstract":"Advances in deep learning has lead to drastic improvements in face recognition. A key part of these deep models is their loss function. Consequently developing an efficient and suitable loss function has been an important topic in face recognition in the recent years. Angular-margin-based losses achieve an acceptable performance and inter-class separability. However they are held back by their enforcement of hard margins on all the samples of the training dataset, regardless of whether these samples actually differ from all the other classes enough to enforce a margin. It can be argued that in a large enough dataset with many different settings and age gaps, some faces will look similar to the faces of other classes. In an intuitive and expressive embedding, we expect some faces to be embedded near similar classes with a small margin. Thus we propose a loss function that while maximizing the inter-class distance and intra-class compactness, allows for the samples which naturally reside further from class center to have a smaller margin. We implement an extremely light and fast to train model using MobileNets and achieve accuracy comparable to state of the art method.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114484321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Defect Image Enhancement Approach for Detection of Defective Area in CFRPs Through Local Defect Resonance 基于局部缺陷共振的CFRPs缺陷图像增强检测方法
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116901
Saman Hadi, Reza P. R. Hasanzadeh, M. Kersemans
Nowadays composite materials such as carbon fiber reinforced polymers (CFRP)s have been widely used in industrial applications. But, they are susceptible to impact damage and subsequent fatigue cracking and delamination which in long term lead to some negative consequences such as erosion and also breaking the material. Due to the inability to visually observe such defects and also the high sensitivity of industrial components to invasive inspections, non-destructive testing (NDT) techniques are used to deal with the aforementioned problems. In this regards, an ultrasound-based NDT technique called Local defect resonance (LDR) leads to remarkable results for detecting various types of defects in CFRPs. In LDR technique, high frequency acoustical vibrations are used to get a localized resonant activation of a defective region such that these excitation frequencies lead to a significant increase of the vibration amplitude in the defective area relative to the sound area. The problem which arises is that in order to properly localize the defect, the defect resonance frequency must be known which is practically impossible. In this paper, a new defect imaging methodology is proposed, which can localize the defects without any prior knowledge about their location and resonance frequencies. Experiments are performed on a CFRP sample with flat bottom hole (FBH) defects and the proposed method has been quantitatively validated through the experiments by using the signal-to-noise ratio (SNR) criterion. The results show the superiority of our method over some well-known algorithms.
目前,碳纤维增强聚合物(CFRP)等复合材料在工业上得到了广泛应用。但是,它们很容易受到冲击损伤和随后的疲劳开裂和分层,这在长期内会导致一些负面后果,如侵蚀和破坏材料。由于无法直观地观察到这些缺陷,以及工业部件对侵入式检测的高灵敏度,因此使用无损检测技术来处理上述问题。在这方面,一种基于超声的无损检测技术,称为局部缺陷共振(LDR),在检测cfrp中各种类型的缺陷方面取得了显著的效果。在LDR技术中,使用高频声振动来获得缺陷区域的局部共振激活,使得这些激励频率导致缺陷区域相对于声区域的振动幅值显着增加。由此产生的问题是,为了正确地定位缺陷,必须知道缺陷的共振频率,这实际上是不可能的。本文提出了一种新的缺陷成像方法,该方法可以在不知道缺陷位置和共振频率的情况下对缺陷进行定位。以具有平底孔(FBH)缺陷的CFRP试样为实验对象,采用信噪比(SNR)准则对该方法进行了定量验证。结果表明,该方法优于一些已知的算法。
{"title":"A Defect Image Enhancement Approach for Detection of Defective Area in CFRPs Through Local Defect Resonance","authors":"Saman Hadi, Reza P. R. Hasanzadeh, M. Kersemans","doi":"10.1109/MVIP49855.2020.9116901","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116901","url":null,"abstract":"Nowadays composite materials such as carbon fiber reinforced polymers (CFRP)s have been widely used in industrial applications. But, they are susceptible to impact damage and subsequent fatigue cracking and delamination which in long term lead to some negative consequences such as erosion and also breaking the material. Due to the inability to visually observe such defects and also the high sensitivity of industrial components to invasive inspections, non-destructive testing (NDT) techniques are used to deal with the aforementioned problems. In this regards, an ultrasound-based NDT technique called Local defect resonance (LDR) leads to remarkable results for detecting various types of defects in CFRPs. In LDR technique, high frequency acoustical vibrations are used to get a localized resonant activation of a defective region such that these excitation frequencies lead to a significant increase of the vibration amplitude in the defective area relative to the sound area. The problem which arises is that in order to properly localize the defect, the defect resonance frequency must be known which is practically impossible. In this paper, a new defect imaging methodology is proposed, which can localize the defects without any prior knowledge about their location and resonance frequencies. Experiments are performed on a CFRP sample with flat bottom hole (FBH) defects and the proposed method has been quantitatively validated through the experiments by using the signal-to-noise ratio (SNR) criterion. The results show the superiority of our method over some well-known algorithms.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124801679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Retina-Inspired Multiresolution Analysis Framework for Pansharpening 视网膜启发的泛锐化多分辨率分析框架
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116884
Mehran Maneshi, H. Ghassemian, Ghassem Khademi, M. Imani
Technical limitations on the satellite sensors make a trade-off between the spectral and spatial resolution in remotely sensed images. To deal with this issue, pansharpening has been emerged to prepare a single image with the high spatial and spectral resolution, simultaneously. This paper presents a pansharpening approach based on the retina-inspired model and the multiresolution analysis (MRA) framework. The retina- inspired model is simplified by the difference of Gaussian (DoG) operator, and we apply it to the panchromatic image to extract the spatial details. Furthermore, the injection gains in the MRA framework are calculated through an iterative process where the gains at each iteration are updated based on the fusion result obtained from its previous iteration. To investigate the performance of the proposed model, it is compared with some classical pansharpening approaches with two data sets captured by the GeoEye-1 and Pléiades satellite imagery sensors. The experimental results show the proposed retina-inspired pansharpening method acts well in injecting the spatial information along with reducing the spectral distortion.
卫星传感器的技术限制使得遥感图像的光谱分辨率和空间分辨率之间必须进行权衡。为了解决这一问题,出现了一种同时制备高空间和光谱分辨率的单幅图像的泛锐化技术。提出了一种基于视网膜启发模型和多分辨率分析(MRA)框架的泛锐化方法。采用差分高斯算子(DoG)对视网膜模型进行简化,并将其应用于全色图像中提取空间细节。此外,通过迭代过程计算MRA框架中的注入增益,其中每次迭代的增益是基于前一次迭代获得的融合结果更新的。为了研究该模型的性能,用GeoEye-1和plimadades卫星图像传感器捕获的两个数据集与一些经典的泛锐化方法进行了比较。实验结果表明,基于视网膜的泛锐化方法能够很好地注入空间信息,降低光谱畸变。
{"title":"A Retina-Inspired Multiresolution Analysis Framework for Pansharpening","authors":"Mehran Maneshi, H. Ghassemian, Ghassem Khademi, M. Imani","doi":"10.1109/MVIP49855.2020.9116884","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116884","url":null,"abstract":"Technical limitations on the satellite sensors make a trade-off between the spectral and spatial resolution in remotely sensed images. To deal with this issue, pansharpening has been emerged to prepare a single image with the high spatial and spectral resolution, simultaneously. This paper presents a pansharpening approach based on the retina-inspired model and the multiresolution analysis (MRA) framework. The retina- inspired model is simplified by the difference of Gaussian (DoG) operator, and we apply it to the panchromatic image to extract the spatial details. Furthermore, the injection gains in the MRA framework are calculated through an iterative process where the gains at each iteration are updated based on the fusion result obtained from its previous iteration. To investigate the performance of the proposed model, it is compared with some classical pansharpening approaches with two data sets captured by the GeoEye-1 and Pléiades satellite imagery sensors. The experimental results show the proposed retina-inspired pansharpening method acts well in injecting the spatial information along with reducing the spectral distortion.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129626960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Iranian License Plate Recognition using Deep Learning 伊朗车牌识别使用深度学习
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116897
Atefeh Ranjkesh Rashtehroudi, A. Shahbahrami, Alireza Akoushideh
Automated License Plate Recognition (ALPR) has many applications in intelligent transport system. The ALPR has three main steps, License Plate (LP) localization, segmentation and Optical Character Recognition (OCR). Each step needs different techniques in real condition and each technique has its specific characteristics. The LP localization techniques detect the LP after that segmentation algorithms should segment and isolate each character from each other. Finally, the OCR step is applied to recognize the separated characters. The final accuracy depends on the accuracy of each step. To improve the OCR step performance, we combine both segmentation and OCR steps as a single-stage using deep learning techniques such as the You Only Look Once (YOLO) framework. Our experimental results show that this proposed approach recognizes the Iranian LP characters with accuracy 99.2% compared to previous works.
车牌自动识别技术在智能交通系统中有着广泛的应用。ALPR有三个主要步骤:车牌定位、分割和光学字符识别(OCR)。每个步骤在实际条件下都需要不同的技术,每种技术都有其特定的特点。LP定位技术对LP进行检测,然后分割算法对每个字符进行分割和分离。最后,应用OCR步骤对分离字符进行识别。最终的精度取决于每一步的精度。为了提高OCR步骤的性能,我们使用深度学习技术(如You Only Look Once (YOLO)框架)将分割和OCR步骤结合为一个单阶段。实验结果表明,该方法对伊朗语LP字符的识别准确率达到99.2%。
{"title":"Iranian License Plate Recognition using Deep Learning","authors":"Atefeh Ranjkesh Rashtehroudi, A. Shahbahrami, Alireza Akoushideh","doi":"10.1109/MVIP49855.2020.9116897","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116897","url":null,"abstract":"Automated License Plate Recognition (ALPR) has many applications in intelligent transport system. The ALPR has three main steps, License Plate (LP) localization, segmentation and Optical Character Recognition (OCR). Each step needs different techniques in real condition and each technique has its specific characteristics. The LP localization techniques detect the LP after that segmentation algorithms should segment and isolate each character from each other. Finally, the OCR step is applied to recognize the separated characters. The final accuracy depends on the accuracy of each step. To improve the OCR step performance, we combine both segmentation and OCR steps as a single-stage using deep learning techniques such as the You Only Look Once (YOLO) framework. Our experimental results show that this proposed approach recognizes the Iranian LP characters with accuracy 99.2% compared to previous works.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133000564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Using imaging Photoplethysmography (iPPG) Signal for Blood Pressure Estimation 利用成像光容积脉搏波(iPPG)信号估计血压
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116902
Reza Heydari Goudarzi, Seyedeh Somayyeh Mousavi, M. Charmi
Blood Pressure (BP) is one of the most important vital signs of the human body, which its value provides valuable physiological information about cardiac function for physicians. In recent years, many types of research have been done in the field of BP estimation using Photoplethysmography (PPG) signal. On the other hand, the results of the studies on Heart Rate (HR) and Respiration Rate (RR) calculations have been reported using the imaging Photoplethysmography (iPPG) signal. The iPPG signal is a kind of PPG signal that is recorded in a non-contact method using a camera.This paper is among the first studies to provide a new algorithm for estimating BP values using the only iPPG signal and with a non-contact method. The validity of the proposed algorithm was evaluated in a gathered database with 40 people. The algorithm in estimation of the Diastolic Blood Pressure (DBP) was able to achieve mean error of −0.2 and standard deviation of 6.41 mmHg and in estimation of the Systolic Blood Pressure (SBP) was able to achieve mean error of 0.45 and standard deviation of 12.39 mmHg.
血压(BP)是人体最重要的生命体征之一,它的价值为医生提供了有价值的心功能生理信息。近年来,利用光容积脉搏波(Photoplethysmography, PPG)信号进行BP估计的研究较多。另一方面,使用成像光容积脉搏波(iPPG)信号计算心率(HR)和呼吸速率(RR)的研究结果也有报道。iPPG信号是一种利用摄像机以非接触方式记录的PPG信号。本文是第一批利用唯一的iPPG信号和非接触方法估计BP值的新算法的研究之一。该算法的有效性在一个40人的数据库中进行了评估。舒张压(DBP)估计算法的平均误差为- 0.2,标准差为6.41 mmHg,收缩压(SBP)估计算法的平均误差为0.45,标准差为12.39 mmHg。
{"title":"Using imaging Photoplethysmography (iPPG) Signal for Blood Pressure Estimation","authors":"Reza Heydari Goudarzi, Seyedeh Somayyeh Mousavi, M. Charmi","doi":"10.1109/MVIP49855.2020.9116902","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116902","url":null,"abstract":"Blood Pressure (BP) is one of the most important vital signs of the human body, which its value provides valuable physiological information about cardiac function for physicians. In recent years, many types of research have been done in the field of BP estimation using Photoplethysmography (PPG) signal. On the other hand, the results of the studies on Heart Rate (HR) and Respiration Rate (RR) calculations have been reported using the imaging Photoplethysmography (iPPG) signal. The iPPG signal is a kind of PPG signal that is recorded in a non-contact method using a camera.This paper is among the first studies to provide a new algorithm for estimating BP values using the only iPPG signal and with a non-contact method. The validity of the proposed algorithm was evaluated in a gathered database with 40 people. The algorithm in estimation of the Diastolic Blood Pressure (DBP) was able to achieve mean error of −0.2 and standard deviation of 6.41 mmHg and in estimation of the Systolic Blood Pressure (SBP) was able to achieve mean error of 0.45 and standard deviation of 12.39 mmHg.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126898810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Brain Tumor Segmentation in MRI Images Using a Hybrid Deep Network Based on Patch and Pixel 基于Patch和Pixel混合深度网络的MRI图像脑肿瘤分割
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116880
F. Derikvand, Hassan Khotanlou
In recent years, many segmentation methods have been proposed for brain tumor segmentation, among them deeplearning approaches have good performance and have provided better results than other methods. In this paper, an algorithm based on deep neural networks for segmentation of gliomas tumor is presented which is a combination of different Convolutional Neural Network (CNN) architectures. The proposed method uses local and global features of the brain tissue and consists of preprocessing and post-processing steps which leads to better segmentation. The accuracy of the results was evaluated using the dice score coefficient and the sensitivity on the images obtained from two modalities, Flair and T1, from the BraTs2017 data set and achieved acceptable results compared to state of the art methods.
近年来,针对脑肿瘤分割提出了许多分割方法,其中深度学习方法性能较好,效果优于其他方法。本文提出了一种基于深度神经网络的神经胶质瘤分割算法,该算法结合了不同的卷积神经网络(CNN)结构。该方法利用了脑组织的局部特征和全局特征,并分为预处理和后处理两个步骤,从而实现了更好的分割。使用骰子得分系数和对来自BraTs2017数据集的两种模式(Flair和T1)获得的图像的灵敏度来评估结果的准确性,与最先进的方法相比,获得了可接受的结果。
{"title":"Brain Tumor Segmentation in MRI Images Using a Hybrid Deep Network Based on Patch and Pixel","authors":"F. Derikvand, Hassan Khotanlou","doi":"10.1109/MVIP49855.2020.9116880","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116880","url":null,"abstract":"In recent years, many segmentation methods have been proposed for brain tumor segmentation, among them deeplearning approaches have good performance and have provided better results than other methods. In this paper, an algorithm based on deep neural networks for segmentation of gliomas tumor is presented which is a combination of different Convolutional Neural Network (CNN) architectures. The proposed method uses local and global features of the brain tissue and consists of preprocessing and post-processing steps which leads to better segmentation. The accuracy of the results was evaluated using the dice score coefficient and the sensitivity on the images obtained from two modalities, Flair and T1, from the BraTs2017 data set and achieved acceptable results compared to state of the art methods.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130343841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Monitoring Wrist and Fingers Range of Motion using Leap Motion Camera for Physical Rehabilitation 使用Leap运动相机监测手腕和手指的运动范围,用于物理康复
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116876
M. Kavian, A. Nadian-Ghomsheh
Computer vision-based health monitoring systems have gained vast attention especially for physical rehabilitation in the past few years. This paper presents a method for measuring the flexibility of wrist and fingers using leap motion camera. Leap motion was incorporated to acquire the 3D position of hand joints. From the acquired joints, using spatial-temporal features of hand joints, physical exercises targeted at rehabilitating the fingers and wrist range of motion were recognized. Then, appropriate joints selected from the recognized exercises were applied to measure the target range of motion. Apart from the proposed method, the accuracy of leap motion sensor for wrist and fingers range of motion was verified against standard goniometry. Furthermore, the dataset created for this study is published and made publically available for further research in this field. Results of the study showed that leap motion shows promising results for measuring range of motion for several wrist and fingers rehabilitation exercises.
近年来,基于计算机视觉的健康监测系统受到了广泛的关注,特别是在身体康复方面。提出了一种利用跳跃运动摄像机测量腕部和手指柔韧性的方法。采用跳跃运动获取手关节的三维位置。从获得的关节中,利用手部关节的时空特征,识别出旨在恢复手指和手腕活动范围的体育锻炼。然后,从识别的运动中选择适当的关节来测量目标运动范围。在此基础上,根据标准角度测量法验证了跳跃运动传感器对手腕和手指运动范围的精度。此外,为本研究创建的数据集已发布并公开供该领域的进一步研究使用。研究结果表明,跳跃运动在测量手腕和手指康复练习的运动范围方面显示出很好的结果。
{"title":"Monitoring Wrist and Fingers Range of Motion using Leap Motion Camera for Physical Rehabilitation","authors":"M. Kavian, A. Nadian-Ghomsheh","doi":"10.1109/MVIP49855.2020.9116876","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116876","url":null,"abstract":"Computer vision-based health monitoring systems have gained vast attention especially for physical rehabilitation in the past few years. This paper presents a method for measuring the flexibility of wrist and fingers using leap motion camera. Leap motion was incorporated to acquire the 3D position of hand joints. From the acquired joints, using spatial-temporal features of hand joints, physical exercises targeted at rehabilitating the fingers and wrist range of motion were recognized. Then, appropriate joints selected from the recognized exercises were applied to measure the target range of motion. Apart from the proposed method, the accuracy of leap motion sensor for wrist and fingers range of motion was verified against standard goniometry. Furthermore, the dataset created for this study is published and made publically available for further research in this field. Results of the study showed that leap motion shows promising results for measuring range of motion for several wrist and fingers rehabilitation exercises.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124225753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Recognizing Persian Automobile license plates under adverse rainy conditions 在恶劣的雨天条件下识别波斯汽车牌照
Pub Date : 2020-02-01 DOI: 10.1109/MVIP49855.2020.9116886
Hossein Rezaei, Maryam Haghshenas, Mahboobehsadat Yasini
This study is focused on identifying Persian license plate of Iranian cars in different rain conditions, with different distances and lighting, with simple and complex backgrounds and different angles of stationary cars. A method that is applicable to automated license plate identification systems, which is a type of intelligent transportation system. Systems that have been localized due to the variety of appearance of car license plates in different countries are currently being researched in many countries. Among the important challenges in identifying a vehicle license plate are inappropriate conditions such as adverse weather conditions such as rainy weather, snow, fog and dust, which make it difficult to identify license plates. The proposed method, which is a simple yet efficient method, employs many image processing techniques and morphology operations, and the results of implementing the proposed algorithm in MATLAB 2019b software on 420 Color image of car under low rainfall conditions, moderate rainfall and severe rainfall and storm show accuracy of 81%, 61.5% and 10.5% accuracy in identifying plaque IDs and their separation, respectively.
本研究的重点是在不同的降雨条件下,在不同的距离和光照下,在简单和复杂的背景下,在不同的静止汽车角度下,识别伊朗汽车的波斯语车牌。一种适用于自动车牌识别系统的方法,自动车牌识别系统是智能交通系统的一种。由于不同国家的汽车牌照外观的多样性,已经本地化的系统目前正在许多国家进行研究。车牌识别的重要挑战之一是不适当的条件,如恶劣的天气条件,如雨天、雪、雾和灰尘,这使得车牌识别变得困难。该方法采用了多种图像处理技术和形态学运算,是一种简单而高效的方法,在MATLAB 2019b软件中对420张汽车彩色图像进行了低降雨、中降雨和强降雨及暴雨条件下的斑块id识别和分离准确率分别为81%、61.5%和10.5%。
{"title":"Recognizing Persian Automobile license plates under adverse rainy conditions","authors":"Hossein Rezaei, Maryam Haghshenas, Mahboobehsadat Yasini","doi":"10.1109/MVIP49855.2020.9116886","DOIUrl":"https://doi.org/10.1109/MVIP49855.2020.9116886","url":null,"abstract":"This study is focused on identifying Persian license plate of Iranian cars in different rain conditions, with different distances and lighting, with simple and complex backgrounds and different angles of stationary cars. A method that is applicable to automated license plate identification systems, which is a type of intelligent transportation system. Systems that have been localized due to the variety of appearance of car license plates in different countries are currently being researched in many countries. Among the important challenges in identifying a vehicle license plate are inappropriate conditions such as adverse weather conditions such as rainy weather, snow, fog and dust, which make it difficult to identify license plates. The proposed method, which is a simple yet efficient method, employs many image processing techniques and morphology operations, and the results of implementing the proposed algorithm in MATLAB 2019b software on 420 Color image of car under low rainfall conditions, moderate rainfall and severe rainfall and storm show accuracy of 81%, 61.5% and 10.5% accuracy in identifying plaque IDs and their separation, respectively.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126103559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2020 International Conference on Machine Vision and Image Processing (MVIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1