首页 > 最新文献

Proceedings of the 3rd International Conference on Graphics and Signal Processing最新文献

英文 中文
Snake Image Classification using Siamese Networks 基于暹罗网络的蛇图像分类
C. Abeysinghe, A. Welivita, I. Perera
Research into deep learning models suitable for small data sets is still in an immature state since it has received less attention from the machine learning community. Identifying a snake species using images, is a classification problem which has a number of medical, educational and safety-related importance but no large data set. Due to the lack of large data sets and difficulty in collecting such data set, no one has applied deep learning algorithms, to solve this problem. In this paper, we explored the applicability of single shot learning techniques along with deep neural networks to solve the snake image classification problem. By using a convolutional architecture, we were able to achieve strong results and did a comparative analysis with human results.
对于适合小数据集的深度学习模型的研究还处于不成熟的状态,因为它受到机器学习社区的关注较少。使用图像识别蛇的种类是一个分类问题,具有许多医学、教育和安全相关的重要性,但没有大型数据集。由于缺乏大型数据集和收集这些数据集的难度,没有人应用深度学习算法来解决这个问题。在本文中,我们探索了单镜头学习技术与深度神经网络在解决蛇图像分类问题中的适用性。通过使用卷积架构,我们能够获得强大的结果,并与人类的结果进行了比较分析。
{"title":"Snake Image Classification using Siamese Networks","authors":"C. Abeysinghe, A. Welivita, I. Perera","doi":"10.1145/3338472.3338476","DOIUrl":"https://doi.org/10.1145/3338472.3338476","url":null,"abstract":"Research into deep learning models suitable for small data sets is still in an immature state since it has received less attention from the machine learning community. Identifying a snake species using images, is a classification problem which has a number of medical, educational and safety-related importance but no large data set. Due to the lack of large data sets and difficulty in collecting such data set, no one has applied deep learning algorithms, to solve this problem. In this paper, we explored the applicability of single shot learning techniques along with deep neural networks to solve the snake image classification problem. By using a convolutional architecture, we were able to achieve strong results and did a comparative analysis with human results.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"379 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114889673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Two-Dimensional Discriminative Feature Selection for Image Recognition 图像识别的二维判别特征选择
Yong Zhao, Yongjie Chu, Lindu Zhao
In many computer vision tasks, the available original data is in matrix form. Traditional methods often convert a matrix into a vector before processing them. This kind of methods not only ignore the location information of matrix elements, but have to deal with the high-dimensional vectors. Two-dimensional linear discriminant analysis (2DLDA) is a widely used approach in image recognition which works with data matrix directly and computes efficiently. When mapping the original data onto low-dimensional space, however, the two projection matrices of 2DLDA cannot remove features with little or no information, resulting in redundant features in the projected space. To address the problem, in this paper we propose an algorithm named two-dimensional discriminative feature selection (2DDFS) for bidirectional direct feature selection on matrix data directly. Based on 2DLDA, it employs the l2,1 norm to regularize the two transformation matrices when learning them. To obtain the optimal solutions, we design an effective optimization algorithm. Then we conduct experiments on two image databases to evaluate the performance of the proposed method, by comparing with other related methods. The promising results demonstrate the effectiveness of our method.
在许多计算机视觉任务中,可用的原始数据是矩阵形式的。传统的方法通常是先将矩阵转换成向量再进行处理。这种方法不仅忽略了矩阵元素的位置信息,而且必须处理高维向量。二维线性判别分析(2DLDA)是一种应用广泛的图像识别方法,它直接处理数据矩阵,计算效率高。然而,当将原始数据映射到低维空间时,2DLDA的两个投影矩阵无法去除信息很少或没有信息的特征,导致投影空间中存在冗余特征。为了解决这一问题,本文提出了一种二维判别特征选择(2DDFS)算法,用于直接对矩阵数据进行双向直接特征选择。该方法基于2DLDA,在学习两个变换矩阵时,采用l2,1范数对其进行正则化。为了得到最优解,我们设计了一种有效的优化算法。然后,我们在两个图像数据库上进行了实验,通过与其他相关方法的比较来评估所提出方法的性能。结果表明了该方法的有效性。
{"title":"Two-Dimensional Discriminative Feature Selection for Image Recognition","authors":"Yong Zhao, Yongjie Chu, Lindu Zhao","doi":"10.1145/3338472.3338478","DOIUrl":"https://doi.org/10.1145/3338472.3338478","url":null,"abstract":"In many computer vision tasks, the available original data is in matrix form. Traditional methods often convert a matrix into a vector before processing them. This kind of methods not only ignore the location information of matrix elements, but have to deal with the high-dimensional vectors. Two-dimensional linear discriminant analysis (2DLDA) is a widely used approach in image recognition which works with data matrix directly and computes efficiently. When mapping the original data onto low-dimensional space, however, the two projection matrices of 2DLDA cannot remove features with little or no information, resulting in redundant features in the projected space. To address the problem, in this paper we propose an algorithm named two-dimensional discriminative feature selection (2DDFS) for bidirectional direct feature selection on matrix data directly. Based on 2DLDA, it employs the l2,1 norm to regularize the two transformation matrices when learning them. To obtain the optimal solutions, we design an effective optimization algorithm. Then we conduct experiments on two image databases to evaluate the performance of the proposed method, by comparing with other related methods. The promising results demonstrate the effectiveness of our method.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125863074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing the Knowledge Management System: A Case Study Approach in IT Consultant Company 知识管理系统的设计:以IT咨询公司为例
Jeffrey Johannes Austen Bongku, Yohannes Kurniawan
The goal of this research is to identify, analyze and design an effective knowledge management system for XYZ company in Indonesia. As the growing of consultant company in Indonesia, XYZ company need to keep the valuable knowledge, so they can use it in the future. The analyze and design method used by this paper are: Wiig Knowledge Management Cycle, Nonaka and Takeuchi SECI Model, and object-oriented analysis and design. And for the data collection, the authors did the interview with the enterprise solution manager and observation in the company to capture company daily activities. The results are the knowledge management system implemented to capture, manage and use the company knowledge work more effective. We can conclude that the knowledge management system can help the company to maintain their knowledge.
本研究的目的是为印度尼西亚的XYZ公司识别、分析和设计一个有效的知识管理系统。随着印尼咨询公司的成长,XYZ公司需要保留有价值的知识,以便将来使用。本文采用的分析与设计方法有:Wiig知识管理周期、Nonaka和Takeuchi SECI模型和面向对象的分析与设计。在数据收集方面,作者对企业解决方案经理进行了访谈,并在公司进行了观察,以捕捉公司的日常活动。其结果是实施了知识管理系统,以更有效地捕获、管理和利用公司的知识工作。我们可以得出结论,知识管理系统可以帮助公司维护他们的知识。
{"title":"Designing the Knowledge Management System: A Case Study Approach in IT Consultant Company","authors":"Jeffrey Johannes Austen Bongku, Yohannes Kurniawan","doi":"10.1145/3338472.3338473","DOIUrl":"https://doi.org/10.1145/3338472.3338473","url":null,"abstract":"The goal of this research is to identify, analyze and design an effective knowledge management system for XYZ company in Indonesia. As the growing of consultant company in Indonesia, XYZ company need to keep the valuable knowledge, so they can use it in the future. The analyze and design method used by this paper are: Wiig Knowledge Management Cycle, Nonaka and Takeuchi SECI Model, and object-oriented analysis and design. And for the data collection, the authors did the interview with the enterprise solution manager and observation in the company to capture company daily activities. The results are the knowledge management system implemented to capture, manage and use the company knowledge work more effective. We can conclude that the knowledge management system can help the company to maintain their knowledge.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126158738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Deep Learning for Gastric Pathology Detection in Endoscopic Images 基于深度学习的胃内镜病理检测
V. Khryashchev, O. Stepanova, A. Lebedev, S. Kashin, R. Kuvaev
Computer-aided diagnosis of cancer based on endoscopic image analysis is a promising area in the field of computer vision and machine learning. Convolutional neural networks are one of the most popular approaches in endoscopic image analysis. This paper presents the algorithm of pathology detection in endoscopic images of gastric lesions based on convolutional neural network. Training and testing of the algorithm was carried out on the NVIDIA DGX-1 supercomputer using endoscopic images from the test base, assembled together with the Yaroslavl Regional Cancer Hospital. As a result of experiments, the mAP metric was calculated and the value was 0.875, which is a high result for the task of object detection in images.
基于内窥镜图像分析的计算机辅助癌症诊断是计算机视觉和机器学习领域的一个有前途的领域。卷积神经网络是内窥镜图像分析中最流行的方法之一。提出了一种基于卷积神经网络的胃病变内镜图像病理检测算法。算法的训练和测试在NVIDIA DGX-1超级计算机上进行,使用来自测试基地的内窥镜图像,与雅罗斯拉夫尔地区癌症医院一起组装。实验结果表明,mAP度量值为0.875,对于图像中的目标检测任务来说,这是一个较高的结果。
{"title":"Deep Learning for Gastric Pathology Detection in Endoscopic Images","authors":"V. Khryashchev, O. Stepanova, A. Lebedev, S. Kashin, R. Kuvaev","doi":"10.1145/3338472.3338492","DOIUrl":"https://doi.org/10.1145/3338472.3338492","url":null,"abstract":"Computer-aided diagnosis of cancer based on endoscopic image analysis is a promising area in the field of computer vision and machine learning. Convolutional neural networks are one of the most popular approaches in endoscopic image analysis. This paper presents the algorithm of pathology detection in endoscopic images of gastric lesions based on convolutional neural network. Training and testing of the algorithm was carried out on the NVIDIA DGX-1 supercomputer using endoscopic images from the test base, assembled together with the Yaroslavl Regional Cancer Hospital. As a result of experiments, the mAP metric was calculated and the value was 0.875, which is a high result for the task of object detection in images.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126362333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
3D Human Pose Dataset Augmentation Using Generative Adversarial Network 基于生成对抗网络的三维人体姿态数据集增强
Huyuan ShangGuan, R. Mukundan
Methods for 3D human pose estimation from monocular images based on convolutional neural networks require a large number of training data of well annotated pose-image pairs. Although many 3D human pose datasets have been created, more training data with accurate 3D annotation is still in shortage for the training of neural networks. Recently in image generation area, techniques based on generative adversarial network show potential to generate realistic image data from user input pose data. In this paper, we propose a neural network-based method for 3D human pose dataset augmentation. In our method, we use an autoencoder to learn a latent representation of existing pose data and to produce new poses of similar style. Human pose data generated by autoencoder is input into a generative adversarial network to synthesize mask images with an actor performing the same style, which can be transformed to colored images at the end. For the evaluation of the proposed method, we augment it with a small amount of labeled data. The experimental analysis shows that our method can generate more valid labeled data from small labeled data, which can boost the training of pose estimation using neural networks.
基于卷积神经网络的单眼图像三维人体姿态估计方法需要大量带注释的姿态-图像对训练数据。虽然已经创建了许多三维人体姿态数据集,但对于神经网络的训练来说,仍然缺乏更多具有准确三维注释的训练数据。最近在图像生成领域,基于生成对抗网络的技术显示出从用户输入的姿态数据生成真实图像数据的潜力。本文提出了一种基于神经网络的三维人体姿态数据增强方法。在我们的方法中,我们使用一个自动编码器来学习现有姿势数据的潜在表示,并产生类似风格的新姿势。将自动编码器生成的人体姿态数据输入到生成对抗网络中,合成具有相同风格的演员的面具图像,最后将其转换为彩色图像。为了评估所提出的方法,我们使用少量标记数据对其进行扩充。实验分析表明,该方法可以从小的标记数据中生成更有效的标记数据,从而提高神经网络姿态估计的训练效率。
{"title":"3D Human Pose Dataset Augmentation Using Generative Adversarial Network","authors":"Huyuan ShangGuan, R. Mukundan","doi":"10.1145/3338472.3338475","DOIUrl":"https://doi.org/10.1145/3338472.3338475","url":null,"abstract":"Methods for 3D human pose estimation from monocular images based on convolutional neural networks require a large number of training data of well annotated pose-image pairs. Although many 3D human pose datasets have been created, more training data with accurate 3D annotation is still in shortage for the training of neural networks. Recently in image generation area, techniques based on generative adversarial network show potential to generate realistic image data from user input pose data. In this paper, we propose a neural network-based method for 3D human pose dataset augmentation. In our method, we use an autoencoder to learn a latent representation of existing pose data and to produce new poses of similar style. Human pose data generated by autoencoder is input into a generative adversarial network to synthesize mask images with an actor performing the same style, which can be transformed to colored images at the end. For the evaluation of the proposed method, we augment it with a small amount of labeled data. The experimental analysis shows that our method can generate more valid labeled data from small labeled data, which can boost the training of pose estimation using neural networks.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126213431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Deep Learning Based Multi-color Space Approach for Pedestrian Attribute Recognition 基于深度学习的多颜色空间行人属性识别方法
Imran N. Junejo
Pedestrian behavior understanding and identification in surveillance scenarios has attraction a tremendous amount of attention over the past many years. An integral part of this problem involves identifying various human visual attributes in the scene. Over the years, researcher have proposed various solutions and have explored various features. However, they have focused on either engineered features or simple RGB images. In this paper, we explore the problem of crowd at- tribute recognition using RGB (Red, Green, Blue), HSV (Hue, Saturation, Value) and L*a*b* color models and propose a 3-branch Siamese network to solve the problem. We present a unique approach of using these three color models and fine- tune a pre-trained VGG-19 network for our task. We perform extensive experimentation on the most challenging public PETA dataset, which is by far the largest and the most diverse dataset of its kind. We show an improvement over the state of the art work.
在过去的许多年里,监控场景中行人行为的理解和识别吸引了大量的关注。这个问题的一个组成部分涉及识别场景中各种人类视觉属性。多年来,研究人员提出了各种解决方案,并探索了各种特征。然而,他们关注的要么是工程特征,要么是简单的RGB图像。本文利用RGB (Red, Green, Blue), HSV (Hue, Saturation, Value)和L*a*b*颜色模型探讨了人群致敬识别问题,并提出了一个3分支的Siamese网络来解决这个问题。我们提出了一种独特的方法,使用这三种颜色模型和微调预训练的VGG-19网络为我们的任务。我们在最具挑战性的公共PETA数据集上进行了广泛的实验,这是迄今为止同类数据集中最大和最多样化的数据集。我们的作品比目前的水平有所提高。
{"title":"A Deep Learning Based Multi-color Space Approach for Pedestrian Attribute Recognition","authors":"Imran N. Junejo","doi":"10.1145/3338472.3338493","DOIUrl":"https://doi.org/10.1145/3338472.3338493","url":null,"abstract":"Pedestrian behavior understanding and identification in surveillance scenarios has attraction a tremendous amount of attention over the past many years. An integral part of this problem involves identifying various human visual attributes in the scene. Over the years, researcher have proposed various solutions and have explored various features. However, they have focused on either engineered features or simple RGB images. In this paper, we explore the problem of crowd at- tribute recognition using RGB (Red, Green, Blue), HSV (Hue, Saturation, Value) and L*a*b* color models and propose a 3-branch Siamese network to solve the problem. We present a unique approach of using these three color models and fine- tune a pre-trained VGG-19 network for our task. We perform extensive experimentation on the most challenging public PETA dataset, which is by far the largest and the most diverse dataset of its kind. We show an improvement over the state of the art work.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131466026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Three-Dimensional Shape Information Acquisition Using Binocular Stereo Vision for Reflective Steel Plate 基于双目立体视觉的反射钢板三维形状信息获取
Xin Wen, Kechen Song
Three-dimensional (3-D) shape information acquisition for reflective steel plate is a very difficult challenge. In this work, a measurement framework based on three-step-phase-shifting and binocular stereo vision is proposed to obtain 3D shape measurement for reflective steel plate. In the proposed measurement framework, three-step-phase-shifting method is adopted to obtain the phase. Then, 3D points can be obtained by binocular stereo vision method. To verify the proposed measurement framework, a 3D measurement hardware system is developed, which consists of DLP Lightcrafter 4500 and two USB cameras. For reflective steel plate sample object, experimental results confirmed that the proposed framework could be applied to obtain 3D shape information of the reflective steel plate.
反射钢板的三维形状信息采集是一个非常困难的挑战。本文提出了一种基于三步移相和双目立体视觉的测量框架,实现了反射钢板的三维形状测量。在提出的测量框架中,采用三阶移相法获取相位。然后,利用双目立体视觉方法获得三维点。为了验证所提出的测量框架,开发了一个三维测量硬件系统,该系统由DLP Lightcrafter 4500和两个USB摄像机组成。对于反射钢板样本对象,实验结果证实了所提出的框架可以用于获取反射钢板的三维形状信息。
{"title":"Three-Dimensional Shape Information Acquisition Using Binocular Stereo Vision for Reflective Steel Plate","authors":"Xin Wen, Kechen Song","doi":"10.1145/3338472.3338491","DOIUrl":"https://doi.org/10.1145/3338472.3338491","url":null,"abstract":"Three-dimensional (3-D) shape information acquisition for reflective steel plate is a very difficult challenge. In this work, a measurement framework based on three-step-phase-shifting and binocular stereo vision is proposed to obtain 3D shape measurement for reflective steel plate. In the proposed measurement framework, three-step-phase-shifting method is adopted to obtain the phase. Then, 3D points can be obtained by binocular stereo vision method. To verify the proposed measurement framework, a 3D measurement hardware system is developed, which consists of DLP Lightcrafter 4500 and two USB cameras. For reflective steel plate sample object, experimental results confirmed that the proposed framework could be applied to obtain 3D shape information of the reflective steel plate.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132938087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Atrial Fibrillation Detection Based on the Combination of Depth and Statistical Features of ECG 基于心电深度特征与统计特征相结合的房颤检测
Mingchun Li, Gary He, Baofeng Zhu
Atrial fibrillation is a kind of common chronic arrhythmia. The incidence of atrial fibrillation increases with aging. Therefore, especially for the elderly, accurate detection of atrial fibrillation can effectively prevent stroke. In this paper, we propose a strategy that combines the heartbeat model based on deep learning with statistical heart rate features, using a classifier such as a multi-layer perceptron to identify atrial fibrillation rhythm. It is worth noticing that the heartbeat model that we used to extract features for the classification of heartbeat. Through this transfer learning method, the features of each heartbeat in the heart rhythm are extracted one by one for the identification task of atrial fibrillation. We evaluated the proposed method on the MIT-BIH AF dataset. The experimental result shows that under the attention mechanism, the accuracy of the proposed method is 98.91%, the sensitivity is 99.41% and the specificity is 98.50%, which outperforms most of the current algorithms.
心房颤动是一种常见的慢性心律失常。心房颤动的发病率随着年龄的增长而增加。因此,尤其对于老年人,准确检测房颤可以有效预防脑卒中。在本文中,我们提出了一种将基于深度学习的心跳模型与统计心率特征相结合的策略,使用多层感知器等分类器来识别心房颤动节律。值得注意的是,我们使用心跳模型提取心跳分类的特征。通过这种迁移学习方法,逐个提取心律中每一次心跳的特征,完成心房颤动的识别任务。我们在MIT-BIH AF数据集上评估了所提出的方法。实验结果表明,在注意机制下,该方法的准确率为98.91%,灵敏度为99.41%,特异性为98.50%,优于目前大多数算法。
{"title":"Atrial Fibrillation Detection Based on the Combination of Depth and Statistical Features of ECG","authors":"Mingchun Li, Gary He, Baofeng Zhu","doi":"10.1145/3338472.3338485","DOIUrl":"https://doi.org/10.1145/3338472.3338485","url":null,"abstract":"Atrial fibrillation is a kind of common chronic arrhythmia. The incidence of atrial fibrillation increases with aging. Therefore, especially for the elderly, accurate detection of atrial fibrillation can effectively prevent stroke. In this paper, we propose a strategy that combines the heartbeat model based on deep learning with statistical heart rate features, using a classifier such as a multi-layer perceptron to identify atrial fibrillation rhythm. It is worth noticing that the heartbeat model that we used to extract features for the classification of heartbeat. Through this transfer learning method, the features of each heartbeat in the heart rhythm are extracted one by one for the identification task of atrial fibrillation. We evaluated the proposed method on the MIT-BIH AF dataset. The experimental result shows that under the attention mechanism, the accuracy of the proposed method is 98.91%, the sensitivity is 99.41% and the specificity is 98.50%, which outperforms most of the current algorithms.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121036458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Medicine Recognition from Colors and Text 从颜色和文字进行医学识别
Tanjina Piash, Zakir Hossan, Ashraful Amin
People of hazy or blurred vision or the elderly people finds it way too challenging just to identify the pills if they are out of the box or packet. And various pills of various shapes, size, texture, color comes with a diverse set of medicinal components. It creates confusion among pills of same color and shape to identify based on a specific texture. For visually impaired people, even if they configure the shape of the pill, the color information and the texts imprinted on the pill remains unknown to them. In this paper, the splitting processes of a dataset according to the number of colors and the texts imprinted on the pills, will be described. Initially the color information were extracted by segmenting pill region from pill image and then some statistical measurements i.e. Kurtosis and skewness, are calculated for probability distributions generated from the image histograms. Thus figuring out the how many colors the pill surface consists of. For the text recognition, the probable text region is detected for an error free text detection. For high quality image data, the reference images from NLM RxIMAGE database has been utilized. The overall accuracy of the proposed system for number of color determination is 95.6% and text recognition accuracy is 81.32%.
视力模糊的人或老年人发现,仅仅是识别盒子或包装外的药片就太有挑战性了。各种形状、大小、质地、颜色的药丸都有不同的药用成分。它会在相同颜色和形状的药丸之间制造混淆,从而根据特定的纹理进行识别。对于视力受损的人来说,即使他们设置了药丸的形状,药丸上的颜色信息和文字仍然是他们所不知道的。在本文中,将描述根据颜色数量和药丸上的文本标记对数据集进行分割的过程。首先通过分割药丸图像中的药丸区域提取颜色信息,然后对图像直方图生成的概率分布计算峰度和偏度等统计度量。从而计算出药丸表面由多少种颜色组成。对于文本识别,检测可能的文本区域以进行无错误文本检测。为了获得高质量的图像数据,我们利用了NLM RxIMAGE数据库中的参考图像。该系统对颜色数量确定的总体准确率为95.6%,对文本的识别准确率为81.32%。
{"title":"Medicine Recognition from Colors and Text","authors":"Tanjina Piash, Zakir Hossan, Ashraful Amin","doi":"10.1145/3338472.3338484","DOIUrl":"https://doi.org/10.1145/3338472.3338484","url":null,"abstract":"People of hazy or blurred vision or the elderly people finds it way too challenging just to identify the pills if they are out of the box or packet. And various pills of various shapes, size, texture, color comes with a diverse set of medicinal components. It creates confusion among pills of same color and shape to identify based on a specific texture. For visually impaired people, even if they configure the shape of the pill, the color information and the texts imprinted on the pill remains unknown to them. In this paper, the splitting processes of a dataset according to the number of colors and the texts imprinted on the pills, will be described. Initially the color information were extracted by segmenting pill region from pill image and then some statistical measurements i.e. Kurtosis and skewness, are calculated for probability distributions generated from the image histograms. Thus figuring out the how many colors the pill surface consists of. For the text recognition, the probable text region is detected for an error free text detection. For high quality image data, the reference images from NLM RxIMAGE database has been utilized. The overall accuracy of the proposed system for number of color determination is 95.6% and text recognition accuracy is 81.32%.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129456828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Iterative Broad Null Steering 迭代宽零操舵
Burak Ors, R. Suleesathira
Forming a broad null around the direction of interferences and keeping the main beam toward the desired direction is desirable to suppress interferences in the multipath environment. In this paper, a null broadening beamforming is proposed to enhance the iterative optimal beamformer. Placing broaden null is achieved through the projection and diagonal loading approach. MUltiple SIgnal Classification (MUSIC) is used to estimate the directions of arrival and angular spread. The null width is the estimated angular spread. A criteria based on the convergence of the output SINR is proposed to terminate the iteration. Simulation results are presented for illustration that the proposed iterative broad null beamformer is capable of steering broad nulls to interference signals, consequently, increasing the output signal to interference plus noise ration (SINR) and decreasing the number of iterations.
在多径环境中,在干扰方向周围形成宽零并保持主波束朝向期望方向是抑制干扰的理想方法。本文提出了一种零展宽波束形成方法来增强迭代优化波束形成器。通过投影和对角线加载方法实现了宽零的放置。多信号分类(MUSIC)用于估计到达方向和角传播。空宽度是估计的角扩展。提出了基于输出信噪比收敛性的迭代终止准则。仿真结果表明,所提出的迭代宽频零波束形成器能够将宽频零转向干扰信号,从而提高了输出信号的干扰加噪比(SINR),减少了迭代次数。
{"title":"Iterative Broad Null Steering","authors":"Burak Ors, R. Suleesathira","doi":"10.1145/3338472.3338487","DOIUrl":"https://doi.org/10.1145/3338472.3338487","url":null,"abstract":"Forming a broad null around the direction of interferences and keeping the main beam toward the desired direction is desirable to suppress interferences in the multipath environment. In this paper, a null broadening beamforming is proposed to enhance the iterative optimal beamformer. Placing broaden null is achieved through the projection and diagonal loading approach. MUltiple SIgnal Classification (MUSIC) is used to estimate the directions of arrival and angular spread. The null width is the estimated angular spread. A criteria based on the convergence of the output SINR is proposed to terminate the iteration. Simulation results are presented for illustration that the proposed iterative broad null beamformer is capable of steering broad nulls to interference signals, consequently, increasing the output signal to interference plus noise ration (SINR) and decreasing the number of iterations.","PeriodicalId":142573,"journal":{"name":"Proceedings of the 3rd International Conference on Graphics and Signal Processing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117026683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 3rd International Conference on Graphics and Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1