首页 > 最新文献

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)最新文献

英文 中文
Significance of contrast and structure features for an improved color image classification system 对比度和结构特征对改进彩色图像分类系统的意义
V. Sowmya, D. Govind, K. Soman
In general, the three main modules of color image classification systems are: color-to-grayscale image conversion, feature extraction and classification. The color-to-grayscale image conversion is the important pre-processing step which must incorporate the significant and discriminative contrast and structure information in the converted grayscale images as in the original color image. All the existing techniques for color-to-grayscale image conversion preserves the significant contrast and structure information in the converted grayscale images in different manners. Hence, the present work is to analyze the significant and discriminative contrast and structure information preserved in the converted grayscale images using two different decolorization techniques called rgb2gray and singular value decomposition based color-to-grayscale image conversion (SVD) applied in the color image classification systems using the three different proposed features. The three different features for color image classification systems are proposed based on the combination of the existing dense SIFT features and the contrast & structure content computed using color-to-gray structure similarity index (C2G-SSIM) metric.
一般来说,彩色图像分类系统的三个主要模块是:彩色到灰度图像转换、特征提取和分类。彩色图像到灰度图像的转换是重要的预处理步骤,它必须将转换后的灰度图像中具有显著性和判别性的对比度和结构信息与原彩色图像中的信息相结合。现有的彩色-灰度图像转换技术都以不同的方式保留了转换后的灰度图像中重要的对比度和结构信息。因此,本文的工作是利用rgb2gray和基于奇异值分解的彩色到灰度图像转换(SVD)两种不同的脱色技术,利用这三种不同的特征,分析转换后的灰度图像中保留的显著性和判别性对比度和结构信息。结合现有的密集SIFT特征和利用色灰结构相似指数(C2G-SSIM)度量计算的对比度和结构含量,提出了用于彩色图像分类系统的三种不同特征。
{"title":"Significance of contrast and structure features for an improved color image classification system","authors":"V. Sowmya, D. Govind, K. Soman","doi":"10.1109/ICSIPA.2017.8120608","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120608","url":null,"abstract":"In general, the three main modules of color image classification systems are: color-to-grayscale image conversion, feature extraction and classification. The color-to-grayscale image conversion is the important pre-processing step which must incorporate the significant and discriminative contrast and structure information in the converted grayscale images as in the original color image. All the existing techniques for color-to-grayscale image conversion preserves the significant contrast and structure information in the converted grayscale images in different manners. Hence, the present work is to analyze the significant and discriminative contrast and structure information preserved in the converted grayscale images using two different decolorization techniques called rgb2gray and singular value decomposition based color-to-grayscale image conversion (SVD) applied in the color image classification systems using the three different proposed features. The three different features for color image classification systems are proposed based on the combination of the existing dense SIFT features and the contrast & structure content computed using color-to-gray structure similarity index (C2G-SSIM) metric.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127155850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Adaptive feature learning CNN for behavior recognition in crowd scene 基于自适应特征学习CNN的人群场景行为识别
Aliyu Nuhu Shuaibu, A. Malik, I. Faye
Learning and recognizing 3-dimension (3D) adaptive features are important for crowd scene understanding in video surveillance research. Deep learning architectures such as Convolutional Neural Networks (CNN) have recently shown much success in various computer vision applications. Existing approaches such as hand-crafted method and 2D-CNN architectures are widely used in adaptive feature representations on image data. However, learning dynamic and temporal features in 3D scale features in videos remains an open problem. In this study, we proposed a novel technique 3D-scale Convolutional Neural Network (3DS-CNN), based on the decomposition of 3D feature maps into 2D spatio and 2D temporal feature representations. Extensive experiments on hundreds of video scene were demonstrated on publicly available crowd datasets. Quantitative and qualitative evaluations indicate that the proposed model display superior performance when compared to baseline approaches. The mean average precision of 95.30% was recorded on WWW crowd dataset.
在视频监控研究中,学习和识别三维自适应特征对人群场景的理解具有重要意义。卷积神经网络(CNN)等深度学习架构最近在各种计算机视觉应用中取得了很大成功。现有的方法如手工方法和2D-CNN架构被广泛用于图像数据的自适应特征表示。然而,在视频的三维尺度特征中学习动态和时间特征仍然是一个悬而未决的问题。在这项研究中,我们提出了一种基于3D特征映射分解为二维空间和二维时间特征表示的3D尺度卷积神经网络(3DS-CNN)新技术。对数百个视频场景进行了广泛的实验,并在公开可用的人群数据集上进行了演示。定量和定性评价表明,与基线方法相比,所提出的模型显示出优越的性能。WWW人群数据集的平均准确率为95.30%。
{"title":"Adaptive feature learning CNN for behavior recognition in crowd scene","authors":"Aliyu Nuhu Shuaibu, A. Malik, I. Faye","doi":"10.1109/ICSIPA.2017.8120636","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120636","url":null,"abstract":"Learning and recognizing 3-dimension (3D) adaptive features are important for crowd scene understanding in video surveillance research. Deep learning architectures such as Convolutional Neural Networks (CNN) have recently shown much success in various computer vision applications. Existing approaches such as hand-crafted method and 2D-CNN architectures are widely used in adaptive feature representations on image data. However, learning dynamic and temporal features in 3D scale features in videos remains an open problem. In this study, we proposed a novel technique 3D-scale Convolutional Neural Network (3DS-CNN), based on the decomposition of 3D feature maps into 2D spatio and 2D temporal feature representations. Extensive experiments on hundreds of video scene were demonstrated on publicly available crowd datasets. Quantitative and qualitative evaluations indicate that the proposed model display superior performance when compared to baseline approaches. The mean average precision of 95.30% was recorded on WWW crowd dataset.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130694361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Frame based approach for automatic event boundary detection of soccer video using optical flow 基于帧的足球视频光流事件边界自动检测方法
Devang S. Pandya, M. Zaveri
Due to rapid growth of digital contents, there has been an increasing demand of summarized videos to save time and other network resources. Automated soccer video analysis is a challenging due to involvement of more actors and rapid movements of players and camera. It is necessary first to detect various events of the video precisely before analyzing and labeling them. This paper proposes frame based approach for the automatic demarcation of events of the soccer video. However this task is very challenging due to variety of soccer leagues, various illumination and ground conditions. To overcome such issues we propose method which is invariant to such conditions and can successfully demarcate the soccer events. We exploit optical flow techniques to measure the motion. We introduce change in optical flow to extract the behavior of an event over the video span. Later, adaptive threshold is computed based on change in optical flow. We conducted number of simulations with variety of videos to validate the method. Proposed method achieves nearly 90% of accuracy and found robust in spite of illumination variation.
由于数字内容的快速增长,人们对视频摘要的需求越来越大,以节省时间和其他网络资源。自动化足球视频分析是一个具有挑战性的,因为涉及更多的演员和快速移动的球员和摄像机。首先要对视频中的各种事件进行准确的检测,然后再对其进行分析和标记。提出了一种基于帧的足球视频事件自动标定方法。然而,由于各种足球联赛,各种照明和地面条件,这项任务非常具有挑战性。为了克服这些问题,我们提出了一种不受这些条件影响的方法,可以成功地划分足球项目。我们利用光流技术来测量运动。我们引入光流的变化来提取事件在视频跨度上的行为。然后根据光流的变化计算自适应阈值。我们用各种视频进行了大量的模拟来验证该方法。该方法的准确率接近90%,且在光照变化情况下具有鲁棒性。
{"title":"Frame based approach for automatic event boundary detection of soccer video using optical flow","authors":"Devang S. Pandya, M. Zaveri","doi":"10.1109/ICSIPA.2017.8120644","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120644","url":null,"abstract":"Due to rapid growth of digital contents, there has been an increasing demand of summarized videos to save time and other network resources. Automated soccer video analysis is a challenging due to involvement of more actors and rapid movements of players and camera. It is necessary first to detect various events of the video precisely before analyzing and labeling them. This paper proposes frame based approach for the automatic demarcation of events of the soccer video. However this task is very challenging due to variety of soccer leagues, various illumination and ground conditions. To overcome such issues we propose method which is invariant to such conditions and can successfully demarcate the soccer events. We exploit optical flow techniques to measure the motion. We introduce change in optical flow to extract the behavior of an event over the video span. Later, adaptive threshold is computed based on change in optical flow. We conducted number of simulations with variety of videos to validate the method. Proposed method achieves nearly 90% of accuracy and found robust in spite of illumination variation.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126018869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Transfer learning for Diabetic Macular Edema (DME) detection on Optical Coherence Tomography (OCT) images 光学相干断层扫描(OCT)图像上糖尿病黄斑水肿(DME)检测的迁移学习
G. Chan, M. Awais, S. A. A. Shah, T. Tang, Cheng-Kai Lu, F. Mériaudeau
Diabetic Macular Edema (DME) is a common eye disease that causes irreversible vision loss for diabetic patients, if left untreated. Thus, early diagnosis of DME could help in early treatment and prevent blindness. This paper aims to create a framework based on deep learning for DME recognition on Spectral Domain Optical Coherence Tomography (SD-OCT) images through transfer learning. First, images are pre-processed: denoised using Block-Matching and 3-Dimension (BM3D) filtering and cropped through image boundary extraction. Later, features are extracted using CNN of AlexNet and finally images are classified using SVM classifier. The results are evaluated using 8-fold cross-validation. The experiments show that denoised and cropped images lead to better classification performances, exceeding previous other recent published works of 96% accuracy.
糖尿病性黄斑水肿(DME)是一种常见的眼病,如果不及时治疗,会导致糖尿病患者不可逆的视力丧失。因此,DME的早期诊断有助于早期治疗和预防失明。本文旨在通过迁移学习,建立一个基于深度学习的框架,用于谱域光学相干断层扫描(SD-OCT)图像的DME识别。首先,对图像进行预处理:使用块匹配和三维(BM3D)滤波去噪,通过图像边界提取裁剪。然后使用AlexNet的CNN提取特征,最后使用SVM分类器对图像进行分类。使用8倍交叉验证对结果进行评估。实验表明,去噪和裁剪后的图像具有更好的分类性能,超过了最近发表的其他96%的准确率。
{"title":"Transfer learning for Diabetic Macular Edema (DME) detection on Optical Coherence Tomography (OCT) images","authors":"G. Chan, M. Awais, S. A. A. Shah, T. Tang, Cheng-Kai Lu, F. Mériaudeau","doi":"10.1109/ICSIPA.2017.8120662","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120662","url":null,"abstract":"Diabetic Macular Edema (DME) is a common eye disease that causes irreversible vision loss for diabetic patients, if left untreated. Thus, early diagnosis of DME could help in early treatment and prevent blindness. This paper aims to create a framework based on deep learning for DME recognition on Spectral Domain Optical Coherence Tomography (SD-OCT) images through transfer learning. First, images are pre-processed: denoised using Block-Matching and 3-Dimension (BM3D) filtering and cropped through image boundary extraction. Later, features are extracted using CNN of AlexNet and finally images are classified using SVM classifier. The results are evaluated using 8-fold cross-validation. The experiments show that denoised and cropped images lead to better classification performances, exceeding previous other recent published works of 96% accuracy.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115865626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Convolutional constellation mapping technique for PAPR reduction in filter bank multi-carrier 滤波器组多载波中减小PAPR的卷积星座映射技术
A. Eltholth, M. Gaber, M. Fouda, H. Mansour
Filter Bank Multi-Carrier (FBMC), has been proposed to solve the problems of spectral efficiency and adjacent channel leakage power in OFDM. This solution has added implications for the Peak-to-Average Power Ratio (PAPR) problem. This paper aims to reduce PAPR in FBMC through convolutional constellation mapping. The idea is to give a different constellation point to the symbol if successive values of the same symbol occur in the state space, thus reducing the chances of coherent addition of subcarriers without increasing the constellation points or even the need for iterative mapping. Computer-based simulations show the superior performance of the proposed Convolutional Constellation Mapping (CCM) in reducing the PAPR of FBMC.
为了解决OFDM中频谱效率和相邻信道泄漏功率的问题,提出了滤波器组多载波(FBMC)。该解决方案增加了峰值与平均功率比(PAPR)问题的含义。本文旨在通过卷积星座映射来降低FBMC的PAPR。其思想是,如果在状态空间中出现相同符号的连续值,则给符号一个不同的星座点,从而减少子载波相干添加的机会,而不增加星座点甚至不需要迭代映射。计算机仿真结果表明,所提出的卷积星座映射(CCM)在降低FBMC的PAPR方面具有优异的性能。
{"title":"Convolutional constellation mapping technique for PAPR reduction in filter bank multi-carrier","authors":"A. Eltholth, M. Gaber, M. Fouda, H. Mansour","doi":"10.1109/ICSIPA.2017.8120656","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120656","url":null,"abstract":"Filter Bank Multi-Carrier (FBMC), has been proposed to solve the problems of spectral efficiency and adjacent channel leakage power in OFDM. This solution has added implications for the Peak-to-Average Power Ratio (PAPR) problem. This paper aims to reduce PAPR in FBMC through convolutional constellation mapping. The idea is to give a different constellation point to the symbol if successive values of the same symbol occur in the state space, thus reducing the chances of coherent addition of subcarriers without increasing the constellation points or even the need for iterative mapping. Computer-based simulations show the superior performance of the proposed Convolutional Constellation Mapping (CCM) in reducing the PAPR of FBMC.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133929816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New pilot allocation design schemes for sparse channel estimation in OFDM system OFDM系统中稀疏信道估计的导频分配设计新方案
Anthony Ngozichukwuka Uwaechia, N. Mahyuddin
In this paper, the problem of the deterministic pilot allocation for sparse channel estimation in Orthogonal Frequency Division Multiplexing (OFDM) system is investigated. This method is based on mutual coherence minimization of the measurement matrix associated with the OFDM system pilot subcarriers. It is known that if the set of pilot pattern is a Cyclic Difference Set (CDS), the mutual coherence of the measurement matrix is minimized. However, CDS in most practical OFDM system is not available. Few research efforts have tackled the problem of pilot allocation by proposing methods that lead to suboptimal solutions in order to ignore the computationally complex exhaustive search method. This contribution, however, proposes two pilot allocation design schemes for the construction of deterministic partial Fourier matrices satisfying the Restricted Isometry Property (RIP) namely, the Generic Random Search (GRS) and Progressive Search (PS) based on bounding the mutual coherence between different columns of the measurement matrix. Simulation results show that the two proposed pilot allocation design schemes are effective and offer a better channel estimation performance in terms of MSE when compared to former pilot allocation design methods.
研究了正交频分复用(OFDM)系统稀疏信道估计中导频分配的确定性问题。该方法基于与OFDM系统导频子载波相关的测量矩阵的相互相干最小化。已知如果导频模式集是一个循环差分集(CDS),则测量矩阵的相互相干性最小。然而,在大多数实际的OFDM系统中,CDS是不可用的。很少有研究通过提出导致次优解的方法来解决试点分配问题,以忽略计算复杂的穷举搜索方法。然而,这一贡献提出了两种用于构建满足限制等距性质(RIP)的确定性部分傅立叶矩阵的试点分配设计方案,即基于测量矩阵不同列之间相互相干性的通用随机搜索(GRS)和渐进搜索(PS)。仿真结果表明,与以往的导频分配设计方法相比,所提出的两种导频分配设计方案都是有效的,并且在MSE方面具有更好的信道估计性能。
{"title":"New pilot allocation design schemes for sparse channel estimation in OFDM system","authors":"Anthony Ngozichukwuka Uwaechia, N. Mahyuddin","doi":"10.1109/ICSIPA.2017.8120581","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120581","url":null,"abstract":"In this paper, the problem of the deterministic pilot allocation for sparse channel estimation in Orthogonal Frequency Division Multiplexing (OFDM) system is investigated. This method is based on mutual coherence minimization of the measurement matrix associated with the OFDM system pilot subcarriers. It is known that if the set of pilot pattern is a Cyclic Difference Set (CDS), the mutual coherence of the measurement matrix is minimized. However, CDS in most practical OFDM system is not available. Few research efforts have tackled the problem of pilot allocation by proposing methods that lead to suboptimal solutions in order to ignore the computationally complex exhaustive search method. This contribution, however, proposes two pilot allocation design schemes for the construction of deterministic partial Fourier matrices satisfying the Restricted Isometry Property (RIP) namely, the Generic Random Search (GRS) and Progressive Search (PS) based on bounding the mutual coherence between different columns of the measurement matrix. Simulation results show that the two proposed pilot allocation design schemes are effective and offer a better channel estimation performance in terms of MSE when compared to former pilot allocation design methods.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134276449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
On vehicle state tracking for long-term carpark video surveillance 基于车辆状态跟踪的停车场长期视频监控
R. Lim, Clarence Weihan Cheong, John See, I. Tan, L. Wong, Huai-Qian Khor
Car park video surveillance systems present a huge volume of data that can be beneficial for video analytics and data analysis. We present a vehicle state tracking method for long term video surveillance with the goal of obtaining trajectories and vehicle states of various car park users. However, this is a challenging task in outdoor scenarios due to non-optimal camera viewing angle compounded by ever-changing illumination & weather conditions. To address these challenges, we propose a parking state machine that tracks the vehicle state in a large outdoor car park area. The proposed method was tested on 10 hours of continuous video data with various illumination and environmental conditions. Owing to the imbalanced distribution of parking states, we report the precision, recall and F1 scores to determine the overall performance of the system. Our approach proves to be fairly accurate, fast and robust against severe scene variations.
停车场视频监控系统提供了大量的数据,这些数据对视频分析和数据分析是有益的。我们提出了一种用于长期视频监控的车辆状态跟踪方法,目的是获取各种停车场用户的轨迹和车辆状态。然而,这在户外场景中是一项具有挑战性的任务,因为非最佳的相机视角加上不断变化的照明和天气条件。为了解决这些挑战,我们提出了一个停车状态机,它可以跟踪大型室外停车场的车辆状态。在不同光照和环境条件下对连续10小时的视频数据进行了测试。由于停车状态的不平衡分布,我们报告了精度,召回率和F1分数来确定系统的整体性能。我们的方法被证明是相当准确,快速和强大的严重的场景变化。
{"title":"On vehicle state tracking for long-term carpark video surveillance","authors":"R. Lim, Clarence Weihan Cheong, John See, I. Tan, L. Wong, Huai-Qian Khor","doi":"10.1109/ICSIPA.2017.8120638","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120638","url":null,"abstract":"Car park video surveillance systems present a huge volume of data that can be beneficial for video analytics and data analysis. We present a vehicle state tracking method for long term video surveillance with the goal of obtaining trajectories and vehicle states of various car park users. However, this is a challenging task in outdoor scenarios due to non-optimal camera viewing angle compounded by ever-changing illumination & weather conditions. To address these challenges, we propose a parking state machine that tracks the vehicle state in a large outdoor car park area. The proposed method was tested on 10 hours of continuous video data with various illumination and environmental conditions. Owing to the imbalanced distribution of parking states, we report the precision, recall and F1 scores to determine the overall performance of the system. Our approach proves to be fairly accurate, fast and robust against severe scene variations.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115832987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Deep-learning: A potential method for tuberculosis detection using chest radiography 深度学习:一种利用胸部x线摄影检测结核病的潜在方法
Rahul Hooda, S. Sofat, Simranpreet Kaur, Ajay Mittal, F. Mériaudeau
Tuberculosis (TB) is a major health threat in the developing countries. Many patients die every year due to lack of treatment and error in diagnosis. Developing a computer-aided diagnosis (CAD) system for TB detection can help in early diagnosis and containing the disease. Most of the current CAD systems use handcrafted features, however, lately there is a shift towards deep-learning-based automatic feature extractors. In this paper, we present a potential method for tuberculosis detection using deep-learning which classifies CXR images into two categories, that is, normal and abnormal. We have used CNN architecture with 7 convolutional layers and 3 fully connected layers. The performance of three different optimizers has been compared. Out of these, Adam optimizer with an overall accuracy of 94.73% and validation accuracy of 82.09% performed best amongst them. All the results are obtained using Montgomery and Shenzhen datasets which are available in public domain.
结核病是发展中国家的一个主要健康威胁。由于缺乏治疗和诊断错误,每年都有许多患者死亡。开发用于结核病检测的计算机辅助诊断(CAD)系统可以帮助早期诊断和控制疾病。目前大多数CAD系统都使用手工制作的特征,然而,最近有一种转向基于深度学习的自动特征提取器。在本文中,我们提出了一种利用深度学习将CXR图像分为正常和异常两类的潜在结核病检测方法。我们使用了具有7个卷积层和3个全连接层的CNN架构。比较了三种不同优化器的性能。其中,Adam优化器的总体准确率为94.73%,验证准确率为82.09%。所有结果均使用Montgomery和Shenzhen数据集获得,这些数据集可在公共领域获得。
{"title":"Deep-learning: A potential method for tuberculosis detection using chest radiography","authors":"Rahul Hooda, S. Sofat, Simranpreet Kaur, Ajay Mittal, F. Mériaudeau","doi":"10.1109/ICSIPA.2017.8120663","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120663","url":null,"abstract":"Tuberculosis (TB) is a major health threat in the developing countries. Many patients die every year due to lack of treatment and error in diagnosis. Developing a computer-aided diagnosis (CAD) system for TB detection can help in early diagnosis and containing the disease. Most of the current CAD systems use handcrafted features, however, lately there is a shift towards deep-learning-based automatic feature extractors. In this paper, we present a potential method for tuberculosis detection using deep-learning which classifies CXR images into two categories, that is, normal and abnormal. We have used CNN architecture with 7 convolutional layers and 3 fully connected layers. The performance of three different optimizers has been compared. Out of these, Adam optimizer with an overall accuracy of 94.73% and validation accuracy of 82.09% performed best amongst them. All the results are obtained using Montgomery and Shenzhen datasets which are available in public domain.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134080171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 70
Optimal feature subset selection for fuzzy extreme learning machine using genetic algorithm with multilevel parameter optimization 基于多级参数优化遗传算法的模糊极值学习机特征子集优选
A. Kale, S. Sonavane
The crucial objective of this paper is to design a hybrid model of the genetic algorithm for fuzzy extreme learning machine classifier (GA-FELM), which selects an optimal feature subset by using the multilevel parameter optimization technique. Feature subset selection is an important task in pattern classification and knowledge discovery problems. The generalization performance of the system is not only depending on optimal features but also dependent upon the classifier (learning algorithm). Therefore, it is an important task to select a fast and efficient classifier. Research efforts have affirmed that extreme learning machine (ELM) has superior and accurate classification ability. However, ELM is failed to handle the uncertain data. One of the alternative solutions is fuzzy-ELM, which combines the advantages of fuzzy logic and ELM. GA-FELM is able to handle curse of dimensionality problem, optimization problem and weighted classification problem with maximizing classification accuracy by minimizing the number of features. In order to validate the efficiency of GA-FELM, the comparative performance is evaluated by using three different approaches viz. 1. ELM and GA-ELM 2. GA-ELM and GA-FELM 3. GA-FELM and GA-existing classifier. The result analysis shows that classification accuracy is improved with 9% while reducing 62% features.
本文的关键目标是设计一种混合遗传算法的模糊极限学习机分类器(GA-FELM)模型,该模型通过多级参数优化技术选择最优特征子集。特征子集选择是模式分类和知识发现问题中的一项重要任务。系统的泛化性能不仅取决于最优特征,还取决于分类器(学习算法)。因此,选择一种快速高效的分类器是一项重要的任务。研究证实,极限学习机(extreme learning machine, ELM)具有优越、准确的分类能力。然而,ELM无法处理不确定数据。其中一种替代方案是模糊ELM,它结合了模糊逻辑和ELM的优点。GA-FELM能够通过最小化特征数来实现分类精度最大化,从而解决维数问题、优化问题和加权分类问题。为了验证GA-FELM的效率,使用三种不同的方法来评估比较性能,即:1。ELM和GA-ELM 2。GA-ELM和GA-FELMGA-FELM和GA-existing分类器。结果分析表明,分类准确率提高9%,特征减少62%。
{"title":"Optimal feature subset selection for fuzzy extreme learning machine using genetic algorithm with multilevel parameter optimization","authors":"A. Kale, S. Sonavane","doi":"10.1109/ICSIPA.2017.8120652","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120652","url":null,"abstract":"The crucial objective of this paper is to design a hybrid model of the genetic algorithm for fuzzy extreme learning machine classifier (GA-FELM), which selects an optimal feature subset by using the multilevel parameter optimization technique. Feature subset selection is an important task in pattern classification and knowledge discovery problems. The generalization performance of the system is not only depending on optimal features but also dependent upon the classifier (learning algorithm). Therefore, it is an important task to select a fast and efficient classifier. Research efforts have affirmed that extreme learning machine (ELM) has superior and accurate classification ability. However, ELM is failed to handle the uncertain data. One of the alternative solutions is fuzzy-ELM, which combines the advantages of fuzzy logic and ELM. GA-FELM is able to handle curse of dimensionality problem, optimization problem and weighted classification problem with maximizing classification accuracy by minimizing the number of features. In order to validate the efficiency of GA-FELM, the comparative performance is evaluated by using three different approaches viz. 1. ELM and GA-ELM 2. GA-ELM and GA-FELM 3. GA-FELM and GA-existing classifier. The result analysis shows that classification accuracy is improved with 9% while reducing 62% features.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132384930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mixed emotions in multi view face emotion recognition 多视角人脸情绪识别中的混合情绪
F. Goodarzi, F. Rokhani, M. Saripan, M. Marhaban
The problem of recognizing and discriminating mixed emotions in multi view faces using a web camera is discussed in this paper. Based on the literature, there are mainly seven basic emotions that humans can express and understand. However, in some faces in databases, there are characteristics of two or more of this basic emotions. The two databases of BU3DFE and UPM3DFE were tested for mixed emotion accuracy using the proposed multi view face emotion recognition method. The results show an improvement over existing works in mixed emotions recognition.
本文讨论了基于网络摄像机的多视角人脸混合情绪识别问题。根据文献,人类可以表达和理解的基本情绪主要有七种。然而,在数据库中的一些面孔中,有两种或两种以上这种基本情绪的特征。采用所提出的多视角人脸情绪识别方法,对BU3DFE和UPM3DFE两个数据库的混合情绪识别准确率进行了测试。结果表明,该方法在混合情绪识别方面比现有方法有了很大的改进。
{"title":"Mixed emotions in multi view face emotion recognition","authors":"F. Goodarzi, F. Rokhani, M. Saripan, M. Marhaban","doi":"10.1109/ICSIPA.2017.8120643","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120643","url":null,"abstract":"The problem of recognizing and discriminating mixed emotions in multi view faces using a web camera is discussed in this paper. Based on the literature, there are mainly seven basic emotions that humans can express and understand. However, in some faces in databases, there are characteristics of two or more of this basic emotions. The two databases of BU3DFE and UPM3DFE were tested for mixed emotion accuracy using the proposed multi view face emotion recognition method. The results show an improvement over existing works in mixed emotions recognition.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133710779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1