Multi-view 3D face reconstruction with deep recurrent neural networks
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272733
Pengfei Dou, I. Kakadiaris
Image-based 3D face reconstruction has great potential in areas such as facial recognition, facial analysis, and facial animation. Due to variations in image quality, single-image-based 3D face reconstruction might not be sufficient to accurately reconstruct a 3D face. To overcome this limitation, multi-view 3D face reconstruction uses multiple images of the same subject and aggregates complementary information for better accuracy. Though theoretically appealing, it poses multiple challenges in practice, the most significant being that it is difficult to establish coherent and accurate correspondence among a set of images, especially when they are captured under different conditions. In this paper, we propose a method, Deep Recurrent 3D FAce Reconstruction (DRFAR), to solve the task of multi-view 3D face reconstruction using a subspace representation of the 3D facial shape and a deep recurrent neural network that consists of both a deep convolutional neural network (DCNN) and a recurrent neural network (RNN). The DCNN disentangles the facial identity and facial expression components for each image independently, while the RNN fuses identity-related features from the DCNN and aggregates the identity-specific contextual information, or the identity signal, from the whole set of images to predict the facial identity parameter, which is robust to variations in image quality and consistent over the whole set of images. Through extensive experiments, we evaluate our proposed method and demonstrate its superiority over existing methods.
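The abstract does not include code; below is a minimal PyTorch sketch of the described architecture, assuming a 3DMM-style identity subspace with 199 coefficients and illustrative layer sizes. A shared CNN encodes each view, a GRU aggregates the identity signal across views, and a linear head predicts one identity parameter vector per image set.

```python
# Minimal sketch (not the authors' code) of the DRFAR idea.
import torch
import torch.nn as nn

class DRFARSketch(nn.Module):
    def __init__(self, feat_dim=256, num_identity_coeffs=199):
        super().__init__()
        # Per-view encoder: a small stand-in for the paper's DCNN.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim),
        )
        # RNN fuses identity-related features over the image set.
        self.rnn = nn.GRU(feat_dim, feat_dim, batch_first=True)
        self.identity_head = nn.Linear(feat_dim, num_identity_coeffs)

    def forward(self, views):                      # views: (B, T, 3, H, W)
        b, t = views.shape[:2]
        f = self.encoder(views.flatten(0, 1))      # (B*T, feat_dim)
        f = f.view(b, t, -1)
        _, h = self.rnn(f)                         # h: (1, B, feat_dim)
        return self.identity_head(h[-1])           # one identity vector per set

out = DRFARSketch()(torch.randn(2, 4, 3, 64, 64))  # -> (2, 199)
```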
{"title":"Multi-view 3D face reconstruction with deep recurrent neural networks","authors":"Pengfei Dou, I. Kakadiaris","doi":"10.1109/BTAS.2017.8272733","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272733","url":null,"abstract":"Image-based 3D face reconstruction has great potential in different areas, such as facial recognition, facial analysis, and facial animation. Due to the variations in image quality, single-image-based 3D face reconstruction might not be sufficient to accurately reconstruct a 3D face. To overcome this limitation, multi-view 3D face reconstruction uses multiple images of the same subject and aggregates complementary information for better accuracy. Though theoretically appealing, there are multiple challenges in practice. Among these challenges, the most significant is that it is difficult to establish coherent and accurate correspondence among a set of images, especially when these images are captured in different conditions. In this paper, we propose a method, Deep Recurrent 3D FAce Reconstruction (DRFAR), to solve the task ofmulti-view 3D face reconstruction using a subspace representation of the 3D facial shape and a deep recurrent neural network that consists of both a deep con-volutional neural network (DCNN) and a recurrent neural network (RNN). The DCNN disentangles the facial identity and the facial expression components for each single image independently, while the RNN fuses identity-related features from the DCNN and aggregates the identity specific contextual information, or the identity signal, from the whole set of images to predict the facial identity parameter, which is robust to variations in image quality and is consistent over the whole set of images. Through extensive experiments, we evaluate our proposed method and demonstrate its superiority over existing methods.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126606987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On the guessability of binary biometric templates: A practical guessing entropy based approach
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272719
Guangcan Mai, M. Lim, P. Yuen
A security index for biometric systems is essential because biometrics have been widely adopted as a secure authentication component in critical systems. Most biometric systems secured by template protection schemes are based on binary templates. To adopt popular template protection schemes such as fuzzy commitment and fuzzy extractor, which can be applied to binary templates only, non-binary templates (e.g., real-valued or point-set based) need to be converted to binary. However, existing security measurements for binary-template-based biometric systems either cannot reflect the actual attack difficulty or are too computationally expensive to be practical. This paper presents an acceleration of guessing entropy, which reflects the expected number of guessing trials required to attack a binary-template-based biometric system. The acceleration benefits from computation reuse and pruning. Experimental results on two datasets show that the acceleration achieves more than 6x, 20x, and 200x speedups in different system settings without losing estimation accuracy.
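For reference, guessing entropy itself is straightforward to compute when the template distribution is known: the optimal attacker guesses candidates in decreasing order of probability, and the index of the correct guess is the number of trials. The sketch below illustrates the definition only, not the paper's accelerated estimator.

```python
import numpy as np

def guessing_entropy(probs):
    p = np.sort(np.asarray(probs, dtype=float))[::-1]   # guess most likely first
    ranks = np.arange(1, p.size + 1)                    # trial index of each guess
    return float(np.sum(ranks * p))                     # E[number of trials]

# Example: a skewed 3-bit template distribution is far easier to guess
# than the uniform case, whose guessing entropy is (2^3 + 1) / 2 = 4.5.
print(guessing_entropy([0.5, 0.2, 0.1, 0.05, 0.05, 0.04, 0.03, 0.03]))
print(guessing_entropy([1 / 8] * 8))
```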
{"title":"On the guessability of binary biometric templates: A practical guessing entropy based approach","authors":"Guangcan Mai, M. Lim, P. Yuen","doi":"10.1109/BTAS.2017.8272719","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272719","url":null,"abstract":"A security index for biometric systems is essential because biometrics have been widely adopted as a secure authentication component in critical systems. Most of bio-metric systems secured by template protection schemes are based on binary templates. To adopt popular template protection schemes such as fuzzy commitment and fuzzy extractor that can be applied on binary templates only, non-binary templates (e.g., real-valued, point-set based) need to be converted to binary. However, existing security measurements for binary template based biometric systems either cannot reflect the actual attack difficulties or are too computationally expensive to be practical. This paper presents an acceleration of the guessing entropy which reflects the expected number of guessing trials in attacking the binary template based biometric systems. The acceleration benefits from computation reuse and pruning. Experimental results on two datasets show that the acceleration has more than 6x, 20x, and 200x speed up without losing the estimation accuracy in different system settings.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126350922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Location-sensitive sparse representation of deep normal patterns for expression-robust 3D face recognition
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272703
Huibin Li, Jian Sun, Liming Chen
This paper presents a straightforward yet efficient, expression-robust 3D face recognition approach that explores a location-sensitive sparse representation of deep normal patterns (DNP). In particular, given raw 3D facial surfaces, we first run a 3D face pre-processing pipeline, including nose tip detection, face region cropping, and pose normalization. The 3D coordinates of each normalized facial surface are then projected onto the 2D plane to generate geometry images, from which three images of the facial surface normal components are estimated. Each normal image is then fed into a pre-trained deep face network to generate deep representations of facial surface normals, i.e., deep normal patterns. Considering the importance of different facial locations, we propose a location-sensitive sparse representation classifier (LS-SRC) for measuring similarity among the deep normal patterns associated with different 3D faces. Finally, simple score-level fusion of the different normal components is used for the final decision. The proposed approach achieves high performance, reporting rank-one scores of 98.01%, 97.60%, and 96.13% on the FRGC v2.0, Bosphorus, and BU-3DFE databases when only one sample per subject is used in the gallery. These experimental results reveal that the performance of 3D face recognition can be steadily improved by training deep models on massive collections of 2D face images, which opens the door to future directions in 3D face recognition.
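As a rough illustration of the geometry-image step, the three normal-component images can be estimated from an H x W x 3 grid of projected 3D coordinates by taking the cross product of the two tangent vectors; the sketch below is an assumption about this estimation, not the authors' code.

```python
import numpy as np

def normal_maps_from_geometry_image(geom):          # geom: (H, W, 3)
    du = np.gradient(geom, axis=1)                  # tangent along image columns
    dv = np.gradient(geom, axis=0)                  # tangent along image rows
    n = np.cross(du, dv)                            # un-normalised surface normal
    n /= np.linalg.norm(n, axis=2, keepdims=True) + 1e-12
    return n[..., 0], n[..., 1], n[..., 2]          # nx, ny, nz component images

nx, ny, nz = normal_maps_from_geometry_image(np.random.rand(128, 128, 3))
```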
{"title":"Location-sensitive sparse representation of deep normal patterns for expression-robust 3D face recognition","authors":"Huibin Li, Jian Sun, Liming Chen","doi":"10.1109/BTAS.2017.8272703","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272703","url":null,"abstract":"This paper presents a straight-forward yet efficient, and expression-robust 3D face recognition approach by exploring location sensitive sparse representation of deep normal patterns (DNP). In particular, given raw 3D facial surfaces, we first run 3D face pre-processing pipeline, including nose tip detection, face region cropping, and pose normalization. The 3D coordinates of each normalized 3D facial surface are then projected into 2D plane to generate geometry images, from which three images of facial surface normal components are estimated. Each normal image is then fed into a pre-trained deep face net to generate deep representations of facial surface normals, i.e., deep normal patterns. Considering the importance of different facial locations, we propose a location sensitive sparse representation classifier (LS-SRC) for similarity measure among deep normal patterns associated with different 3D faces. Finally, simple score-level fusion of different normal components are used for the final decision. The proposed approach achieves significantly high performance, and reporting rank-one scores of 98.01%, 97.60%, and 96.13% on the FRGC v2.0, Bosphorus, and BU-3DFE databases when only one sample per subject is used in the gallery. These experimental results reveals that the performance of 3D face recognition would be constantly improved with the aid of training deep models from massive 2D face images, which opens the door for future directions of 3D face recognition.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128974683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ICFVR 2017: 3rd international competition on finger vein recognition
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272760
Yi Zhang, Houjun Huang, Haifeng Zhang, Liao Ni, W. Xu, N. U. Ahmed, Md. Shakil Ahmed, Yilun Jin, Ying Chen, Jingxuan Wen, Wenxin Li
In recent years, finger vein recognition has become an important sub-field of biometrics and has been applied in real-world settings. The development of finger vein recognition algorithms depends heavily on large-scale real-world data sets. To motivate research on finger vein recognition, we released the largest finger vein data set to date and hold finger vein recognition competitions based on it every year. In 2017, the International Competition on Finger Vein Recognition (ICFVR) was held jointly with IJCB 2017. Eleven teams registered and ten of them joined the final evaluation. This year's winner dramatically improved the EER from 2.64% to 0.483% compared to last year's winner. In this paper, we introduce the process and results of ICFVR 2017 and give insights into the development of state-of-the-art finger vein recognition algorithms.
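For context, the EER reported above is the operating point where the false accept rate equals the false reject rate; a minimal (unofficial) computation from genuine and impostor score sets looks like this:

```python
import numpy as np

def eer(genuine, impostor):
    # Sweep all observed scores as thresholds; accept when score >= threshold.
    thresholds = np.sort(np.concatenate([genuine, impostor]))
    far = np.array([(impostor >= t).mean() for t in thresholds])  # false accepts
    frr = np.array([(genuine < t).mean() for t in thresholds])    # false rejects
    i = np.argmin(np.abs(far - frr))          # closest crossing of the two curves
    return (far[i] + frr[i]) / 2.0

rng = np.random.default_rng(0)
print(eer(rng.normal(0.8, 0.1, 1000), rng.normal(0.3, 0.1, 1000)))
```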
{"title":"ICFVR 2017: 3rd international competition on finger vein recognition","authors":"Yi Zhang, Houjun Huang, Haifeng Zhang, Liao Ni, W. Xu, N. U. Ahmed, Md. Shakil Ahmed, Yilun Jin, Ying Chen, Jingxuan Wen, Wenxin Li","doi":"10.1109/BTAS.2017.8272760","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272760","url":null,"abstract":"In recent years, finger vein recognition has become an important sub-field in biometrics and been applied to real-world applications. The development of finger vein recognition algorithms heavily depends on large-scale real-world data sets. In order to motivate research on finger vein recognition, we released the largest finger vein data set up to now and hold finger vein recognition competitions based on our data set every year. In 2017, International Competition on Finger Vein Recognition (ICFVR) is held jointly with IJCB 2017. 11 teams registered and 10 of them joined the final evaluation. The winner of this year dramatically improved the EER from 2.64% to 0.483% compared to the 'winner of last year. In this paper, we introduce the process and results of ICFVR 2017 and give insights on development of state-of-art finger vein recognition algorithms.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124421728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In defense of low-level structural features and SVMs for facial attribute classification: Application to detection of eye state, Mouth State, and eyeglasses in the wild
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272747
Abdulaziz Alorf, A. L. Abbott
The current trend in image analysis is to employ automatically learned feature types, such as those obtained using deep-learning techniques. For some applications, however, manually crafted features such as Histogram of Oriented Gradients (HOG) continue to yield better performance in demanding situations. This paper considers both approaches to the problem of facial attribute classification for images obtained “in the wild.” Attributes of particular interest are eye state (open/closed), mouth state (open/closed), and eyeglasses (present/absent). We present a full face-processing pipeline, from detection to attribute classification, that employs conventional machine learning techniques. Experimental results indicate better performance using RootSIFT with a conventional support-vector machine (SVM) approach than deep-learning approaches reported in the literature. Our proposed open/closed eye classifier yields an accuracy of 99.3% on the CEW dataset and 98.7% on the ZJU dataset. Similarly, our proposed open/closed mouth classifier achieves performance similar to deep learning. Our proposed presence/absence eyeglasses classifier also delivers very good performance, being the best method on LFWA and second best on the CelebA dataset. The system reported here runs at 30 fps on HD-sized video using a CPU-only implementation.
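The RootSIFT mapping the pipeline relies on (Arandjelovic and Zisserman, 2012) is simple: L1-normalise each SIFT descriptor and take element-wise square roots, so that Euclidean distance on the result corresponds to the Hellinger kernel on the original descriptors:

```python
import numpy as np

def root_sift(descriptors, eps=1e-7):
    d = np.asarray(descriptors, dtype=float)
    d /= d.sum(axis=1, keepdims=True) + eps   # L1 normalisation per descriptor
    return np.sqrt(d)                         # Hellinger (square-root) mapping

feats = root_sift(np.abs(np.random.rand(10, 128)))  # 10 SIFT-like descriptors
```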
{"title":"In defense of low-level structural features and SVMs for facial attribute classification: Application to detection of eye state, Mouth State, and eyeglasses in the wild","authors":"Abdulaziz Alorf, A. L. Abbott","doi":"10.1109/BTAS.2017.8272747","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272747","url":null,"abstract":"The current trend in image analysis is to employ automatically detected feature types, such as those obtained using deep-learning techniques. For some applications, however, manually crafted features such as Histogram of Oriented Gradients (HOG) continue to yield better performance in demanding situations. This paper considers both approaches for the problem of facial attribute classification, for images obtained “in the wild.” Attributes of particular interest are eye state (open/closed), mouth state (open/closed), and eyeglasses (present/absent). We present a full face-processing pipeline that employs conventional machine learning techniques, from detection to attribute classification. Experimental results have indicated better performance using RootSIFT with a conventional support-vector machine (SVM) approach, as compared to deep-learning approaches that have been reported in the literature. Our proposed open/closed eye classifier has yielded an accuracy of 99.3% on the CEW dataset, and an accuracy of 98.7% on the ZJU dataset. Similarly, our proposed open/closed mouth classifier has achieved performance similar to deep learning. Also, our proposed presence/absence eyeglasses classifier delivered very good performance, being the best method on LFWA, and second best for the CelebA dataset. The system reported here runs at 30 fps on HD-sized video using a CPU-only implementation.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132609026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Towards pre-alignment of near-infrared iris images
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272718
P. Drozdowski, C. Rathgeb, H. Hofbauer, J. Wagner, A. Uhl, C. Busch
The necessity of biometric template alignment imposes a significant computational load and increases the probability of false positive occurrences in biometric systems. While automatic pre-alignment of biometric samples is utilised for some modalities, this topic has not yet been explored for systems based on the iris. This paper presents a method for pre-alignment of iris images based on the positions of automatically detected eye corners. Existing work in the area of automatic eye corner detection has hitherto involved only visible-wavelength images; for the near-infrared images used in the vast majority of current iris recognition systems, this task is significantly more challenging and as yet unexplored. A comparative study of two methods for solving this problem is presented in this paper. The eye corners detected by the two methods are then used in pre-alignment and biometric performance evaluation experiments. The system utilising image pre-alignment is benchmarked against a baseline iris recognition system on the iris subset of the BioSecure database. In the benchmark, the workload associated with alignment compensation is significantly reduced, while the biometric performance remains unchanged or even improves slightly.
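A plausible sketch of corner-based pre-alignment, assuming OpenCV and not necessarily the paper's exact procedure: rotate the image so that the line through the two detected eye corners becomes horizontal, removing in-plane rotation before iris encoding.

```python
import cv2
import numpy as np

def prealign(image, inner_corner, outer_corner):
    (x1, y1), (x2, y2) = inner_corner, outer_corner
    angle = np.degrees(np.arctan2(y2 - y1, x2 - x1))   # in-plane tilt of the eye
    center = ((x1 + x2) / 2.0, (y1 + y2) / 2.0)        # rotate about the midpoint
    rot = cv2.getRotationMatrix2D(center, angle, 1.0)  # rotate the tilt away
    h, w = image.shape[:2]
    return cv2.warpAffine(image, rot, (w, h))

aligned = prealign(np.zeros((480, 640), np.uint8), (200, 250), (300, 260))
```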
{"title":"Towards pre-alignment of near-infrared iris images","authors":"P. Drozdowski, C. Rathgeb, H. Hofbauer, J. Wagner, A. Uhl, C. Busch","doi":"10.1109/BTAS.2017.8272718","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272718","url":null,"abstract":"The necessity of biometric template alignment imposes a significant computational load and increases the probability of false positive occurrences in biometric systems. While for some modalities, automatic pre-alignment of biometric samples is utilised, this topic has not yet been explored for systems based on the iris. This paper presents a method for pre-alignment of iris images based on the positions ofautomatically detected eye corners. Existing work in the area of automatic eye corner detection has hitherto only involved visible wavelength images; for the near-infrared images, used in the vast majority of current iris recognition systems, this task is significantly more challenging and as of yet unexplored. A comparative study of two methods for solving this problem is presented in this paper. The eye corners detected by the two methods are then used for the pre-alignment and biometric performance evaluation experiments. The system utilising image pre-alignment is benchmarked against a baseline iris recognition system on the iris subset of the BioSecure database. In the benchmark, the workload associated with alignment compensation is significantly reduced, while the biometric performance remains unchanged or even improves slightly.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114911039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Computing an image Phylogeny Tree from photometrically modified iris images
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272749
Sudipta Banerjee, A. Ross
Iris recognition entails the use of iris images to recognize an individual. In some cases, the iris image acquired from an individual can be modified by subjecting it to successive photometric transformations, such as brightening, gamma correction, median filtering, and Gaussian smoothing, resulting in a family of transformed images. Automatically inferring the relationships within the set of transformed images is important in the context of digital image forensics. In this regard, we develop a method to generate an Image Phylogeny Tree (IPT) from a set of such transformed images. Our strategy entails modeling an arbitrary photometric transformation as a linear or non-linear function and utilizing the parameters of the model to quantify the relationship between pairs of images. The estimated parameters are then used to generate the IPT. Modest yet promising results are obtained in terms of parameter estimation and IPT generation.
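As an illustration of the parameter-estimation idea under the linear model (the paper also considers non-linear models), one can fit I_j ≈ a·I_i + b by least squares and use the fit residual as a dissimilarity cue between an image pair; the tree-construction step is omitted here.

```python
import numpy as np

def fit_linear_photometric(src, dst):
    x = src.astype(float).ravel()
    y = dst.astype(float).ravel()
    A = np.stack([x, np.ones_like(x)], axis=1)       # model: y = a*x + b
    coef, _, _, _ = np.linalg.lstsq(A, y, rcond=None)
    a, b = coef
    residual = np.mean((A @ coef - y) ** 2)          # small: dst plausibly derived from src
    return a, b, residual

img = np.random.rand(64, 64)
print(fit_linear_photometric(img, 1.2 * img + 0.05))  # recovers a ~ 1.2, b ~ 0.05
```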
{"title":"Computing an image Phylogeny Tree from photometrically modified iris images","authors":"Sudipta Banerjee, A. Ross","doi":"10.1109/BTAS.2017.8272749","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272749","url":null,"abstract":"Iris recognition entails the use of iris images to recognize an individual. In some cases, the iris image acquired from an individual can be modified by subjecting it to successive photometric transformations such as brightening, gamma correction, median filtering and Gaussian smoothing, resulting in a family of transformed images. Automatically inferring the relationship between the set of transformed images is important in the context of digital image forensics. In this regard, we develop a method to generate an Image Phylogeny Tree (IPT) from a set of such transformed images. Our strategy entails modeling an arbitrary photometric transformation as a linear or non-linear function and utilizing the parameters of the model to quantify the relationship between pairs of images. The estimated parameters are then used to generate the IPT. Modest, yet promising, results are obtained in terms of parameter estimation and IPT generation.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130798426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Extracting sub-glottal and Supra-glottal features from MFCC using convolutional neural networks for speaker identification in degraded audio signals
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272748
Anurag Chowdhury, A. Ross
We present a deep-learning-based algorithm for speaker recognition from degraded audio signals. We use the commonly employed Mel-Frequency Cepstral Coefficients (MFCC) to represent the audio signals. A convolutional neural network (CNN) based on 1D filters, rather than 2D filters, is then designed. The filters in the CNN are designed to learn the inter-dependency between cepstral coefficients extracted from audio frames of fixed temporal extent. Our approach aims to extract speaker-dependent features of the human speech production apparatus, such as sub-glottal and supra-glottal features, for identifying speakers from degraded audio signals. The performance of the proposed method is compared against existing baseline schemes on both synthetically and naturally corrupted speech data. Experiments demonstrate the efficacy of the proposed architecture for speaker recognition.
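A minimal PyTorch sketch of such a 1D CNN, with layer sizes and MFCC dimensionality as illustrative assumptions: the 1D filters convolve across time, treating the cepstral coefficients as input channels.

```python
import torch
import torch.nn as nn

class Speaker1DCNN(nn.Module):
    def __init__(self, n_mfcc=20, n_speakers=100):
        super().__init__()
        self.net = nn.Sequential(
            # 1D filters span all MFCC channels, learning their inter-dependency.
            nn.Conv1d(n_mfcc, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(64, 128, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1), nn.Flatten(),   # pool over time frames
            nn.Linear(128, n_speakers),              # speaker logits
        )

    def forward(self, mfcc):          # mfcc: (batch, n_mfcc, time_frames)
        return self.net(mfcc)

logits = Speaker1DCNN()(torch.randn(4, 20, 300))   # -> (4, 100)
```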
{"title":"Extracting sub-glottal and Supra-glottal features from MFCC using convolutional neural networks for speaker identification in degraded audio signals","authors":"Anurag Chowdhury, A. Ross","doi":"10.1109/BTAS.2017.8272748","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272748","url":null,"abstract":"We present a deep learning based algorithm for speaker recognition from degraded audio signals. We use the commonly employed Mel-Frequency Cepstral Coefficients (MFCC) for representing the audio signals. A convolutional neural network (CNN) based on 1D filters, rather than 2D filters, is then designed. The filters in the CNN are designed to learn inter-dependency between cepstral coefficients extracted from audio frames of fixed temporal expanse. Our approach aims at extracting speaker dependent features, like Sub-glottal and Supra-glottal features, of the human speech production apparatus for identifying speakers from degraded audio signals. The performance of the proposed method is compared against existing baseline schemes on both synthetically and naturally corrupted speech data. Experiments convey the efficacy of the proposed architecture for speaker recognition.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132225129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Face anti-spoofing using patch and depth-based CNNs
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272713
Yousef Atoum, Yaojie Liu, Amin Jourabloo, Xiaoming Liu
The face image is the most accessible biometric modality and is used in highly accurate face recognition systems, yet it is vulnerable to many different types of presentation attacks. Face anti-spoofing is therefore a critical step before feeding a face image to a biometric system. In this paper, we propose a novel two-stream CNN-based approach for face anti-spoofing that extracts local features and holistic depth maps from face images. The local features help the CNN discriminate spoof patches independently of the spatial face areas, while the holistic depth map examines whether the input image has face-like depth. Extensive experiments are conducted on challenging databases (CASIA-FASD, MSU-USSA, and Replay-Attack), with comparison to the state of the art.
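A sketch of the two-stream decision logic; the fusion rule and flatness cue here are assumptions, not the paper's exact formulation. The patch stream scores local spoof evidence, while the depth stream checks whether the estimated depth map is face-like rather than flat, as it would be for a printed photo or screen.

```python
import torch

def fuse_spoof_scores(patch_scores, depth_map, w=0.5):
    # patch_scores: (num_patches,) spoof probabilities from the patch CNN.
    # depth_map: (H, W) depth estimated by the depth CNN; a near-flat map
    # (low variance) suggests a planar presentation attack.
    patch_score = patch_scores.mean()
    flatness = 1.0 / (1.0 + depth_map.var())     # high for flat, spoof-like maps
    return w * patch_score + (1 - w) * flatness  # higher = more likely spoof

score = fuse_spoof_scores(torch.rand(16), torch.rand(32, 32))
```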
{"title":"Face anti-spoofing using patch and depth-based CNNs","authors":"Yousef Atoum, Yaojie Liu, Amin Jourabloo, Xiaoming Liu","doi":"10.1109/BTAS.2017.8272713","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272713","url":null,"abstract":"The face image is the most accessible biometric modality which is used for highly accurate face recognition systems, while it is vulnerable to many different types of presentation attacks. Face anti-spoofing is a very critical step before feeding the face image to biometric systems. In this paper, we propose a novel two-stream CNN-based approach for face anti-spoofing, by extracting the local features and holistic depth maps from the face images. The local features facilitate CNN to discriminate the spoof patches independent of the spatial face areas. On the other hand, holistic depth map examine whether the input image has a face-like depth. Extensive experiments are conducted on the challenging databases (CASIA-FASD, MSU-USSA, and Replay Attack), with comparison to the state of the art.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121188977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Learning optimised representations for view-invariant gait recognition
Pub Date: 2017-10-01 | DOI: 10.1109/BTAS.2017.8272769
Ning Jia, Victor Sanchez, Chang-Tsun Li
Gait recognition can be performed without subject cooperation under harsh conditions, making it an important tool in forensic gait analysis, security control, and other commercial applications. One critical issue that prevents gait recognition systems from being widely accepted is the performance drop when the camera viewpoint varies between the registered templates and the query data. In this paper, we explore the potential of combining feature optimisers with representations learned by convolutional neural networks (CNNs) to achieve efficient view-invariant gait recognition. The experimental results indicate that CNNs learn highly discriminative representations across moderate view variations and that these representations can be further improved using view-invariant feature selectors, achieving high matching accuracy across views.
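An illustrative sketch in which the feature selector is a simple stand-in, not the paper's optimiser: keep the embedding dimensions that are most stable for the same subject across views, then match probe to gallery by cosine similarity.

```python
import numpy as np

def select_view_stable_dims(emb_view_a, emb_view_b, keep=128):
    # emb_view_*: (num_subjects, dim) embeddings of the same subjects, two views.
    instability = np.mean((emb_view_a - emb_view_b) ** 2, axis=0)
    return np.argsort(instability)[:keep]          # most view-invariant dimensions

def cosine_match(gallery, probe, dims):
    g = gallery[:, dims] / np.linalg.norm(gallery[:, dims], axis=1, keepdims=True)
    p = probe[:, dims] / np.linalg.norm(probe[:, dims], axis=1, keepdims=True)
    return np.argmax(p @ g.T, axis=1)              # best gallery id per probe

a, b = np.random.rand(50, 512), np.random.rand(50, 512)
ids = cosine_match(a, b, select_view_stable_dims(a, b))
```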
{"title":"Learning optimised representations for view-invariant gait recognition","authors":"Ning Jia, Victor Sanchez, Chang-Tsun Li","doi":"10.1109/BTAS.2017.8272769","DOIUrl":"https://doi.org/10.1109/BTAS.2017.8272769","url":null,"abstract":"Gait recognition can be performed without subject cooperation under harsh conditions, thus it is an important tool in forensic gait analysis, security control, and other commercial applications. One critical issue that prevents gait recognition systems from being widely accepted is the performance drop when the camera viewpoint varies between the registered templates and the query data. In this paper, we explore the potential of combining feature optimisers and representations learned by convolutional neural networks (CNN) to achieve efficient view-invariant gait recognition. The experimental results indicate that CNN learns highly discriminative representations across moderate view variations, and these representations can be further improved using view-invariant feature selectors, achieving a high matching accuracy across views.","PeriodicalId":372008,"journal":{"name":"2017 IEEE International Joint Conference on Biometrics (IJCB)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116031842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}