首页 > 最新文献

2015 International Conference on Systems, Signals and Image Processing (IWSSIP)最新文献

英文 中文
Fetal ECG extraction using πTucker decomposition 利用πTucker分解提取胎儿心电
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314205
Hassan Akbari, M. Shamsollahi, R. Phlypo
In this paper, we introduce a novel approach based on Tucker Decomposition and quasi-periodic nature of ECG signal for fetal ECG extraction from abdominal ECG mixture. We adapt variable periodicity constraint of the ECG components to main objective function of the Tucker Decomposition and shape it to matrix form in order to simply optimize the objective function. We form a 3rd order tensor by stacking the mixed multichannel ECG and reconstructed fetal and maternal subspaces using BSS methods in order to have the benefit of further artificial observations, and apply our proposed penalized decomposition on it. The proposed method is evaluated on synthetic and real datasets using the criteria Signal to Interference plus Noise Ratio (SINR) for fetal component considering mother component as interference. Results and evaluations show a superior SINR improvement of 1 to 4 dB compared to other state of the art methods.
本文提出了一种基于塔克分解和心电信号准周期特性的胎儿心电提取方法。将心电分量的可变周期约束引入塔克分解的主要目标函数,并将其转化为矩阵形式,以实现目标函数的简单优化。为了便于进一步的人工观测,我们将混合多通道ECG叠加形成一个三阶张量,并利用BSS方法重构胎儿和母亲子空间,并对其进行惩罚分解。将母分量作为干扰,以胎儿分量的信噪比(SINR)为标准,在合成数据集和真实数据集上对该方法进行了评价。结果和评估表明,与其他最先进的方法相比,该方法的信噪比提高了1至4 dB。
{"title":"Fetal ECG extraction using πTucker decomposition","authors":"Hassan Akbari, M. Shamsollahi, R. Phlypo","doi":"10.1109/IWSSIP.2015.7314205","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314205","url":null,"abstract":"In this paper, we introduce a novel approach based on Tucker Decomposition and quasi-periodic nature of ECG signal for fetal ECG extraction from abdominal ECG mixture. We adapt variable periodicity constraint of the ECG components to main objective function of the Tucker Decomposition and shape it to matrix form in order to simply optimize the objective function. We form a 3rd order tensor by stacking the mixed multichannel ECG and reconstructed fetal and maternal subspaces using BSS methods in order to have the benefit of further artificial observations, and apply our proposed penalized decomposition on it. The proposed method is evaluated on synthetic and real datasets using the criteria Signal to Interference plus Noise Ratio (SINR) for fetal component considering mother component as interference. Results and evaluations show a superior SINR improvement of 1 to 4 dB compared to other state of the art methods.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132346727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Improvement of speech emotion recognition with neural network classifier by using speech spectrogram 利用语音谱图改进神经网络分类器的语音情感识别
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314180
Sathit Prasomphan
This research presents a novel algorithm for detecting human emotion via speech recognition by using speech spectrogram. The proposed algorithm aims to detect the emotional by using information inside the spectrogram. Neural network was used for being the classifier. A new approach to feature extraction based on analysis of two dimensions time-frequency representation of a speech signal have been presented. The algorithm was tested with EMO-Database. The experimental results show that the proposed framework can efficiently find the correct speech emotion compared to using the comparing method.
本文提出了一种基于语音谱图的语音识别情感检测算法。该算法旨在利用谱图内部的信息来检测情绪。采用神经网络作为分类器。提出了一种基于语音信号二维时频表示分析的特征提取方法。用EMO-Database对算法进行了验证。实验结果表明,与传统的比较方法相比,该框架能有效地找到正确的语音情感。
{"title":"Improvement of speech emotion recognition with neural network classifier by using speech spectrogram","authors":"Sathit Prasomphan","doi":"10.1109/IWSSIP.2015.7314180","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314180","url":null,"abstract":"This research presents a novel algorithm for detecting human emotion via speech recognition by using speech spectrogram. The proposed algorithm aims to detect the emotional by using information inside the spectrogram. Neural network was used for being the classifier. A new approach to feature extraction based on analysis of two dimensions time-frequency representation of a speech signal have been presented. The algorithm was tested with EMO-Database. The experimental results show that the proposed framework can efficiently find the correct speech emotion compared to using the comparing method.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114754078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Learning a joint discriminative-generative model for action recognition 学习动作识别的联合判别生成模型
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7313922
I. Alexiou, T. Xiang, S. Gong
An action consists of a sequence of instantaneous motion patterns whose temporal ordering contains critical information especially for distinguishing fine-grained action categories. However, existing action recognition methods are dominated by discriminative classifiers such as kernel machines or metric learning with Bag-of-Words (BoW) action representations. They ignore the temporal structures of actions in exchange for robustness against noise. Although such temporal structures can be modelled explicitly using dynamic generative models such as Hidden Markov Models (HMMs), these generative models are designed to maximise the likelihood of the data therefore providing no guarantee on suitability for discrimination required by action recognition. In this work, a novel approach is proposed to explore the best of both worlds by discriminatively learning a generative action model. Specifically, our approach is based on discriminative Fisher kernel learning which learns a dynamic generative model so that the distance between the log-likelihood gradients induced by two actions of the same class is minimised. We demonstrate the advantages of the proposed model over the state-of-the-art action recognition methods using two challenging benchmark datasets of complex actions.
动作由一系列瞬时运动模式组成,其时间顺序包含关键信息,特别是用于区分细粒度动作类别。然而,现有的动作识别方法主要是判别分类器,如核机器或带有词袋(BoW)动作表示的度量学习。它们忽略动作的时间结构,以换取对噪声的鲁棒性。虽然这种时间结构可以使用动态生成模型(如隐马尔可夫模型(hmm))明确建模,但这些生成模型旨在最大化数据的可能性,因此无法保证行动识别所需的歧视的适用性。在这项工作中,提出了一种新的方法,通过判别学习生成行为模型来探索两个世界的最佳效果。具体来说,我们的方法是基于判别费雪核学习,它学习一个动态生成模型,以便最小化由同一类的两个动作引起的对数似然梯度之间的距离。我们使用两个具有挑战性的复杂动作基准数据集来证明所提出的模型优于最先进的动作识别方法。
{"title":"Learning a joint discriminative-generative model for action recognition","authors":"I. Alexiou, T. Xiang, S. Gong","doi":"10.1109/IWSSIP.2015.7313922","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7313922","url":null,"abstract":"An action consists of a sequence of instantaneous motion patterns whose temporal ordering contains critical information especially for distinguishing fine-grained action categories. However, existing action recognition methods are dominated by discriminative classifiers such as kernel machines or metric learning with Bag-of-Words (BoW) action representations. They ignore the temporal structures of actions in exchange for robustness against noise. Although such temporal structures can be modelled explicitly using dynamic generative models such as Hidden Markov Models (HMMs), these generative models are designed to maximise the likelihood of the data therefore providing no guarantee on suitability for discrimination required by action recognition. In this work, a novel approach is proposed to explore the best of both worlds by discriminatively learning a generative action model. Specifically, our approach is based on discriminative Fisher kernel learning which learns a dynamic generative model so that the distance between the log-likelihood gradients induced by two actions of the same class is minimised. We demonstrate the advantages of the proposed model over the state-of-the-art action recognition methods using two challenging benchmark datasets of complex actions.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"224 7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129893310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Comparison of segmentation accuracy for different LUTs applied to digital mammograms 不同lut在数字乳房x光片上的分割精度比较
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314190
M. Mustra, G. Peros, B. Zovko-Cihlar
In this paper we try to compare usage of different image processing techniques applied to mammograms. Different imaging devices produce images with different properties, which can be determined by histogram comparison. Digital imaging devices store images according to the DICOM standard, but to view images properly, images need to be processed for displaying on a desired display. Linear grayscale transformation does not provide a good solution and therefore grayscale values are being converted using look-up-tables (LUTs). Once converted for optimal displaying, images suffer from a loss of details and sometimes information which could be useful for computer-aided detection (CAD) algorithms. In this paper we will compare segmentation accuracy of the breast tissue and nipple detection accuracy when using different image preprocessing techniques.
在本文中,我们试图比较使用不同的图像处理技术应用于乳房x线照片。不同的成像设备产生的图像具有不同的属性,这可以通过直方图比较来确定。数字成像设备根据DICOM标准存储图像,但要正确查看图像,需要对图像进行处理,以便在所需的显示器上显示。线性灰度变换不提供一个很好的解决方案,因此灰度值被转换使用查找表(lut)。一旦转换为最佳显示,图像就会丢失细节,有时还会丢失对计算机辅助检测(CAD)算法有用的信息。本文将比较不同图像预处理技术对乳腺组织的分割精度和乳头的检测精度。
{"title":"Comparison of segmentation accuracy for different LUTs applied to digital mammograms","authors":"M. Mustra, G. Peros, B. Zovko-Cihlar","doi":"10.1109/IWSSIP.2015.7314190","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314190","url":null,"abstract":"In this paper we try to compare usage of different image processing techniques applied to mammograms. Different imaging devices produce images with different properties, which can be determined by histogram comparison. Digital imaging devices store images according to the DICOM standard, but to view images properly, images need to be processed for displaying on a desired display. Linear grayscale transformation does not provide a good solution and therefore grayscale values are being converted using look-up-tables (LUTs). Once converted for optimal displaying, images suffer from a loss of details and sometimes information which could be useful for computer-aided detection (CAD) algorithms. In this paper we will compare segmentation accuracy of the breast tissue and nipple detection accuracy when using different image preprocessing techniques.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132206163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved reference picture list sorting in video coding 改进了视频编码中的参考图片列表排序
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314175
S. Schwarz, M. Mrak
Motion compensated prediction using multiple references is a key element of any modern video coding standard. A smart and meaningful selection of references contributes significantly to encoding efficiency. This selection could either be performed through computationally expensive pre-analysis and reference picture selection optimisation, or through improved reference picture list structures based on general encoder decisions. This paper analyses encoder reference picture decisions for a maximum set of available reference pictures. Characteristic encoder decision properties are identified, then two variants of the standard HEVC reference picture list structure and sorting approach are derived and implemented. Evaluation verifies that the conclusions drawn from the general encoder decisions hold. The proposed changes provide coding efficiency benefits in terms of bit rate savings of up to 6%, with a limited increase in computational complexity of around 11%.
使用多个参考的运动补偿预测是任何现代视频编码标准的关键要素。明智而有意义的引用选择对编码效率有很大的帮助。这种选择可以通过计算昂贵的预分析和参考图片选择优化来执行,或者通过基于一般编码器决策的改进参考图片列表结构来执行。本文分析了最大可用参考图片集下编码器参考图片的选择问题。首先确定了编码器的特征判断属性,然后推导并实现了HEVC标准参考图片列表结构和排序方法的两种变体。求值验证从一般编码器决策得出的结论是否成立。提出的更改提供了编码效率方面的好处,即比特率节省高达6%,计算复杂性增加有限,约为11%。
{"title":"Improved reference picture list sorting in video coding","authors":"S. Schwarz, M. Mrak","doi":"10.1109/IWSSIP.2015.7314175","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314175","url":null,"abstract":"Motion compensated prediction using multiple references is a key element of any modern video coding standard. A smart and meaningful selection of references contributes significantly to encoding efficiency. This selection could either be performed through computationally expensive pre-analysis and reference picture selection optimisation, or through improved reference picture list structures based on general encoder decisions. This paper analyses encoder reference picture decisions for a maximum set of available reference pictures. Characteristic encoder decision properties are identified, then two variants of the standard HEVC reference picture list structure and sorting approach are derived and implemented. Evaluation verifies that the conclusions drawn from the general encoder decisions hold. The proposed changes provide coding efficiency benefits in terms of bit rate savings of up to 6%, with a limited increase in computational complexity of around 11%.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116955676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Diphone spanish text-to-speech synthesizer Diphone西班牙语文本语音合成器
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314192
R. Rybarova, Gonzalo del Corral, G. Rozinaj
The paper deals with development of speech synthesizer for Spanish language within a complex modular speech synthesis system with a multilingual support. The whole concept of the system architecture is described in the paper, together with a method for the quality improvement of a synthesized speech. A short comparison of Slovak and Spanish languages is discussed from phonetic transcription point of view. The quality of the final synthesized speech based on the new Spanish synthesizer has also been tested and evaluated.
本文研究了在一个多语言支持的复杂模块化语音合成系统中西班牙语语音合成器的开发。本文描述了系统架构的整体概念,并提出了一种提高合成语音质量的方法。从音标的角度对斯洛伐克语和西班牙语进行了简要的比较。基于新的西班牙语合成器的最终合成语音的质量也进行了测试和评估。
{"title":"Diphone spanish text-to-speech synthesizer","authors":"R. Rybarova, Gonzalo del Corral, G. Rozinaj","doi":"10.1109/IWSSIP.2015.7314192","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314192","url":null,"abstract":"The paper deals with development of speech synthesizer for Spanish language within a complex modular speech synthesis system with a multilingual support. The whole concept of the system architecture is described in the paper, together with a method for the quality improvement of a synthesized speech. A short comparison of Slovak and Spanish languages is discussed from phonetic transcription point of view. The quality of the final synthesized speech based on the new Spanish synthesizer has also been tested and evaluated.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115315749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Face filtering — Insights from real-world data 面部过滤-来自真实世界数据的见解
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314178
Yohannes Biadgligne, Ognjen Arandjelovic
Digital image processing filters continue to be used widely for the normalization of illumination effects in face recognition, both in research and in practice. Their appeal stems from their simplicity, efficiency, predictable and well-understood behaviour, and importantly, lack of catastrophic failure modes. Notwithstanding this widespread use, no work to date has performed a comparative analysis of different filters in challenging, realistic conditions expected in practice - filters in previous work are either adopted in isolation or evaluated in constrained conditions unrepresentative of real-world challenges. In this paper we perform, report, and discuss a comparative evaluation of a number of popular filters on a challenging, real-world data set which contains major changes in illumination, pose (yaw and pitch), camera-user distance, image resolution, and (often neglected) camera type. Our results demonstrate that relative performances of different filters in realistic imaging conditions such as those examined in this paper are vastly different than when the same filters are evaluated in a controlled setting as in previous work. Therefore our results provide important insight for practical application of image filters and future research.
无论是在研究还是在实践中,数字图像处理滤波器都被广泛用于人脸识别中照明效果的归一化。它们的吸引力源于它们的简单、高效、可预测和易于理解的行为,重要的是,它们没有灾难性的失效模式。尽管这种方法得到了广泛的应用,但迄今为止还没有研究在具有挑战性的实际条件下对不同的过滤器进行了比较分析——以前的研究中,过滤器要么是单独采用的,要么是在不代表现实世界挑战的受限条件下进行评估的。在本文中,我们在具有挑战性的真实世界数据集上执行,报告和讨论了许多流行滤镜的比较评估,这些数据集包含照明,姿势(偏航和俯距),相机用户距离,图像分辨率和(经常被忽视的)相机类型的主要变化。我们的研究结果表明,不同的滤波器在现实成像条件下的相对性能,如本文所研究的,与在以前的工作中在控制设置中评估相同的滤波器时的相对性能有很大不同。因此,我们的研究结果为图像滤波器的实际应用和未来的研究提供了重要的见解。
{"title":"Face filtering — Insights from real-world data","authors":"Yohannes Biadgligne, Ognjen Arandjelovic","doi":"10.1109/IWSSIP.2015.7314178","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314178","url":null,"abstract":"Digital image processing filters continue to be used widely for the normalization of illumination effects in face recognition, both in research and in practice. Their appeal stems from their simplicity, efficiency, predictable and well-understood behaviour, and importantly, lack of catastrophic failure modes. Notwithstanding this widespread use, no work to date has performed a comparative analysis of different filters in challenging, realistic conditions expected in practice - filters in previous work are either adopted in isolation or evaluated in constrained conditions unrepresentative of real-world challenges. In this paper we perform, report, and discuss a comparative evaluation of a number of popular filters on a challenging, real-world data set which contains major changes in illumination, pose (yaw and pitch), camera-user distance, image resolution, and (often neglected) camera type. Our results demonstrate that relative performances of different filters in realistic imaging conditions such as those examined in this paper are vastly different than when the same filters are evaluated in a controlled setting as in previous work. Therefore our results provide important insight for practical application of image filters and future research.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115743465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Discriminative training of HMM using MASPER procedure 基于MASPER程序的HMM判别训练
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314185
J. Kacur, Tibor Trnovsky, R. Vargic
The main focus of the article is on incorporating discriminative training into MASPER multilingual training procedure by some necessary modifications. Next the performance of discriminative rules like Maximal Mutual Information (MMI) and Minimal Phone Error (MPE), application of I smoothing technique, setting up convergence parameter, benefits of discriminative training for different hidden Markov models (HMM), etc. are tested and evaluated. Moreover an overview of discriminative training strategies and their relations to the classical Maximum Likelihood (ML) estimation is given. All experiments have been accomplished on Slovak part of MobilDat training database that contains wide range of noises and specific GSM distortions. Achieved results show that discriminative training if properly adjusted can improve performance over ML training on average by 5% depending on the model complexity, training strategies and deployment scenarios. Finally, MPE when properly set may outperform MMI, however it is prone to higher sensitivity to the set parameters, used models and application domain.
本文的主要重点是通过一些必要的修改,将判别训练纳入MASPER多语训练程序。然后对最大互信息(MMI)和最小电话误差(MPE)等判别规则的性能、I平滑技术的应用、收敛参数的设置、不同隐马尔可夫模型(HMM)的判别训练效果等进行了测试和评价。此外,还概述了判别训练策略及其与经典最大似然(ML)估计的关系。所有实验都是在MobilDat训练数据库的斯洛伐克部分完成的,该部分包含广泛的噪声和特定的GSM失真。所取得的结果表明,根据模型复杂性、训练策略和部署场景的不同,适当调整判别训练可以使机器学习训练的性能平均提高5%。最后,当适当设置时,MPE可能优于MMI,但是它容易对设置的参数,使用的模型和应用领域具有更高的灵敏度。
{"title":"Discriminative training of HMM using MASPER procedure","authors":"J. Kacur, Tibor Trnovsky, R. Vargic","doi":"10.1109/IWSSIP.2015.7314185","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314185","url":null,"abstract":"The main focus of the article is on incorporating discriminative training into MASPER multilingual training procedure by some necessary modifications. Next the performance of discriminative rules like Maximal Mutual Information (MMI) and Minimal Phone Error (MPE), application of I smoothing technique, setting up convergence parameter, benefits of discriminative training for different hidden Markov models (HMM), etc. are tested and evaluated. Moreover an overview of discriminative training strategies and their relations to the classical Maximum Likelihood (ML) estimation is given. All experiments have been accomplished on Slovak part of MobilDat training database that contains wide range of noises and specific GSM distortions. Achieved results show that discriminative training if properly adjusted can improve performance over ML training on average by 5% depending on the model complexity, training strategies and deployment scenarios. Finally, MPE when properly set may outperform MMI, however it is prone to higher sensitivity to the set parameters, used models and application domain.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127894250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clustering algorithms for face recognition based on client-server architecture 基于客户端-服务器架构的人脸识别聚类算法
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7314221
M. Oravec, Dominik Sopiak, V. Jirka, J. Pavlovičová, Mark Budiak
Mobile devices like smartphones and tablets have become an integral part of our everyday life. These devices often store private information, which needs to be protected. To preserve this data we mainly use passwords, codes or SMS confirmation. They are easy to use, however there is always a risk of forgetting the password and also the risk of an impostor. On the other hand, there are other methods to identify a person, which overcome these threats. Biometric methods use the person itself to verify its identity. Many mobile devices like smartphones or tablets already have an implementation of biometric systems, but their usage often caused problems like shorter battery life, because of their computational complexity. Here a client-server architecture can be used, where the recognition process is divided into computational part running on the server and the acquisitional part running on the mobile device. In this paper a client-server face recognition system is presented with several clustering algorithms like k-means, self-organizing map etc. used for automatic training sample selection. The paper provides a comparative study of these algorithms and their impact on the implemented systems success rate.
像智能手机和平板电脑这样的移动设备已经成为我们日常生活中不可或缺的一部分。这些设备通常存储需要保护的私人信息。为了保存这些数据,我们主要使用密码、代码或短信确认。他们很容易使用,但总是有忘记密码的风险,也有冒名顶替的风险。另一方面,还有其他方法来识别一个人,这些方法克服了这些威胁。生物识别方法使用人本身来验证其身份。许多移动设备,如智能手机或平板电脑已经实现了生物识别系统,但由于它们的计算复杂性,它们的使用往往会导致电池寿命缩短等问题。这里可以使用客户机-服务器架构,其中识别过程分为运行在服务器上的计算部分和运行在移动设备上的获取部分。本文提出了一种基于客户端-服务器的人脸识别系统,采用k-means、自组织映射等聚类算法自动选择训练样本。本文提供了这些算法的比较研究及其对实现系统成功率的影响。
{"title":"Clustering algorithms for face recognition based on client-server architecture","authors":"M. Oravec, Dominik Sopiak, V. Jirka, J. Pavlovičová, Mark Budiak","doi":"10.1109/IWSSIP.2015.7314221","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7314221","url":null,"abstract":"Mobile devices like smartphones and tablets have become an integral part of our everyday life. These devices often store private information, which needs to be protected. To preserve this data we mainly use passwords, codes or SMS confirmation. They are easy to use, however there is always a risk of forgetting the password and also the risk of an impostor. On the other hand, there are other methods to identify a person, which overcome these threats. Biometric methods use the person itself to verify its identity. Many mobile devices like smartphones or tablets already have an implementation of biometric systems, but their usage often caused problems like shorter battery life, because of their computational complexity. Here a client-server architecture can be used, where the recognition process is divided into computational part running on the server and the acquisitional part running on the mobile device. In this paper a client-server face recognition system is presented with several clustering algorithms like k-means, self-organizing map etc. used for automatic training sample selection. The paper provides a comparative study of these algorithms and their impact on the implemented systems success rate.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"111 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122633968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Long term availability analysis of experimental free space optics system 实验自由空间光学系统的长期可用性分析
Pub Date : 2015-11-02 DOI: 10.1109/IWSSIP.2015.7313929
J. Toth, M. Tatarko, Ľ. Ovseník, J. Turán
This paper deals with Free Space Optics (FSO) systems. Availability and reliability of FSO is taken under the scope. FSO links transmit data within the infrared wavelength region. Lasers are used to create medium to carry informational stream. FSO systems need Line of Sight (LOS) technology in order to maintain connection between two points. Weather conditions have quite significant impact on FSO operation in terms of availability and reliability because of free space transmission. It is necessary to evaluate the air quality at the actual geographical location where FSO link is located. It is important to determine the impact of a light scattering, absorption, turbulence and receiving power at the particular FSO link. Visibility has one of the most critical impacts on the quality of an optical transmission channel. Moreover, it is very helpful to monitor and store the information about rain, snow and fog values. This paper introduces a device which measures all mentioned weather indicators such as a fog density, a relative humidity and the temperature. FSO availability and reliability estimation is made from measured data. These results evaluate weather conditions for Kosice, Slovakia in term of FSO operation.
本文研究自由空间光学系统。对无线通信系统的可用性和可靠性进行了研究。FSO链路在红外波长范围内传输数据。激光被用来制造传输信息流的介质。FSO系统需要视距(LOS)技术来保持两点之间的连接。由于自由空间传输,天气条件对无线通信系统的可用性和可靠性有很大的影响。有必要评估固网连接所处实际地理位置的空气质素。确定光散射、吸收、湍流和接收功率对特定FSO链路的影响是很重要的。可见性是影响光传输信道质量的最关键因素之一。此外,对雨、雪、雾值信息的监测和存储也很有帮助。本文介绍了一种测量上述气象指标如雾密度、相对湿度和温度的装置。利用实测数据对FSO的可用性和可靠性进行了估计。这些结果评估了斯洛伐克科希策的天气条件,就FSO操作而言。
{"title":"Long term availability analysis of experimental free space optics system","authors":"J. Toth, M. Tatarko, Ľ. Ovseník, J. Turán","doi":"10.1109/IWSSIP.2015.7313929","DOIUrl":"https://doi.org/10.1109/IWSSIP.2015.7313929","url":null,"abstract":"This paper deals with Free Space Optics (FSO) systems. Availability and reliability of FSO is taken under the scope. FSO links transmit data within the infrared wavelength region. Lasers are used to create medium to carry informational stream. FSO systems need Line of Sight (LOS) technology in order to maintain connection between two points. Weather conditions have quite significant impact on FSO operation in terms of availability and reliability because of free space transmission. It is necessary to evaluate the air quality at the actual geographical location where FSO link is located. It is important to determine the impact of a light scattering, absorption, turbulence and receiving power at the particular FSO link. Visibility has one of the most critical impacts on the quality of an optical transmission channel. Moreover, it is very helpful to monitor and store the information about rain, snow and fog values. This paper introduces a device which measures all mentioned weather indicators such as a fog density, a relative humidity and the temperature. FSO availability and reliability estimation is made from measured data. These results evaluate weather conditions for Kosice, Slovakia in term of FSO operation.","PeriodicalId":249021,"journal":{"name":"2015 International Conference on Systems, Signals and Image Processing (IWSSIP)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131927149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2015 International Conference on Systems, Signals and Image Processing (IWSSIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1