首页 > 最新文献

2014 XIX Symposium on Image, Signal Processing and Artificial Vision最新文献

英文 中文
Identification and cadastral registration of water bodies through multispectral image processing with multi-layer Perceptron Neural Network 基于多层感知器神经网络的多光谱图像识别与水体地籍登记
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010132
E. Dianderas, K. Rojas, G. Kemper
In this article is developed a technique that allows to calculate the presence of vegetation, glaciers and water bodies through multispectral image processing employing a Multi-layer Perceptron Neural Netwok, giving the option to discriminate the presence of lakes to generate the cadastral registration of these. The supervised classification that was implemented has a high level of robustness and reliability, since the validation of the data obtained at a geolocation level have a 0% of error and the parameters of the area and perimeter an approximate error of 10%.
本文开发了一种技术,该技术可以通过多层感知器神经网络的多光谱图像处理来计算植被、冰川和水体的存在,并提供了区分湖泊存在的选项,从而生成这些水体的地籍登记。所实现的监督分类具有很高的鲁棒性和可靠性,因为在地理位置级别上获得的数据的验证误差为0%,面积和周长参数的误差约为10%。
{"title":"Identification and cadastral registration of water bodies through multispectral image processing with multi-layer Perceptron Neural Network","authors":"E. Dianderas, K. Rojas, G. Kemper","doi":"10.1109/STSIVA.2014.7010132","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010132","url":null,"abstract":"In this article is developed a technique that allows to calculate the presence of vegetation, glaciers and water bodies through multispectral image processing employing a Multi-layer Perceptron Neural Netwok, giving the option to discriminate the presence of lakes to generate the cadastral registration of these. The supervised classification that was implemented has a high level of robustness and reliability, since the validation of the data obtained at a geolocation level have a 0% of error and the parameters of the area and perimeter an approximate error of 10%.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121109817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Evaluation and comparison using activity signals of speech methods in river plate spanish using BEPPA corpus 基于BEPPA语料库的河床西班牙语语音方法的活动信号评价与比较
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010159
Horderlin Vrangel Robles, V. Molina, Luis Martinez, Hermann Dávila
The results obtained after comparing several algorithms which use basic methods of signal processing for speech activity detection of voice or VAD (Voice Activity Detection-VAD), were assessed in order to determine their effectiveness. The algorithms presented in this article are short-time or spectral energy based endpoint detection algorithm, the zero crossing rate method, and the higher order differential (High Order Difference, HOD) method. First, an introduction of the concept of VAD is presented and the need to apply such language algorithms in River Plate is Spanish. Then a summary of the state of the art techniques and algorithms for detecting voice activity is shown with evidence and experiments used to implement algorithms with BEPPA corpus (Evaluation Battery for Patients with Auditive Prostheses, BEPPA - in Spanish).
在比较了几种使用基本信号处理方法进行语音活动检测或VAD (voice activity detection -VAD)的算法后,评估了它们的有效性。本文提出的算法有基于短时或谱能的端点检测算法、过零率法和高阶差分法。首先,介绍了VAD的概念,并指出在西班牙语中需要应用这种语言算法。然后总结了用于检测语音活动的最新技术和算法,并展示了使用BEPPA语料库(听力假肢患者评估电池,BEPPA -西班牙语)实现算法的证据和实验。
{"title":"Evaluation and comparison using activity signals of speech methods in river plate spanish using BEPPA corpus","authors":"Horderlin Vrangel Robles, V. Molina, Luis Martinez, Hermann Dávila","doi":"10.1109/STSIVA.2014.7010159","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010159","url":null,"abstract":"The results obtained after comparing several algorithms which use basic methods of signal processing for speech activity detection of voice or VAD (Voice Activity Detection-VAD), were assessed in order to determine their effectiveness. The algorithms presented in this article are short-time or spectral energy based endpoint detection algorithm, the zero crossing rate method, and the higher order differential (High Order Difference, HOD) method. First, an introduction of the concept of VAD is presented and the need to apply such language algorithms in River Plate is Spanish. Then a summary of the state of the art techniques and algorithms for detecting voice activity is shown with evidence and experiments used to implement algorithms with BEPPA corpus (Evaluation Battery for Patients with Auditive Prostheses, BEPPA - in Spanish).","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127471517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pattern recognition of hypernasality in voice of patients with Cleft and Lip Palate 唇腭裂患者鼻音异常的模式识别
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010187
Roger Gomez Nieto, J. I. Marin-Hurtado, Luis Miguel Capacho-Valbuena, Alexander Amaya Suarez, Elkyn Alexander Belalcazar Bolanos
The Cleft and Lip Palate (CLP) is a malformation with high recurrence in Colombia, which affects the ability of the phonation system, making difficult the effective communication of the patient. This research seeks to find patterns that enable to detect hypernasality without using invasive diagnostic methods. We performed an analysis of a large range of acoustic features to identify those capable of discriminating hypernasality. The analyzed features include: Teager energy operator (TEO), linear predictive coding (LPC), Mel Frequency Cepstral Coefficients (MFCC), Pitch, Jitter, Shimmer, and the first three formants together with the bandwidth of the first formant. With the correct configuration is achieved discriminant patterns classify 99 percent of patients hypernasal of the database with a false positive rate of less than 1 percent of healthy patients, which are promising results as a starting point for creating a tool for automatic noninvasive detection of hypernasality.
唇腭裂(Cleft and Lip Palate, CLP)是哥伦比亚一种复发率很高的畸形,它影响了发声系统的能力,使患者难以进行有效的沟通。本研究旨在寻找不使用侵入性诊断方法即可检测鼻音亢进的模式。我们进行了一个大范围的声学特征的分析,以确定那些能够区分高鼻音。分析的特征包括:Teager能量算子(TEO)、线性预测编码(LPC)、Mel频率倒谱系数(MFCC)、基音、抖动、闪烁、前三个共振峰以及第一共振峰的带宽。通过正确的配置,鉴别模式将数据库中99%的高鼻窦炎患者与不到1%的健康患者进行了分类,这是一个有希望的结果,可以作为创建自动无创检测高鼻窦炎工具的起点。
{"title":"Pattern recognition of hypernasality in voice of patients with Cleft and Lip Palate","authors":"Roger Gomez Nieto, J. I. Marin-Hurtado, Luis Miguel Capacho-Valbuena, Alexander Amaya Suarez, Elkyn Alexander Belalcazar Bolanos","doi":"10.1109/STSIVA.2014.7010187","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010187","url":null,"abstract":"The Cleft and Lip Palate (CLP) is a malformation with high recurrence in Colombia, which affects the ability of the phonation system, making difficult the effective communication of the patient. This research seeks to find patterns that enable to detect hypernasality without using invasive diagnostic methods. We performed an analysis of a large range of acoustic features to identify those capable of discriminating hypernasality. The analyzed features include: Teager energy operator (TEO), linear predictive coding (LPC), Mel Frequency Cepstral Coefficients (MFCC), Pitch, Jitter, Shimmer, and the first three formants together with the bandwidth of the first formant. With the correct configuration is achieved discriminant patterns classify 99 percent of patients hypernasal of the database with a false positive rate of less than 1 percent of healthy patients, which are promising results as a starting point for creating a tool for automatic noninvasive detection of hypernasality.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116530222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Facial expression analysis for emotion recognition using kernel methods and statistical models 基于核方法和统计模型的面部表情分析及其情感识别
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010188
Hernán García, C. A. Torres, Jorge Ivan Marin Hurtado
In this paper we present our framework for facial expression analysis using static models and kernel methods for classification. We describe the characterization methodology from parametric model. Also quantitatively evaluated the accuracy for feature detection and estimation of the parameters associated with facial expressions, analyzing its robustness to variations in pose. Then, a methodology of emotion characterization is introduced to perform the recognition. Furthermore, a cascade classifiers using kernel methods it is performed for emotion recognition. The experimental results show that the proposed model can effectively detect the different facial expressions. The model used and characterization methodology showed efficient to detect the emotion type in 93.4% of the cases.
在本文中,我们提出了一个使用静态模型和核方法进行分类的面部表情分析框架。我们从参数模型描述了表征方法。还定量评估了特征检测和面部表情相关参数估计的准确性,分析了其对姿态变化的鲁棒性。然后,引入了一种情感表征方法来进行识别。在此基础上,提出了一种基于核方法的级联分类器用于情感识别。实验结果表明,该模型能够有效地检测不同的面部表情。所使用的模型和表征方法有效地检测了93.4%的病例的情绪类型。
{"title":"Facial expression analysis for emotion recognition using kernel methods and statistical models","authors":"Hernán García, C. A. Torres, Jorge Ivan Marin Hurtado","doi":"10.1109/STSIVA.2014.7010188","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010188","url":null,"abstract":"In this paper we present our framework for facial expression analysis using static models and kernel methods for classification. We describe the characterization methodology from parametric model. Also quantitatively evaluated the accuracy for feature detection and estimation of the parameters associated with facial expressions, analyzing its robustness to variations in pose. Then, a methodology of emotion characterization is introduced to perform the recognition. Furthermore, a cascade classifiers using kernel methods it is performed for emotion recognition. The experimental results show that the proposed model can effectively detect the different facial expressions. The model used and characterization methodology showed efficient to detect the emotion type in 93.4% of the cases.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127751166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual core architecture on FPGA as an integrated solution in the development of Telemedicine-capable devices 基于FPGA的双核架构作为开发远程医疗设备的集成解决方案
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010128
Jose P. Pinilla, S. Plata, Oscar Alberto Mantilla, L. A. Rodriguez
Medical devices response is highly sensitive because of the dependence of human lives on them, thus common problems in electronics like synchronization and delays are often targeted to be reduced to its bare minimum. The use of FPGA ICs as co-processing slave modules for CPU chips in embedded systems is a common approach to solve these issues, not only among commercial medical equipment, but also in different industry sectors. This paper takes into account Medical Devices such as (Vital Signs Monitors) VSMs using this topology. Furthermore, a project at (Fundación Cardiovascular de Colombia) FCV to upgrade the PC-based systems used for Telemedicine services in Colombia, takes FPGAs as an integrated solution for an embedded system, achieving the replacement of the CPU and additional integrated circuits, with softcore architectures, IP cores, and hardware modules description. Moreover, critical applications such as medical devices need characteristics like real-time response and high accuracy in order to make health care personnel able to respond appropriately in case of an event. A working prototype, based on a Dual-Core architecture using the Nios II softcore processor running uC/OSII along with Video IP cores and custom hardware description for a 22kLE FPGA has been designed and implemented to create a VSM embedded system, with similar characteristics and behaviour to a commercial device, plus the advantages of higher integration, lower power consumption, WLAN, and WWAN connectivity.
由于人类生命对医疗设备的依赖,医疗设备的响应是高度敏感的,因此电子设备中常见的问题,如同步和延迟,往往被目标减少到最低限度。在嵌入式系统中,使用FPGA作为CPU芯片的协同处理从模块是解决这些问题的常用方法,不仅适用于商业医疗设备,也适用于不同的工业部门。本文考虑了使用这种拓扑的医疗设备,如(生命体征监视器)vsm。此外,(Fundación Cardiovascular de Colombia) FCV的一个项目旨在升级哥伦比亚用于远程医疗服务的基于pc的系统,该项目将fpga作为嵌入式系统的集成解决方案,通过软核架构、IP核和硬件模块描述实现了CPU和额外集成电路的替换。此外,医疗设备等关键应用需要实时响应和高精度等特性,以便医疗保健人员能够在发生事件时做出适当的响应。一个基于双核架构的工作原型,使用运行uC/OSII的Nios II软核处理器以及视频IP核和22kLE FPGA的定制硬件描述,设计并实现了一个VSM嵌入式系统,具有与商用设备相似的特性和行为,以及更高集成,更低功耗,WLAN和WWAN连接的优势。
{"title":"Dual core architecture on FPGA as an integrated solution in the development of Telemedicine-capable devices","authors":"Jose P. Pinilla, S. Plata, Oscar Alberto Mantilla, L. A. Rodriguez","doi":"10.1109/STSIVA.2014.7010128","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010128","url":null,"abstract":"Medical devices response is highly sensitive because of the dependence of human lives on them, thus common problems in electronics like synchronization and delays are often targeted to be reduced to its bare minimum. The use of FPGA ICs as co-processing slave modules for CPU chips in embedded systems is a common approach to solve these issues, not only among commercial medical equipment, but also in different industry sectors. This paper takes into account Medical Devices such as (Vital Signs Monitors) VSMs using this topology. Furthermore, a project at (Fundación Cardiovascular de Colombia) FCV to upgrade the PC-based systems used for Telemedicine services in Colombia, takes FPGAs as an integrated solution for an embedded system, achieving the replacement of the CPU and additional integrated circuits, with softcore architectures, IP cores, and hardware modules description. Moreover, critical applications such as medical devices need characteristics like real-time response and high accuracy in order to make health care personnel able to respond appropriately in case of an event. A working prototype, based on a Dual-Core architecture using the Nios II softcore processor running uC/OSII along with Video IP cores and custom hardware description for a 22kLE FPGA has been designed and implemented to create a VSM embedded system, with similar characteristics and behaviour to a commercial device, plus the advantages of higher integration, lower power consumption, WLAN, and WWAN connectivity.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127883879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OpenRISC-based System-on-Chip for digital signal processing 基于openrisc的单片系统用于数字信号处理
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010123
A. López-Parrado, Juan-Camilo Valderrama-Cuervo
This paper presents the design and implementation of an OpenRISC-based System-on-Chip (SoC), which is composed of hardware cores implementing the Digital Signal Processing (DSP) functions: Finite Impulse Response (FIR) filter, Infinite Impulse Response (IIR) filter and Fast Fourier Transform (FFT). The FIR-filter core is based on the transpose realization form, the IIR-filter core is based on the Second Order Sections (SOS) architecture and the FFT core is based on the Radix 22 Single Delay Feedback (R22SDF) architecture. The three cores are compatible with the Wishbone SoC bus, and they were described using generic and structural VHDL. In-system hardware verification was performed by using an OpenRisc-based SoC synthesized on an Altera FPGA. Tests showed that the designed DSP cores are suitable for building SoC based on the OpenRisc processor and the Wishbone bus.
本文介绍了基于openrisc的片上系统(SoC)的设计和实现,该系统由实现数字信号处理(DSP)功能的硬件核心组成:有限脉冲响应(FIR)滤波器,无限脉冲响应(IIR)滤波器和快速傅里叶变换(FFT)。fir -滤波器核心基于转置实现形式,iir -滤波器核心基于二阶分段(SOS)架构,FFT核心基于Radix 22单延迟反馈(R22SDF)架构。这三个核心与Wishbone SoC总线兼容,并使用通用和结构VHDL进行描述。系统内硬件验证是通过在Altera FPGA上合成的基于openrisc的SoC进行的。测试表明,所设计的DSP内核适合于基于OpenRisc处理器和Wishbone总线构建SoC。
{"title":"OpenRISC-based System-on-Chip for digital signal processing","authors":"A. López-Parrado, Juan-Camilo Valderrama-Cuervo","doi":"10.1109/STSIVA.2014.7010123","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010123","url":null,"abstract":"This paper presents the design and implementation of an OpenRISC-based System-on-Chip (SoC), which is composed of hardware cores implementing the Digital Signal Processing (DSP) functions: Finite Impulse Response (FIR) filter, Infinite Impulse Response (IIR) filter and Fast Fourier Transform (FFT). The FIR-filter core is based on the transpose realization form, the IIR-filter core is based on the Second Order Sections (SOS) architecture and the FFT core is based on the Radix 22 Single Delay Feedback (R22SDF) architecture. The three cores are compatible with the Wishbone SoC bus, and they were described using generic and structural VHDL. In-system hardware verification was performed by using an OpenRisc-based SoC synthesized on an Altera FPGA. Tests showed that the designed DSP cores are suitable for building SoC based on the OpenRisc processor and the Wishbone bus.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126208218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An image retrieval system for tomato disease assessment 番茄病害评估图像检索系统
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010156
Douglas Baquero, Juan F. Molina, R. Gil, C. Bojacá, Hugo Franco, Francisco Gomez
Tomato represents an important vegetable crop worldwide. During cropping cycle several diseases and abnormal conditions may affect tomato plants resulting on considerable losses of production. A precise identification of these pathologies in early phases is fundamental for the implementation of control strategies. Nevertheless, the right identification of symptoms of plants diseases require highly specialized knowledge and facilities, which are not available for small growers. Recently, computer vision tools have been proposed as an alternative for tomato diseases characterization. These works mainly focus on identification of affected regions and classification tasks. Nevertheless, non-specialists may lack of clarity about what they are looking for during the assessment. In these cases, Content Based Image Retrieval (CBIR) systems can be helpful as a complementary strategy to improve the quality of the search by allowing exploration of databases with supplementary information. This work presents a novel strategy for image retrieval of tomato leaves for greenhouse crops suitable to support disease diagnosis. The strategy is based on color structure descriptors and nearest neighbors. Experimental results show that the proposed approach can successfully characterize in several abnormal conditions, such as, chlorosis, sooty moulds and early blight.
番茄是世界上重要的蔬菜作物。在种植周期中,一些病害和异常条件会影响番茄植株,造成相当大的生产损失。在早期阶段准确识别这些病理是实施控制策略的基础。然而,正确识别植物病害的症状需要高度专业化的知识和设施,这是小型种植者所不具备的。最近,计算机视觉工具被提出作为番茄疾病表征的替代方法。这些工作主要集中在识别受影响区域和分类任务上。然而,非专业人士可能不清楚他们在评估过程中寻找的是什么。在这些情况下,基于内容的图像检索(CBIR)系统可以作为一种补充策略,通过允许使用补充信息探索数据库来提高搜索质量。本文提出了一种适合于支持病害诊断的温室作物番茄叶片图像检索的新策略。该策略基于颜色结构描述符和最近邻。实验结果表明,该方法可以成功地对黄萎病、烟霉病和早疫病等多种异常情况进行表征。
{"title":"An image retrieval system for tomato disease assessment","authors":"Douglas Baquero, Juan F. Molina, R. Gil, C. Bojacá, Hugo Franco, Francisco Gomez","doi":"10.1109/STSIVA.2014.7010156","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010156","url":null,"abstract":"Tomato represents an important vegetable crop worldwide. During cropping cycle several diseases and abnormal conditions may affect tomato plants resulting on considerable losses of production. A precise identification of these pathologies in early phases is fundamental for the implementation of control strategies. Nevertheless, the right identification of symptoms of plants diseases require highly specialized knowledge and facilities, which are not available for small growers. Recently, computer vision tools have been proposed as an alternative for tomato diseases characterization. These works mainly focus on identification of affected regions and classification tasks. Nevertheless, non-specialists may lack of clarity about what they are looking for during the assessment. In these cases, Content Based Image Retrieval (CBIR) systems can be helpful as a complementary strategy to improve the quality of the search by allowing exploration of databases with supplementary information. This work presents a novel strategy for image retrieval of tomato leaves for greenhouse crops suitable to support disease diagnosis. The strategy is based on color structure descriptors and nearest neighbors. Experimental results show that the proposed approach can successfully characterize in several abnormal conditions, such as, chlorosis, sooty moulds and early blight.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121961192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Comparative analysis of physiological signals and electroencephalogram (EEG) for multimodal emotion recognition using generative models 基于生成模型的多模态情绪识别生理信号与脑电图的对比分析
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010181
Cristian A. Torres-Valencia, Hernan F. Garcia-Arias, Mauricio A. Alvarez Lopez, A. Orozco-Gutierrez
Multimodal Emotion recognition (MER) is an application of machine learning were different biological signals are used in order to automatically classify a determined affective state. MER systems has been developed for different type of applications from psychological evaluation, anxiety assessment, human-machine interfaces and marketing. There are several spaces of classification proposed in the state of art for the emotion recognition task, the most known are discrete and dimensional spaces were the emotions are described in terms of some basic emotions and latent dimensions respectively. The use of dimensional spaces of classification allows a higher range of emotional states to be analyzed. The most common dimensional space used for this purpose is the Arousal/Valence space were emotions are described in terms of the intensity of the emotion that goes from inactive to active in the arousal dimension, and from unpleasant to pleasant in the valence dimension. The use of physiological signals and the EEG is well suited for emotion recognition due to the fact that an emotional states generates responses from different biological systems of the human body. Since the expression of an emotion is a dynamic process, we propose the use of generative models as Hidden Markov Models (HMM) to capture de dynamics of the signals for further classification of emotional states in terms of arousal and valence. For the development of this work an international database for emotion classification known as Dataset for Emotion Analysis using Physiological signals (DEAP) is used. The objective of this work is to determine which of the physiological and EEG signals brings more relevant information in the emotion recognition task, several experiments using HMMs from different signals and combinations of them are performed, and the results shows that some of those signals brings more discrimination between arousal and valence levels as the EEG and the Galvanic Skin Response (GSR) and the Heart rate (HR).
多模态情感识别(MER)是机器学习的一种应用,它使用不同的生物信号来自动分类确定的情感状态。市场营销系统已被开发用于心理评估、焦虑评估、人机界面和市场营销等不同类型的应用。目前提出了几种用于情绪识别任务的分类空间,其中最著名的是离散空间和维度空间,分别用一些基本情绪和潜在维度来描述情绪。使用维度空间的分类允许分析更高范围的情绪状态。用于此目的的最常见的维度空间是唤醒/效价空间,在这个空间中,情绪是根据情绪的强度来描述的,在唤醒维度中,情绪从不活跃到活跃,在效价维度中,情绪从不愉快到愉快。由于情绪状态会从人体的不同生物系统中产生反应,因此生理信号和脑电图的使用非常适合于情绪识别。由于情绪的表达是一个动态过程,我们建议使用生成模型作为隐马尔可夫模型(HMM)来捕获信号的非动态,以便根据唤醒和价态进一步分类情绪状态。为了开展这项工作,使用了一个国际情绪分类数据库,即使用生理信号进行情绪分析数据集(DEAP)。本研究的目的是确定生理和脑电图信号中哪一个在情绪识别任务中带来更多的相关信息,使用不同信号及其组合的hmm进行了几个实验,结果表明,一些信号如脑电图和皮肤电反应(GSR)和心率(HR)在唤醒和价态水平之间带来了更多的区别。
{"title":"Comparative analysis of physiological signals and electroencephalogram (EEG) for multimodal emotion recognition using generative models","authors":"Cristian A. Torres-Valencia, Hernan F. Garcia-Arias, Mauricio A. Alvarez Lopez, A. Orozco-Gutierrez","doi":"10.1109/STSIVA.2014.7010181","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010181","url":null,"abstract":"Multimodal Emotion recognition (MER) is an application of machine learning were different biological signals are used in order to automatically classify a determined affective state. MER systems has been developed for different type of applications from psychological evaluation, anxiety assessment, human-machine interfaces and marketing. There are several spaces of classification proposed in the state of art for the emotion recognition task, the most known are discrete and dimensional spaces were the emotions are described in terms of some basic emotions and latent dimensions respectively. The use of dimensional spaces of classification allows a higher range of emotional states to be analyzed. The most common dimensional space used for this purpose is the Arousal/Valence space were emotions are described in terms of the intensity of the emotion that goes from inactive to active in the arousal dimension, and from unpleasant to pleasant in the valence dimension. The use of physiological signals and the EEG is well suited for emotion recognition due to the fact that an emotional states generates responses from different biological systems of the human body. Since the expression of an emotion is a dynamic process, we propose the use of generative models as Hidden Markov Models (HMM) to capture de dynamics of the signals for further classification of emotional states in terms of arousal and valence. For the development of this work an international database for emotion classification known as Dataset for Emotion Analysis using Physiological signals (DEAP) is used. The objective of this work is to determine which of the physiological and EEG signals brings more relevant information in the emotion recognition task, several experiments using HMMs from different signals and combinations of them are performed, and the results shows that some of those signals brings more discrimination between arousal and valence levels as the EEG and the Galvanic Skin Response (GSR) and the Heart rate (HR).","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124663926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Automatic detection of early blight infection on tomato crops using a color based classification strategy 基于颜色分类策略的番茄早疫病自动检测
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010166
Juan F. Molina, R. Gil, C. Bojacá, Francisco Gomez, Hugo Franco
This work presents a Computer Vision prototype strategy for the automatic detection of mycotic infections on tomato crops. This Computer Vision method is based on the characterization of tomato leaflets (both healthy and early blight-infected regions of interest - ROIs) by color description (MPEG-7 standard descriptors). A small size ROI collection manually annotated by experts is used for both training and testing of a simple classifier (1-NN). The performance of each descriptor under study (Color Structure Descriptor, CSD; Color Layout descriptor, CLD; and Scalable Color Descriptor, SCD) is analysed by a nested-leave-one-out cross validation. The inner loop permits a individual descriptor configuration evaluation, while the outer loop yields an average performance comparison between different descriptors. Our results show that CSD had a better performance than SCD and CLD.
本文提出了一种用于番茄真菌感染自动检测的计算机视觉原型策略。这种计算机视觉方法是基于通过颜色描述(MPEG-7标准描述符)对番茄小叶(健康和早期枯萎病感染的感兴趣区域- roi)进行表征。由专家手动标注的小型ROI集合用于简单分类器(1-NN)的训练和测试。所研究的每个描述子的性能(颜色结构描述子,CSD;颜色布局描述符,CLD;和可伸缩颜色描述符(SCD)通过嵌套留一交叉验证进行分析。内部循环允许单独的描述符配置评估,而外部循环产生不同描述符之间的平均性能比较。结果表明,CSD比SCD和CLD具有更好的性能。
{"title":"Automatic detection of early blight infection on tomato crops using a color based classification strategy","authors":"Juan F. Molina, R. Gil, C. Bojacá, Francisco Gomez, Hugo Franco","doi":"10.1109/STSIVA.2014.7010166","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010166","url":null,"abstract":"This work presents a Computer Vision prototype strategy for the automatic detection of mycotic infections on tomato crops. This Computer Vision method is based on the characterization of tomato leaflets (both healthy and early blight-infected regions of interest - ROIs) by color description (MPEG-7 standard descriptors). A small size ROI collection manually annotated by experts is used for both training and testing of a simple classifier (1-NN). The performance of each descriptor under study (Color Structure Descriptor, CSD; Color Layout descriptor, CLD; and Scalable Color Descriptor, SCD) is analysed by a nested-leave-one-out cross validation. The inner loop permits a individual descriptor configuration evaluation, while the outer loop yields an average performance comparison between different descriptors. Our results show that CSD had a better performance than SCD and CLD.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132467048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A computational architecture for discrete wavelet transform using lifting scheme 基于提升格式的离散小波变换计算体系
Pub Date : 2014-09-01 DOI: 10.1109/STSIVA.2014.7010160
Fabian Sanchez, Carlos A. Fajardo, Carlos A. Angulo, Oscar M. Reyes, C. Bouman
The Discrete Wavelet Transform (DWT) is an important technique for signal analysis, compressing and denoising due to its excellent locality in the time-frequency domain. The DWT is developed by convolutions which demand both a large number of mathematical operations and a large amount of storage. The lifting scheme reduces both computational and storage requirements. We have developed a computational architecture for inverse DWT using the lifting scheme. The design was developed in VHDL and then implemented into a Virtex 5 FPGA. We aim to reach a high throughput and reduce the design area. The architecture takes 3L + N(1-0.5L) clock cycles to compute L levels of 1D reconstruction for data of size N. Some comparisons suggest that our work could be faster than previous works.
离散小波变换(DWT)由于其在时频域中优异的局域性,是一种重要的信号分析、压缩和去噪技术。DWT是由需要大量数学运算和大量存储空间的卷积发展而来的。提升方案减少了计算和存储需求。我们已经开发了一个使用提升方案的反DWT计算架构。该设计是用VHDL语言开发的,然后在Virtex 5 FPGA上实现。我们的目标是达到高吞吐量和减少设计面积。对于大小为N的数据,该架构需要3L + N(1-0.5L)个时钟周期来计算L个级别的1D重建。一些比较表明,我们的工作可能比以前的工作更快。
{"title":"A computational architecture for discrete wavelet transform using lifting scheme","authors":"Fabian Sanchez, Carlos A. Fajardo, Carlos A. Angulo, Oscar M. Reyes, C. Bouman","doi":"10.1109/STSIVA.2014.7010160","DOIUrl":"https://doi.org/10.1109/STSIVA.2014.7010160","url":null,"abstract":"The Discrete Wavelet Transform (DWT) is an important technique for signal analysis, compressing and denoising due to its excellent locality in the time-frequency domain. The DWT is developed by convolutions which demand both a large number of mathematical operations and a large amount of storage. The lifting scheme reduces both computational and storage requirements. We have developed a computational architecture for inverse DWT using the lifting scheme. The design was developed in VHDL and then implemented into a Virtex 5 FPGA. We aim to reach a high throughput and reduce the design area. The architecture takes 3L + N(1-0.5L) clock cycles to compute L levels of 1D reconstruction for data of size N. Some comparisons suggest that our work could be faster than previous works.","PeriodicalId":114554,"journal":{"name":"2014 XIX Symposium on Image, Signal Processing and Artificial Vision","volume":"140 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130818947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2014 XIX Symposium on Image, Signal Processing and Artificial Vision
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1