2014 International Conference on Orange Technologies: Latest Papers

Applying PAD three dimensional emotion model to convert prosody of emotional speech
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956606
Xiaoyong Lu, Hongwu Yang, Aibao Zhou
Happiness has attracted much attention from researchers in various fields. This paper realizes prosodic conversion of emotional speech for happiness computing in speech communication. An emotional speech corpus that includes 11 kinds of typical emotional utterances is designed, in which each utterance is labeled with emotional information as a PAD value in the psychological sense. A five-scale tone model is employed to model the pitch contour of emotional utterances at the syllable level. A prosody conversion model based on a generalized regression neural network (GRNN) is built to transform the pitch contour, duration and pause duration of an emotional utterance, with the PAD values of the emotion and context parameters used to predict the prosodic features. The emotional utterance is then re-synthesized with the STRAIGHT algorithm by modifying pitch contour, duration and pause duration. Experimental results on the Emotional Mean Opinion Score (EMOS) demonstrate that the prosody conversion of the proposed method can express the corresponding feelings.
Citations: 0
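As a rough illustration of the GRNN step named in the prosody-conversion abstract above: a GRNN predicts a target as a Gaussian-kernel-weighted average of the training targets. The sketch below is not the authors' model. The function name, the smoothing width `sigma`, and the toy PAD-to-pitch-scale data are all assumptions made for illustration.

```python
import math


def grnn_predict(x, train_x, train_y, sigma=0.5):
    """Predict a scalar target for x as a kernel-weighted average (GRNN)."""
    weights = []
    for xi in train_x:
        # Squared Euclidean distance between the query and a stored pattern.
        dist2 = sum((a - b) ** 2 for a, b in zip(x, xi))
        weights.append(math.exp(-dist2 / (2.0 * sigma ** 2)))
    total = sum(weights)
    if total == 0.0:
        # Query is numerically far from every pattern: fall back to the mean.
        return sum(train_y) / len(train_y)
    return sum(w * y for w, y in zip(weights, train_y)) / total


# Hypothetical toy data: (P, A, D) values mapped to a pitch scaling factor.
train_x = [(-1.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
train_y = [0.8, 1.2]
print(grnn_predict((0.0, 0.0, 0.0), train_x, train_y))
```

A query halfway between the two stored patterns receives equal weights, so the prediction is the mean of the two targets. A smaller `sigma` makes the prediction follow the nearest pattern more closely.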
Optimized radix-2 FFT and Mel-filter bank in MFCC-based events sound recognition chip design for active smart warming care
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956633
Ta-Wen Kuan, Jhing-Fa Wang, Tsai Shang-Hung
The paper proposes a first sound chip design for recognizing security-sensitive event sounds, extending the interaction of Orange warming care from human-to-human to environment-to-human perception. The proposed chip can be embedded in smart sensors or home appliances to detect event sounds in the surroundings, so that elderly people or children who live alone receive timely care and assistance is called for actively. To realize a chip with high accuracy, small area and low power dissipation, several MFCC sub-modules, including the radix-2 FFT and the Mel-filter bank, are optimized for chip design to reach the required characteristics. In simulation, the proposed MFCC with k-NN framework achieves higher recognition accuracy than LPCC and MP features with a k-NN classifier. For chip realization, the optimized MFCC sub-modules improve hardware resource utilization; the chip is designed and simulated in Verilog and synthesized with the TSMC 90nm library.
Citations: 2
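The two MFCC sub-modules named in the chip-design abstract above can be sketched in software as a reference model. This is only an illustration, not the optimized hardware design: the recursive structure, the function names, and the mel formula (the common `2595 * log10(1 + f / 700)` convention) are assumptions, since the abstract does not give these details.

```python
import cmath
import math


def fft_radix2(x):
    """Recursive radix-2 decimation-in-time FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return [complex(x[0])]
    even = fft_radix2(x[0::2])
    odd = fft_radix2(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        # Butterfly: combine the two half-size transforms with a twiddle factor.
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def hz_to_mel(f_hz):
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_low, f_high):
    """Centre frequencies in Hz of n_filters filters equally spaced in mel."""
    lo, hi = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_filters)]
```

Equal spacing on the mel scale places the filters densely at low frequencies and sparsely at high frequencies. A chip implementation would typically replace the floating-point arithmetic here with fixed-point butterflies.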
A new strategy for improving the self-positioning precision of an autonomous mobile robot
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956605
An Zhanfu, Pei Dong, Yong HongWu, Wang Quanzhou
We address the problem of precise self-positioning of an autonomous mobile robot. The problem is formulated as a manifold perception algorithm in which the precise position of a mobile robot is evaluated based on the distance from an obstacle, critical features or signs of the surroundings, and the depth of images of the surroundings. We propose to localize a mobile robot accurately using an algorithm that fuses, with variational weights, the local plane coordinate information obtained from laser ranging and the spatial visual information represented by features of a depth image, so that the local distance information from laser ranging and the depth vision information complement each other. First, we apply the EKF algorithm to the data gathered by the laser to get a coarse location of the robot. We then switch on an RGB-D camera to capture depth images and extract SURF features from them; when the features are matched with training examples, the RANSAC algorithm is used to check the consistency of spatial structures. Finally, extensive experiments show that our fusion method significantly improves localization accuracy compared with using either EKF on laser data or SURF feature matching on depth images alone. In particular, experiments with variational fusion weights demonstrate that with this method our robot is capable of precise self-localization in real time.
Citations: 2
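The self-positioning abstract above does not say how its fusion weights are computed. As a stand-in, the sketch below combines a laser-based and a vision-based position estimate with inverse-variance weights, a common way to let the more reliable source dominate. The weighting rule, the function name, and the variance inputs are assumptions and should not be read as the authors' method.

```python
def fuse_positions(laser_xy, laser_var, vision_xy, vision_var):
    """Inverse-variance weighted fusion of two 2-D position estimates.

    Returns the fused (x, y) tuple and the weight given to the laser estimate.
    """
    if laser_var <= 0 or vision_var <= 0:
        raise ValueError("variances must be positive")
    inv_laser = 1.0 / laser_var
    inv_vision = 1.0 / vision_var
    # The estimate with the smaller variance receives the larger weight.
    w_laser = inv_laser / (inv_laser + inv_vision)
    w_vision = 1.0 - w_laser
    fused = tuple(w_laser * l + w_vision * v
                  for l, v in zip(laser_xy, vision_xy))
    return fused, w_laser
```

With equal variances the result is the midpoint of the two estimates. As the vision variance grows, the fused position moves toward the laser estimate.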
Detecting the needs for happiness and meaning in life from google books
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956620
Lin Qiu, Jiahui Lu, C. Chiu
Research has shown that subjective well-being has two related but distinct dimensions, eudaimonic well-being and hedonic well-being. Hedonic well-being refers to one's overall positive affective experiences, while eudaimonic well-being is related to having a meaningful and noble purpose in life. While people strive to have a happy and meaningful life, their motivations can be influenced by socio-economic conditions and contexts. In this study, we analyzed word frequencies in the Google Books corpus to measure the changing needs for eudaimonic and hedonic well-being and their relationships with economic growth. Results show that the frequencies of words related to hedonic well-being decrease over the years while those related to eudaimonic well-being increase. Furthermore, when people are poor, their motivation for hedonic well-being is relatively high. The hedonic motivational strength dramatically decreases and becomes stable when income reaches a certain level. In contrast, people have relatively low motivation for eudaimonic well-being when they are poor. The eudaimonic motivational strength dramatically increases and becomes stable when income reaches a certain level. Our study demonstrates an example of measuring subjective well-being through analysis of digital media.
Citations: 0
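The word-frequency measurement described in the Google Books abstract above can be sketched as the share of tokens per year that fall in a chosen word list. The lexicons, the function names, and the per-year token lists below are hypothetical; the abstract does not list the words used, and the real corpus is distributed as yearly n-gram counts rather than raw tokens.

```python
from collections import Counter

# Hypothetical lexicons, for illustration only (entries must be lowercase).
HEDONIC = {"pleasure", "fun"}
EUDAIMONIC = {"meaning", "purpose"}


def relative_frequency(tokens, lexicon):
    """Share of tokens that belong to the lexicon (0.0 for empty input)."""
    if not tokens:
        return 0.0
    counts = Counter(t.lower() for t in tokens)
    hits = sum(counts[w] for w in lexicon)
    return hits / len(tokens)


def yearly_trend(corpus_by_year, lexicon):
    """Map each year to the relative frequency of lexicon words in that year."""
    return {year: relative_frequency(tokens, lexicon)
            for year, tokens in sorted(corpus_by_year.items())}
```

Normalizing by the total token count of each year keeps the trend comparable across years with very different amounts of published text.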
Enhancement of lightning electric field signals using empirical mode decomposition method
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956604
Huo Yuanlian, Qiao Yongfeng
In this paper, a new denoising approach for lightning electric field signals is applied, based on noise reduction algorithms in empirical mode decomposition (EMD), which is widely used for analyzing nonlinear and nonstationary data. Data from simulation and measurements were analyzed to evaluate this method in comparison with the traditional FIR low-pass filter. The results show that the EMD-based denoising methods provide very good results for lightning electric field signals and are effective and superior to the FIR filter method.
Citations: 3
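The baseline that the lightning-signal abstract above compares against, an FIR low-pass filter, can be sketched as follows. This covers only the baseline, not the EMD method, and the design choices (a Hamming-windowed sinc, unit DC gain, the function names) are assumptions, since the abstract does not specify the filter.

```python
import math


def lowpass_fir_taps(num_taps, cutoff):
    """Hamming-windowed sinc low-pass taps, normalized to unit DC gain.

    cutoff is a fraction of the sampling rate and must satisfy 0 < cutoff < 0.5.
    """
    if num_taps < 3 or num_taps % 2 == 0:
        raise ValueError("num_taps must be odd and at least 3")
    if not 0.0 < cutoff < 0.5:
        raise ValueError("cutoff must lie in (0, 0.5)")
    mid = (num_taps - 1) // 2
    taps = []
    for n in range(num_taps):
        k = n - mid
        if k == 0:
            ideal = 2.0 * cutoff
        else:
            ideal = math.sin(2.0 * math.pi * cutoff * k) / (math.pi * k)
        window = 0.54 - 0.46 * math.cos(2.0 * math.pi * n / (num_taps - 1))
        taps.append(ideal * window)
    gain = sum(taps)
    return [t / gain for t in taps]


def fir_filter(signal, taps):
    """Causal FIR filtering; samples before the start are treated as zero."""
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, h in enumerate(taps):
            if i - j >= 0:
                acc += h * signal[i - j]
        out.append(acc)
    return out
```

A fixed low-pass filter removes everything above its cutoff regardless of the signal, which is one reason an adaptive decomposition such as EMD may do better on nonstationary data.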
Automatic emotion variation detection using multi-scaled sliding window
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956642
Yuchao Fan, Mingxing Xu, Zhiyong Wu, Lianhong Cai
Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotionally salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using a Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window shift by fusing the decisions of all the sliding windows containing the shift. First, a fixed-length sliding window is introduced as the basic procedure, in which several different fusion methods are investigated. Then a multi-scaled sliding window is employed to support multiple classifiers with different timescale features, for which another two fusion strategies are provided. Finally, a postprocessing step is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that the proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.
Citations: 1
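The core idea in the emotion-variation abstract above, labeling each window shift from the decisions of every sliding window that contains it, can be sketched with a simple majority vote. The abstract mentions several fusion methods without naming them, so majority voting, the function names, and the indexing convention (window `j` covers shifts `j` to `j + win_len - 1`) are assumptions.

```python
from collections import Counter


def votes_for_shift(s, n_shifts, win_len, window_labels):
    """Decisions of all windows of length win_len (in shifts) that contain shift s."""
    n_windows = n_shifts - win_len + 1
    if n_windows < 1 or len(window_labels) != n_windows:
        raise ValueError("window_labels does not match n_shifts and win_len")
    first = max(0, s - win_len + 1)
    last = min(s, n_windows - 1)
    return window_labels[first:last + 1]


def fuse_multiscale(n_shifts, labels_by_scale):
    """Label each shift by majority vote across windows of every scale.

    labels_by_scale maps a window length (in shifts) to that scale's
    per-window decisions. Ties go to the label counted first.
    """
    result = []
    for s in range(n_shifts):
        votes = Counter()
        for win_len, labels in labels_by_scale.items():
            votes.update(votes_for_shift(s, n_shifts, win_len, labels))
        result.append(votes.most_common(1)[0][0])
    return result


# Hypothetical decisions from one classifier with 3-shift windows.
single = {3: ["neutral", "neutral", "angry", "angry"]}
print(fuse_multiscale(6, single))
```

Passing a single scale reproduces the fixed-length case; adding more entries to `labels_by_scale` lets classifiers with different window lengths vote on the same shift.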
Analysis of eye gaze points based on visual search
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6954665
Wang Jian, Zhao Xin-Bo
Recently there has been a lot of research on eye gaze points in naturalistic scenes. But in many of these studies, subjects are instructed to view scenes without any particular task. So how do gaze points differ in visual search? In this paper, we provide a detailed analysis of the eye gaze points from 11 subjects' eye movements when they performed a search task in 1307 images. The eye tracking data was analyzed in the following four aspects: agreement among subjects, center bias, difference of gaze points for each stimulus between target-present and target-absent stimuli, and the distribution of gaze points in the target-present image. The results of the analysis show that during visual search tasks, in which subjects are asked to find a particular target in a display, the target object plays a dominant role in the guidance of eye movements.
Citations: 1
Visual attention based visual vocabulary
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6954669
Ma Zhong, Zhao Xin-Bo
We aim to build a visual vocabulary by applying a model of visual attention. Concretely, we first learn a computational visual attention model from real eye tracking data. We then use this model to find the most salient regions in the images and extract features from these regions to build a visual vocabulary with more expressive power. An experiment was conducted to verify the effectiveness of the proposed visual attention based visual vocabulary. The results show that the proposed vocabulary boosts the performance of category recognition and outperforms the traditional one.
Citations: 0
Social determinants of mild cognitive impairment among the elderly — A case study of Taiwan
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6956621
Wen-Jen Hsieh, S. Tsai, Yuh-Tyng Chen, Jenq-daw Lee, Jun-jen Huang, Ignacio Jose Minambres Garcia
According to the World Health Organization (WHO) and Alzheimer's Disease International (ADI), there are at least 35.6 million people suffering from dementia in the world. Mild cognitive impairment (MCI) is considered a risk state or a prodromal stage of dementia. This paper aims to explore further the risk factors of mild cognitive impairment by analyzing the longitudinal data of three waves of surveys in 1999, 2003 and 2007 from the "Taiwan Longitudinal Study on Aging" (TLSA). The hierarchical linear model (HLM) is applied to analyze samples from the TLSA of Taiwan's elderly aged 65 and over in 1999. Empirical results suggest that the worsening of cognitive function differs among individuals, but clearly increases with age. Depressive symptoms show a statistically significant positive association, whereas educational attainment shows the opposite direction; gender, marital status, ethnicity, health behavior, and family support are all not statistically significant.
Citations: 0
Supervised segmentation of vasculature in retinal images using neural networks
Pub Date : 2014-11-20 DOI: 10.1109/ICOT.2014.6954694
Chen Ding, Yong Xia, Ying Li
This paper proposes a neural network based supervised segmentation algorithm for retinal vessel delineation. The histogram of each training image patch, together with its optimal threshold acquired by iteratively comparing the binarization result to the manual segmentation, is applied to a BP neural network to establish the correspondence between the intensity distribution and the optimal segmentation parameter. Finally, each test image can be segmented by using a number of local thresholds that are predicted by the trained neural network according to the histograms of image patches. The proposed algorithm has been evaluated on the DRIVE database, which contains forty retinal images with manually segmented vessel trees. Our results show that the proposed algorithm can effectively segment the vasculature in retinal images.
Citations: 21
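The training-data step in the retinal-segmentation abstract above pairs each patch's histogram with the threshold whose binarization best matches the manual segmentation. A minimal sketch of that search follows. The agreement measure (count of matching pixels), the polarity (a pixel brighter than the threshold counts as vessel), and the function names are assumptions; the abstract does not state them.

```python
def best_threshold(patch, manual_mask, candidates=range(256)):
    """Return the threshold whose binarization agrees most with the manual mask.

    patch is a flat list of grey levels, manual_mask a flat list of 0/1 labels.
    On ties the lowest candidate threshold is kept.
    """
    best_t, best_agree = None, -1
    for t in candidates:
        agree = sum(1 for p, m in zip(patch, manual_mask)
                    if (p > t) == bool(m))
        if agree > best_agree:
            best_t, best_agree = t, agree
    return best_t


def histogram(patch, bins=256):
    """Grey-level histogram of a flat patch with integer levels in [0, bins)."""
    h = [0] * bins
    for p in patch:
        h[p] += 1
    return h


# Hypothetical 4-pixel patch: the (histogram, threshold) pair would form
# one training sample for the network.
patch = [10, 20, 200, 220]
mask = [0, 0, 1, 1]
sample = (histogram(patch), best_threshold(patch, mask))
```

At test time no manual mask is available, so the network's job is to predict the threshold from the histogram alone.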