Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956606
Xiaoyong Lu, Hongwu Yang, Aibao Zhou
Happiness has attracted much attention from researchers in various fields. This paper realizes prosodic conversion of emotional speech for happiness computing in speech communication. An emotional speech corpus containing 11 kinds of typical emotional utterances is designed, in which each utterance is labeled with emotional information as PAD values in the psychological sense. A five-scale tone model is employed to model the pitch contour of emotional utterances at the syllable level. A generalized regression neural network (GRNN) based prosody conversion model is built to transform the pitch contour, duration and pause duration of an emotional utterance, in which the PAD values of the emotion and context parameters are used to predict the prosodic features. The emotional utterance is then re-synthesized with the STRAIGHT algorithm by modifying its pitch contour, duration and pause duration. Experimental results on the Emotional Mean Opinion Score (EMOS) demonstrate that the prosody converted by the proposed method can express the corresponding feelings.
{"title":"Applying PAD three dimensional emotion model to convert prosody of emotional speech","authors":"Xiaoyong Lu, Hongwu Yang, Aibao Zhou","doi":"10.1109/ICOT.2014.6956606","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956606","url":null,"abstract":"Happiness has attracted much attention of the researchers in various fields. This paper realizes prosodic conversion of emotional speech for happiness computing on speech communication. An emotional speech corpus includes 11 kinds of typical emotional utterances is designed, where each utterance is labeled the emotional information with PAD value in a psychological sense. A five-scale tone model is employed to model the pitch contour of emotional utterances on the syllable level. A generalized regression neural network (GRNN) based prosody conversion model is built to realize the transformation of pitch contour, duration and pause duration of emotional utterance, in which the PAD values of emotion and context parameter are adopted to predict the prosodic features. Emotional utterance is then re-synthesized with the STRAIGHT algorithm by modifying pitch contour, duration and pause duration. Experimental results on Emotional Mean Opining Score (EMOS) demonstrate that the prosody conversion effect of proposed method can express corresponding feelings.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"313 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122967847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956633
Ta-Wen Kuan, Jhing-Fa Wang, Tsai Shang-Hung
The paper proposes a first sound-chip design for security-sensitive event-sound recognition, extending the interaction of Orange warming care from human-to-human to environment-to-human perception. The proposed chip can be embedded in smart sensors or appliances at home to detect event sounds in the surroundings, providing timely care for the elderly or children who live alone and actively calling for assistance. To realize the chip with high recognition accuracy, small area and low power dissipation, several MFCC sub-modules, including the radix-2 FFT and the Mel-filter bank, are optimized to reach the required characteristics. In the simulation results, the proposed MFCC with k-NN framework achieves higher recognition accuracy than LPCC and MP features with a k-NN classifier. For chip realization, the optimized MFCC sub-modules improve hardware resource utilization; the chip is designed and simulated in Verilog and synthesized with a TSMC 90 nm library.
{"title":"Optimized radix-2 FFT and Mel-filter bank in MFCC-based events sound recognition chip design for active smart warming care","authors":"Ta-Wen Kuan, Jhing-Fa Wang, Tsai Shang-Hung","doi":"10.1109/ICOT.2014.6956633","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956633","url":null,"abstract":"The paper proposes a first sound chip design for security-sensitive event sounds recognition that extended the interaction of Orange warming care from human-to-human to environment-to-human perception. The proposed chip is fittingly embedded in smart sensors or appliances at home to surroundingly detect the event sounds, which can timely care the elderly or children who live alone thus actively call for assistance. In order to realize the chip in a high-accuracy performance, a small-size area and a low-power dissipation, the MFCC several sub-modules including, radix-2 FFT, Mel-filter bank etc are optimized for chip design to reach the required characteristics. In the simulation results, the proposed MFCC with k-NN framework performs the higher recognition accuracy than LPCC and MP features having k-NN classifier. For chip realization, the optimized MFCC sub-modules indeed improve the hardware resource utilization, where the chip is designed and simulated by verilog and synthesized by TSMC 90nm library.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123515154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956605
An Zhanfu, Pei Dong, Yong HongWu, Wang Quanzhou
We address the problem of precise self-positioning of an autonomous mobile robot. The problem is formulated as a manifold perception problem in which the precise position of the robot is evaluated from the distance to obstacles, critical features or signs in the surroundings, and the depth of the surrounding images. We propose to localize the robot accurately with an algorithm that fuses the local plane-coordinate information obtained from laser ranging with the spatial visual information represented by depth-image features, using variational weights so that the local distance information from laser ranging and the depth-vision information complement each other. First, we apply an EKF algorithm to the laser data to obtain a coarse location of the robot; then an RGB-D camera captures depth images, from which we extract SURF features, and when the features are matched with training examples the RANSAC algorithm is used to check the consistency of the spatial structures. Extensive experiments show that our fusion method significantly improves localization accuracy compared with using either the EKF on laser data or SURF feature matching on depth images alone. In particular, experiments with variational fusion weights demonstrate that the robot can localize itself precisely in real time.
{"title":"A new strategy for improving the self-positioning precision of an autonomous mobile robot","authors":"An Zhanfu, Pei Dong, Yong HongWu, Wang Quanzhou","doi":"10.1109/ICOT.2014.6956605","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956605","url":null,"abstract":"We address the problem of precise self-positioning of an autonomous mobile robot. This problem is formulated as a manifold perception algorithm such that the precision position of a mobile robot is evaluated based on the distance from an obstacle, critical features or signs of surroundings and the depth of its surrounding images. We propose to accurately localize the position of a mobile robot using an algorithm that fusing the local plane coordinates information getting from laser ranging and space visual information represented by features of a depth image with variational weights, by which the local distance information of laser ranging and depth vision information are relatively complemented. First, we utilize EKF algorithm on the data gathered by laser to get coarse location of a robot, then open RGB-D camera to capture depth images and we extract SURF features of images, when the features are matched with training examples, the RANSAC algorithm is used to check consistency of spatial structures. Finally, extensive experiments show that our fusion method has significantly improved location results of accuracy compared with the results using either EKF on laser data or SURF features matching on depth images. Especially, experiments with variational fusion weights demonstrated that with this method our robot was capable of accomplishing self-location precisely in real time.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125685887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956620
Lin Qiu, Jiahui Lu, C. Chiu
Research has shown that subjective well-being has two related but distinct dimensions: eudaimonic well-being and hedonic well-being. Hedonic well-being refers to one's overall positive affective experiences, while eudaimonic well-being is related to having a meaningful and noble purpose in life. While people strive for a happy and meaningful life, their motivations can be influenced by socio-economic conditions and contexts. In this study, we analyzed word frequencies in the Google Books corpus to measure the changing needs for eudaimonic and hedonic well-being and their relationships with economic growth. Results show that the frequencies of words related to hedonic well-being decrease over the years while those related to eudaimonic well-being increase. Furthermore, when people are poor, their motivation for hedonic well-being is relatively high; this hedonic motivational strength decreases dramatically and becomes stable once income reaches a certain level. In contrast, people have relatively low motivation for eudaimonic well-being when they are poor, and this eudaimonic motivational strength increases dramatically and becomes stable once income reaches a certain level. Our study demonstrates an example of measuring subjective well-being through the analysis of digital media.
{"title":"Detecting the needs for happiness and meaning in life from google books","authors":"Lin Qiu, Jiahui Lu, C. Chiu","doi":"10.1109/ICOT.2014.6956620","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956620","url":null,"abstract":"Research has shown that subjective well-being has two related but distinct dimensions, eudaimonic well-being and hedonic well-being. Hedonic well-being refers to one's overall positive affective experiences, while eudaimonic well-being is related to having a meaningful and noble purpose for life. While people are striving to have a happy and meaningful life, their motivations can be influenced by socio-economic conditions and contexts. In this study, we analyzed words frequencies in the Google Books corpus to measure the changing needs for eudaimonic and hedonic well-being and their relationships with economic growth. Results show that the frequencies of words related to hedonic well-being decrease while those related to eudaimonic well-being increase over the years. Furthermore, when people are poor, their motivation for hedonic well-being is relatively high. The hedonic motivational strength dramatically decreases and becomes stable when income reaches at a certain level. In contrast, people have relatively low motivation for eudaimonic well-being when they are poor. The eudaimonic motivational strength dramatically increases and becomes stable when income reaches at a certain level. Our study demonstrates an example of measuring subjective well-being through analysis of digital media.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128729009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956604
Huo Yuanlian, Qiao Yongfeng
In this paper, a new denoising approach for lightning electric field signals is presented, based on noise-reduction algorithms built on empirical mode decomposition (EMD), a method widely used for analyzing nonlinear and non-stationary data. Data from simulations and measurements were analyzed to evaluate the method against a traditional FIR low-pass filter. The results show that the EMD-based denoising methods give very good results for lightning electric field signals and are effective and superior to the FIR filter method.
{"title":"Enhancement of lightning electric field signals using empirical mode decomposition method","authors":"Huo Yuanlian, Qiao Yongfeng","doi":"10.1109/ICOT.2014.6956604","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956604","url":null,"abstract":"In this paper, a new lightning electric field signals denoising approach based on noise reduction algorithms in empirical mode decomposition(EMD) which was widely used for analyzing nonlinear and nonstationary data was applied. The data from the simulation and measurements were analyzed to evaluate this method comparing with the traditional FIR low-pass filter. The results showed that the denoising methods based on EMD provides very good results for denoising lightning electric field signals and it was effective and superior to the FIR filter method.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124055182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956642
Yuchao Fan, Mingxing Xu, Zhiyong Wu, Lianhong Cai
Emotion recognition from speech plays an important role in developing affective and intelligent human-computer interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system that determines each emotionally salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent AEVD studies. This study proposes a novel framework for AEVD using a multi-scaled sliding window (MSW-AEVD) that assigns an emotion class to each window shift by fusing the decisions of all the sliding windows containing that shift. First, a fixed-length sliding window is introduced as the basic procedure, and several different fusion methods are investigated. Then a multi-scaled sliding window is employed to support multiple classifiers with features at different timescales, for which another two fusion strategies are provided. Finally, a post-processing step is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that the proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.
{"title":"Automatic emotion variation detection using multi-scaled sliding window","authors":"Yuchao Fan, Mingxing Xu, Zhiyong Wu, Lianhong Cai","doi":"10.1109/ICOT.2014.6956642","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956642","url":null,"abstract":"Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotional salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window-shift by fusion decisions of all the sliding windows containing the shift. Firstly, sliding window with fixed-length is introduced as the basic procedure, in which several different fusion methods are investigated. Then multi-scaled sliding window is employed to support multi-classifiers with different timescale features, in which another two fusion strategies are provided. Finally, a postprocessing is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128745268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6954665
Wang Jian, Zhao Xin-Bo
Recently there has been much research on eye gaze points in naturalistic scenes, but in many of these studies subjects are instructed to view scenes without any particular task. So how do gaze patterns differ in visual search? In this paper, we provide a detailed analysis of the eye gaze points from 11 subjects' eye movements as they performed a search task on 1307 images. The eye-tracking data were analyzed in four aspects: agreement among subjects, center bias, the difference in gaze points between target-present and target-absent stimuli, and the distribution of gaze points in target-present images. The results show that during visual search tasks, in which subjects are asked to find a particular target in a display, the target object plays a dominant role in the guidance of eye movements.
{"title":"Analysis of eye gaze points based on visual search","authors":"Wang Jian, Zhao Xin-Bo","doi":"10.1109/ICOT.2014.6954665","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6954665","url":null,"abstract":"Recently there have been a lot of researches on eye gaze points in naturalistic scenes. But in many of these studies, subjects are instructed to view scenes without any particular task. So what are they different in visual research? In this paper, we provide detailed analysis of the eye gaze points from 11 subjects' eye movements when they performed a search task in 1307 images. The eye tracking data was analyzed in the following four aspects: agreement among subjects, center bias, difference of gaze points for each stimulus between target-present and target absent stimuli and the distribution of gaze points in the target-present image, The results of the analysis show that during visual search tasks, in which subjects are asked to find a particular target in a display, target object playa dominant role in the guidance of eye movements.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116830672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6954669
Ma Zhong, Zhao Xin-Bo
We aim to build a visual vocabulary by applying a model of visual attention. Concretely, we first learn a computational visual attention model from real eye-tracking data. We then use this model to find the most salient regions in images and extract features from these regions to build a visual vocabulary with more expressive power. An experiment was conducted to verify the effectiveness of the proposed visual-attention-based visual vocabulary. The results show that the proposed vocabulary boosts category-recognition performance, outperforming the traditional vocabulary.
{"title":"Visual attention based visual vocabulary","authors":"Ma Zhong, Zhao Xin-Bo","doi":"10.1109/ICOT.2014.6954669","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6954669","url":null,"abstract":"We aim to build a visual vocabulary by applying a model of visual attention. Concretely, we first learn a computational visual attention model from the real eye tracking data. Then using this model to find the most salient regions in the images, and extracting features from these regions to build a visual vocabulary with more expressive power. The experiment was conducted to verify the effectiveness of the proposed visual attention based visual vocabulary. The results show that the proposed vocabulary boosts the performance of the category recognition, which means the proposed vocabulary outperforms the traditional one.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124787667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6956621
Wen-Jen Hsieh, S. Tsai, Yuh-Tyng Chen, Jenq-daw Lee, Jun-jen Huang, Ignacio Jose Minambres Garcia
According to the World Health Organization (WHO) and Alzheimer's Disease International (ADI), at least 35.6 million people worldwide suffer from dementia. Mild cognitive impairment (MCI) is considered a risk state or prodrome of dementia. This paper aims to further explore the risk factors of mild cognitive impairment by analyzing longitudinal data from three waves of surveys (1999, 2003 and 2007) of the "Taiwan Longitudinal Study on Aging" (TLSA). A hierarchical linear model (HLM) is applied to TLSA samples of Taiwan's elderly aged 65 and over in 1999. Empirical results suggest that the worsening of cognitive function differs among individuals but clearly increases with age. Depressive symptoms show a statistically significant positive effect while educational attainment shows the opposite direction, whereas gender, marital status, ethnicity, health behavior, and family support are all statistically insignificant.
{"title":"Social determinants of mild cognitive impairment among the elderly — A case study of Taiwan","authors":"Wen-Jen Hsieh, S. Tsai, Yuh-Tyng Chen, Jenq-daw Lee, Jun-jen Huang, Ignacio Jose Minambres Garcia","doi":"10.1109/ICOT.2014.6956621","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6956621","url":null,"abstract":"According to the World Health Organization (WHO) and Alzheimer Disease International (ADI), there are at least 35.6 million people suffering from dementia in the world. Mild cognitive impairment (MCI) is considered as a risk state or a prodromal of dementia. This paper aims to make further exploration into the risk factors of mild cognitive impairment, analyzing the longitudinal data of three waves of surveys in 1999, 2003 and 2007 from “Taiwan Longitudinal Study on Aging” (TLSA). The hierarchical linear model (HLM) is applied to analyze samples from the TLSA of Taiwan's elderly 65 years old and over in 1999. Empirical results suggest that cognitive function worsening differs among individuals, but clearly increases with age. Depressive symptoms show statistically positively significance but educational attainment show the opposite direction, whereas gender, marital status, ethnic, health behavior, and family support are all not statistically significant.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128859860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2014-11-20 | DOI: 10.1109/ICOT.2014.6954694
Chen Ding, Yong Xia, Ying Li
This paper proposes a neural network based supervised segmentation algorithm for retinal vessel delineation. The histogram of each training image patch and its optimal threshold, obtained by iteratively comparing the binarization result to the manual segmentation, are fed to a BP neural network to establish the correspondence between the intensity distribution and the optimal segmentation parameter. Each test image can then be segmented using a number of local thresholds predicted by the trained neural network from the histograms of its image patches. The proposed algorithm has been evaluated on the DRIVE database, which contains forty retinal images with manually segmented vessel trees. Our results show that the proposed algorithm can effectively segment the vasculature in retinal images.
{"title":"Supervised segmentation of vasculature in retinal images using neural networks","authors":"Chen Ding, Yong Xia, Ying Li","doi":"10.1109/ICOT.2014.6954694","DOIUrl":"https://doi.org/10.1109/ICOT.2014.6954694","url":null,"abstract":"This paper proposes a neural network based supervised segmentation algorithm for retinal vessel delineation. The histogram of each training image patch and its optimal threshold acquired through iteratively comparing the binaryzation result to the manual segmentation are applied to a BP neural network to establish the correspondence between the intensity distribution and optimal segmentation parameter. Finally, each test image can be segmented by using a number of local thresholds that are predicted by the trained the neural network according the histograms of image patches. The propose algorithm has been evaluated on the DRIVE database that contains forty retinal images with manually segmented vessel trees. Our results show that the proposed algorithm can effective segment the vasculature in retinal images.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124859064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}