Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific: latest papers

Discovering and analyzing learning pattern on web based learning using social network analysis
P. Temdee, Wacharawan Intayoad
Web-based learning has promoted an alternative way of learning for decades. The difficulty of web-based learning is providing appropriate support so that learners do not get lost and their learning achievements can be ensured. This paper therefore proposes a method for discovering learners' learning patterns in web-based learning, particularly for ensuring learning achievement. The learning pattern is discovered by analyzing the interactions among the learners and the learning objects with social network analysis. The achievement learning pattern is then determined by analyzing the sets of obtained social network measurements. The interaction data are gathered from an online course named Introduction to Information Technology in the 2013 academic year, specifically from the spreadsheet content module, which has 10 learning objects. Only the interaction patterns of two groups of students who passed the spreadsheet examination, one with a scientific and one with a nonscientific background, are analyzed. Finally, the learning patterns that ensure learning achievement in the spreadsheet content module for students with different background knowledge are revealed.
DOI: https://doi.org/10.1109/APSIPA.2014.7041814 (published 2014-12-01)
Citations: 3
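To make the idea of "social network measurements" over learner and learning-object interactions in the Temdee and Intayoad abstract more concrete, here is a minimal sketch of one common measurement, normalized degree centrality on a two-mode network. The abstract does not say which measurements the authors used, so the choice of degree centrality, the `interactions` log, and the function name `degree_centrality` are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical interaction log: (learner_id, learning_object_id) pairs.
interactions = [
    ("s1", "LO1"), ("s1", "LO2"), ("s1", "LO3"),
    ("s2", "LO1"), ("s2", "LO3"),
    ("s3", "LO2"),
]

def degree_centrality(edges):
    """Normalized degree centrality for a two-mode (bipartite) network.

    A learner's degree is divided by the number of learning objects,
    and a learning object's degree by the number of learners.
    """
    learners = {learner for learner, _ in edges}
    objects = {obj for _, obj in edges}
    neighbors = defaultdict(set)
    for learner, obj in edges:
        neighbors[learner].add(obj)
        neighbors[obj].add(learner)
    centrality = {}
    for learner in learners:
        centrality[learner] = len(neighbors[learner]) / len(objects)
    for obj in objects:
        centrality[obj] = len(neighbors[obj]) / len(learners)
    return centrality

centrality = degree_centrality(interactions)
for node in sorted(centrality):
    print(node, round(centrality[node], 3))
```

In this toy log, learner `s1` touches every learning object and so gets the maximum score of 1.0. Comparing such scores between groups of students is one plausible way to contrast interaction patterns.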
Phase detection of multi-channel SSVEPs via complex sparse spatial weighting
Keita Shimpo, Toshihisa Tanaka
A brain-computer interface (BCI) based on steady-state visual evoked potentials (SSVEPs) is one of the most practical BCIs because of its high recognition accuracy and short training time. The phase of SSVEPs is potentially applicable to generating device commands. However, an effective method for estimating the phase of SSVEPs has not yet been established, especially when multi-channel electroencephalogram (EEG) recordings are used. In this paper, we propose a novel method for estimating the phase of SSVEPs from multi-channel EEG, which uses complex sparse spatial weighting. We conducted experiments with a phase-coded SSVEP-based BCI to evaluate the performance of the proposed method. The proposed method showed higher recognition accuracy than conventional methods in all six subjects.
DOI: https://doi.org/10.1109/APSIPA.2014.7041666 (published 2014-12-01)
Citations: 3
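The Shimpo and Tanaka abstract describes estimating SSVEP phase from several EEG channels combined by complex spatial weights. The sketch below shows only the generic building blocks: a Fourier coefficient per channel at the stimulus frequency, a weighted sum, and the phase of the result. It does not reproduce the paper's sparse weight estimation. The uniform weights, the synthetic noise-free signal, and the function names are assumptions made for illustration.

```python
import cmath
import math

def fourier_coefficient(samples, freq_hz, fs_hz):
    """Complex Fourier coefficient of one channel at the stimulus frequency."""
    n = len(samples)
    total = sum(
        x * cmath.exp(-2j * math.pi * freq_hz * k / fs_hz)
        for k, x in enumerate(samples)
    )
    return 2 * total / n

def estimate_phase(channels, weights, freq_hz, fs_hz):
    """Phase (radians) of the spatially weighted sum of channel coefficients."""
    combined = sum(
        w.conjugate() * fourier_coefficient(ch, freq_hz, fs_hz)
        for w, ch in zip(weights, channels)
    )
    return cmath.phase(combined)

# Synthetic, noise-free check: three channels share one phase, with
# different amplitudes. One second at 250 Hz holds a whole number of
# 10 Hz cycles, so there is no spectral leakage.
fs_hz, freq_hz, true_phase = 250.0, 10.0, math.pi / 3
n_samples = 250
channels = [
    [amp * math.cos(2 * math.pi * freq_hz * k / fs_hz + true_phase)
     for k in range(n_samples)]
    for amp in (1.0, 0.5, 0.2)
]
weights = [complex(1, 0)] * 3  # assumed uniform weights, not learned ones
estimated = estimate_phase(channels, weights, freq_hz, fs_hz)
print(round(estimated, 6), round(true_phase, 6))
```

With real EEG the channels differ in noise level and phase lag, which is where learned (and possibly sparse) complex weights would matter; uniform weights are only a placeholder here.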
Proportional feedback based rate control for intra frame of H.264/AVC high profile
Yanping Zhou, Y. Duan, Jun Sun, Zongming Guo
This paper focuses on the intra frame rate control of H.264/AVC High Profile and introduces a new frame gradient-based rate control algorithm. In this algorithm, a rate-gradient-quantization parameter model with frame gradient employed as frame complexity is proposed. Then, a proportional feedback scheme, along with an adaptive optimization method, is presented to achieve constant bitrate. Rigorous experiments covering various sequences of different target rates are carried out. Experimental results show that the proposed rate control method outperforms JM16.0 by offering a more constant rate output and reducing rate fluctuation, without video quality loss.
DOI: https://doi.org/10.1109/APSIPA.2014.7041537 (published 2014-12-01)
Citations: 0
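As a rough illustration of the proportional feedback idea in the Zhou et al. abstract, the sketch below nudges the quantization parameter (QP) after each intra frame in proportion to how far the frame's size missed the target. It is not the paper's rate-gradient-QP model: the toy rate model `toy_frame_bits`, the gain value, and the complexity numbers are hypothetical. The only codec facts relied on are that H.264 QP ranges from 0 to 51 and that the quantization step size doubles for every increase of 6 in QP.

```python
import math

def clamp_qp(qp):
    """H.264 quantization parameters lie in the range 0..51."""
    return max(0, min(51, qp))

def next_qp(current_qp, target_bits, actual_bits, gain=6.0):
    """Proportional update: raise QP when the frame overshot the target.

    The error is measured in log2 units, so under the toy model below
    a gain of 6 would cancel the error in a single step.
    """
    error = math.log2(actual_bits / target_bits)
    return clamp_qp(round(current_qp + gain * error))

def toy_frame_bits(complexity, qp):
    """Hypothetical rate model: frame size halves for every +6 in QP."""
    return complexity * 2.0 ** (-qp / 6.0)

target_bits = 40_000.0
qp = 20
history = []
for complexity in (600_000.0, 900_000.0, 750_000.0, 750_000.0):
    bits = toy_frame_bits(complexity, qp)
    history.append((qp, round(bits)))
    qp = next_qp(qp, target_bits, bits)

for frame_qp, frame_bits in history:
    print(frame_qp, frame_bits)
```

A pure feedback loop like this reacts only after a frame has been coded. The abstract's use of frame gradient as a complexity measure suggests the authors also predict the QP before coding, which this sketch leaves out.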
Range extrapolation of Head-Related Transfer Function using improved Higher Order Ambisonics
Ling-song Zhou, C. Bao, Mao-shen Jia, Bing Bu
3D audio technology based on binaural reproduction requires Head-Related Transfer Function (HRTF) datasets to be available for all possible distances. However, because measurement is tedious and the resulting datasets are large, the HRTF is typically measured only for sources located at a fixed distance. In this paper, the concept of virtual loudspeaker arrays is used to extrapolate HRTF datasets measured at a single range to other ranges. The virtual loudspeakers are driven by Higher Order Ambisonics (HOA). Specifically, to restrict the near-field effect of HOA, a compensation method based on a modified Wiener filter is proposed. The simulation results indicate that the proposed method provides effective range extrapolation of the HRTF.
DOI: https://doi.org/10.1109/APSIPA.2014.7041527 (published 2014-12-01)
Citations: 6
Compressed sensing based channel estimation for uplink OFDMA systems
K. Hayashi, Masanori Sakai, Takuya Kamenosono, Megumi Kaneko
The paper considers a time domain channel estimation approach for uplink OFDMA (Orthogonal Frequency Division Multiple Access) systems. Although frequency domain channel estimation schemes are widely used for those systems, we propose time domain channel estimation schemes by taking advantage of the sparsity of channel impulse response with compressed sensing. Numerical simulations show the merit of the proposed schemes, which demonstrates the validity of the time domain channel estimation approach for OFDMA systems.
DOI: https://doi.org/10.1109/APSIPA.2014.7041569 (published 2014-12-01)
Citations: 3
Analysis of customer communication by employee in restaurant and lead time estimation
Masanori Takehara, Hiroya Nojiri, S. Tamura, S. Hayamizu, T. Kurata
Human behavior sensing and analysis play an important role in improving service quality and employee education. This paper presents novel frameworks for detecting customer communication and for lead time estimation (LTE) using multi-sensor data, sound data, and accounting data from a restaurant. These frameworks help management understand work environments and the problems employees face. Lead time from order to delivery reflects the quality of service for customers. We found that sound data of an employee's speech, processed by speech ratio smoothing and POS sound detection, is useful for these techniques.
DOI: https://doi.org/10.1109/APSIPA.2014.7041701 (published 2014-12-01)
Citations: 1
Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording
Longbiao Wang, Bo Ren, Yuma Ueda, A. Kai, Shunta Teraoka, T. Fukushima
In this paper, we propose a robust distant-talking speech recognition system with asynchronous speech recording. This is implemented by combining denoising autoencoder-based cepstral-domain dereverberation, automatic asynchronous speech (microphone or mobile terminal) selection and environment adaptation. Although applications using mobile terminals have attracted increasing attention, there are few studies that focus on distant-talking speech recognition with asynchronous mobile terminals. For the system proposed in this paper, after applying a denoising autoencoder in the cepstral domain of speech to suppress reverberation and performing Large Vocabulary Continuous Speech Recognition (LVCSR), we adopted automatic asynchronous mobile terminal selection and environment adaptation using speech segments from optimal mobile terminals. The proposed method was evaluated using a reverberant WSJCAM0 corpus, which was emitted by a loudspeaker and recorded in a meeting room with multiple speakers by multiple far-field mobile terminals. By integrating a cepstral-domain denoising autoencoder and automatic mobile terminal selection with environment adaptation, the average Word Error Rate (WER) was reduced from the baseline system's 51.8% to 28.8%, i.e., the relative error reduction rate was 44.4% when using multi-condition acoustic models.
DOI: https://doi.org/10.1109/APSIPA.2014.7041548 (published 2014-12-01)
Citations: 6
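The 44.4% relative error reduction quoted in the Wang et al. abstract follows directly from the two WER figures it gives. The short check below recomputes it; the variable names are illustrative only.

```python
baseline_wer = 51.8   # % WER of the baseline system (from the abstract)
proposed_wer = 28.8   # % WER of the proposed system (from the abstract)

# Relative reduction = absolute reduction divided by the baseline value.
relative_reduction = (baseline_wer - proposed_wer) / baseline_wer * 100
print(f"{relative_reduction:.1f}%")  # prints "44.4%"
```

The absolute reduction is 23.0 percentage points; the relative figure expresses that as a share of the baseline error.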
Contactless palmprint alignment based on intrinsic local affine-invariant feature points
C. Phromsuthirak, W. Tangsuksant, A. Sanpanich, C. Pintavirooj
The palmprint, a biometric characteristic, is widely used in civil and commercial security applications because it is reliable and easy to capture with low-resolution devices. This paper develops a new contactless palmprint alignment method using a general USB camera mounted on a tripod. The palmprint image is acquired by this camera and aligned using intrinsic local affine-invariant key points residing on the area patches spanning between two successive fingers. Because the key points are relatively invariant to affine transformations, the algorithm does not need guidance pegs to fix the hand position during acquisition, and it avoids the scaling, translation, and rotation problems that hinder correct palmprint image alignment. Finally, the developed algorithm was tested on 10 left-hand palmprint images collected from different subjects. The simulation results indicate a distance map error of 1.4899 pixels.
DOI: https://doi.org/10.1109/APSIPA.2014.7041563 (published 2014-12-01)
Citations: 1
Support Vector Machine (SVM) based classifier for Khmer Printed Character-set Recognition
Pongsametrey Sok, Nguonly Taing
This paper describes the use of a Support Vector Machine (SVM) based classification method for Khmer Printed Character-set Recognition (PCR) in bitmap documents. The Khmer language has been identified as one of the most complex languages, with a total of 74 letters and word compounds that can have up to 5 vertical levels. This paper proposes a new SVM-based Khmer character classification system that applies three SVM kernels (Gaussian, polynomial, and linear) to training and recognition in order to find the best kernel for Khmer. The method allows a small training dataset, because it trains on individual character pieces rather than on a large number of clusters. The classification uses binary data, with 0 for white space and 1 for the black pixel area of the character; each training character piece is stretched into a matrix of binary data regardless of image size. Features are extracted from the matrix for use in SVM classification. After recognition, rules based on character levels and common-mistake correction combine the clusters or characters. An experiment on about 750 pure Khmer words (around 3000 characters) shows that the SVM method with the Gaussian kernel performs best among all kernels. The system is trained on a single font, "Khmer OS Content", at 32pt, and recognizes three font sizes. The accuracy is 98.17% at 28pt, 98.62% at 32pt, and 98.54% at 36pt.
DOI: https://doi.org/10.1109/APSIPA.2014.7041823 (published 2014-12-01)
Citations: 9
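The Sok and Taing abstract compares Gaussian, polynomial, and linear SVM kernels on binary glyph matrices. For readers unfamiliar with these kernels, the sketch below writes out their standard textbook definitions and evaluates them on two tiny flattened bitmaps. The glyph data, the hyperparameter defaults (`gamma`, `degree`, `coef0`), and the function names are assumptions; the abstract does not report the settings the authors used.

```python
import math

def linear_kernel(x, y):
    """Inner product of the two feature vectors."""
    return sum(a * b for a, b in zip(x, y))

def polynomial_kernel(x, y, degree=3, coef0=1.0):
    """(x . y + coef0) ** degree"""
    return (linear_kernel(x, y) + coef0) ** degree

def gaussian_kernel(x, y, gamma=0.5):
    """exp(-gamma * ||x - y||^2), also known as the RBF kernel."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

# Two hypothetical 3x3 glyph bitmaps, flattened row by row
# (0 = white space, 1 = black pixel, as in the abstract).
glyph_a = [0, 1, 1, 0, 1, 0, 0, 1, 1]
glyph_b = [0, 1, 0, 0, 1, 0, 1, 1, 1]

k_lin = linear_kernel(glyph_a, glyph_b)
k_poly = polynomial_kernel(glyph_a, glyph_b)
k_rbf = gaussian_kernel(glyph_a, glyph_b)
print(k_lin, k_poly, round(k_rbf, 4))
```

For binary vectors the linear kernel simply counts shared black pixels, while the Gaussian kernel depends on the number of differing pixels, which may be part of why it separates similar-looking glyphs well.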
Modeling spatial uncertainty of imprecise information in images
T. Pham
The description of information content in images is imprecise in nature. Quantification of uncertainty in images for pattern analysis has been addressed with the theories of probability and fuzzy sets. In this paper, an approach for modeling the spatial uncertainty of images is proposed in the setting of geostatistics and probability measure of fuzzy events. The proposed approach can be utilized to extract an effective feature for image classification.
DOI: https://doi.org/10.1109/APSIPA.2014.7041514 (published 2014-12-01)
Citations: 0
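The Pham abstract places its model "in the setting of geostatistics". A basic geostatistical tool for describing spatial variability is the experimental semivariogram, which averages half the squared difference between pixel values separated by a given lag. The sketch below computes it along image rows. It illustrates the general tool only and leaves out the paper's fuzzy-event probability measure; the toy image and the function name are assumptions.

```python
def semivariogram(image, lag):
    """Experimental semivariogram along rows for a horizontal lag in pixels.

    gamma(lag) = sum((z[j] - z[j + lag]) ** 2) / (2 * number_of_pairs)
    The lag must be smaller than the row width, otherwise no pairs exist.
    """
    total, count = 0.0, 0
    for row in image:
        for j in range(len(row) - lag):
            diff = row[j] - row[j + lag]
            total += diff * diff
            count += 1
    return total / (2 * count)

# Hypothetical 3x4 binary image.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [1, 1, 0, 0],
]
gamma_1 = semivariogram(image, 1)
gamma_2 = semivariogram(image, 2)
print(round(gamma_1, 4), gamma_2)
```

Here neighbouring pixels are mostly alike (small value at lag 1) while pixels two apart always differ (larger value at lag 2). The sequence of values over several lags could serve as a texture feature for classification.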