Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041688
Rui Yang
Digital audio recordings can be easily manipulated with widely available audio editing software. Forgery is often more than naive splicing: post-processing is usually part of the tampering and can eliminate the obvious traces of forgery. In particular, added noise can mask audible evidence of forgery and destroy traces of other tampering operations, so detecting additive noise in an audio signal is a useful tool for audio forensics. In this paper, we investigate the effect of additive noise on audio signals and propose a feature named "sign change rate" for detecting it. Theoretical analysis and extensive experiments show that the proposed feature is effective for additive noise detection. The method is also a potential tool for forgery localization in digital audio.
{"title":"Additive noise detection and its application to audio forensics","authors":"Rui Yang","doi":"10.1109/APSIPA.2014.7041688","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041688","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
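The abstract names the "sign change rate" feature but does not define it. The sketch below is only an illustration under a loud assumption: it takes the feature to be the fraction of adjacent first-difference pairs whose signs differ. The function names, the noise level, and the test signal are all hypothetical and are not taken from the paper.

```python
import math
import random


def sign_change_rate(samples):
    """Fraction of adjacent first-difference pairs whose signs differ.

    Assumed definition for illustration; the paper may define it differently.
    """
    diffs = [b - a for a, b in zip(samples, samples[1:])]
    diffs = [d for d in diffs if d != 0]  # ignore flat steps
    if len(diffs) < 2:
        return 0.0
    changes = sum(1 for d0, d1 in zip(diffs, diffs[1:]) if (d0 > 0) != (d1 > 0))
    return changes / (len(diffs) - 1)


def add_noise(samples, sigma, seed=0):
    """Add zero-mean Gaussian noise with standard deviation sigma."""
    rng = random.Random(seed)
    return [s + rng.gauss(0.0, sigma) for s in samples]


# A slowly varying tone changes direction rarely; added noise makes the
# first difference flip sign far more often.
clean = [math.sin(2 * math.pi * 5 * n / 1000) for n in range(1000)]
noisy = add_noise(clean, sigma=0.1)
print(sign_change_rate(clean), sign_change_rate(noisy))
```

Under this assumed definition, a segment whose rate is much higher than that of its neighbours would be a candidate for closer inspection, which is the kind of localization use the abstract hints at.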
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041766
Kanokphan Lertniphonphan, S. Aramvith, T. Chalidabhongse
Human actions in video vary in both the spatial and time domains, which makes action classification difficult. Because the body is articulated, the amount of movement from point to point is not constant over an action; plotted over time, it follows a bell shape. In this paper, key frames are detected to mark the starting and ending points of an action cycle. The time between key frames determines the window length for feature extraction in the time domain. Since cycle lengths vary, the key frame interval also varies and adapts to the performer and the action. A local orientation histogram of the Key Pose Energy Image (KPEI) and the Motion History Image (MHI) is constructed over this interval. Experimental results on the WEIZMANN dataset demonstrate that features extracted within the adaptive key frame interval can effectively classify actions.
{"title":"Feature extraction for human action classification using adaptive key frame interval","authors":"Kanokphan Lertniphonphan, S. Aramvith, T. Chalidabhongse","doi":"10.1109/APSIPA.2014.7041766","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041766","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
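For readers unfamiliar with the Motion History Image mentioned in the abstract, here is a minimal sketch of the standard MHI update rule: a pixel that moved is set to a duration value tau, and every other pixel decays by one toward zero. Detecting motion by thresholded frame differencing, the function name, and treating tau as the key frame interval are assumptions made for illustration; the abstract does not describe the authors' implementation.

```python
def update_mhi(mhi, prev_frame, frame, tau, threshold):
    """Apply one Motion History Image update step.

    Frames and the MHI are lists of rows of numbers with identical shape.
    A pixel counts as moving when its absolute frame difference exceeds
    `threshold` (an assumed motion detector).
    """
    updated = []
    for mhi_row, prev_row, row in zip(mhi, prev_frame, frame):
        new_row = []
        for history, previous, current in zip(mhi_row, prev_row, row):
            moving = abs(current - previous) > threshold
            new_row.append(tau if moving else max(0, history - 1))
        updated.append(new_row)
    return updated


# A bright pixel moving left to right across a 1x4 image.
frames = [
    [[9, 0, 0, 0]],
    [[0, 9, 0, 0]],
    [[0, 0, 9, 0]],
]
mhi = [[0, 0, 0, 0]]
for prev_frame, frame in zip(frames, frames[1:]):
    mhi = update_mhi(mhi, prev_frame, frame, tau=3, threshold=1)
print(mhi)  # [[2, 3, 3, 0]]: older motion has already started to fade
```

In an adaptive scheme like the one the abstract describes, tau would presumably be set from the detected key frame interval rather than fixed in advance.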
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041606
Daisuke Ikefuji, H. Tsujii, S. Masunaga, M. Nakayama, T. Nishiura, Y. Yamashita
Technologies for reproducing a 3-dimensional sound field are increasingly required to provide highly realistic sensations. We previously proposed a system with multiple parametric array loudspeakers (PALs). PALs can place sound images on walls, ceilings, and floors by exploiting the high directivity of ultrasound, so the system can easily present sound arriving from various directions. However, depending on the reverberation time, it is difficult to provide a realistic sensation. In addition, the listening area of a single PAL is small. In this paper, we propose two approaches to overcome these problems. First, we propose reverberation steering that combines indirect electrodynamic loudspeakers with PALs. Second, we attempt to expand the listening area of the sound image with a curved-type PAL. Evaluation experiments confirmed the effectiveness of each approach.
{"title":"Reverberation steering and listening area expansion on 3-D sound field reproduction with parametric array loudspeaker","authors":"Daisuke Ikefuji, H. Tsujii, S. Masunaga, M. Nakayama, T. Nishiura, Y. Yamashita","doi":"10.1109/APSIPA.2014.7041606","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041606","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041516
K. Kondo
We propose and evaluate an estimation method for the forced-selection Japanese Diagnostic Rhyme Test (DRT). The proposed measure takes into account that the DRT forces the listener to select one word from a pair of rhyming words. The objective distance measure used here is based on the Articulation Index Band Correlation (ABC), which showed favorable results for the English Modified Rhyme Test (MRT). The time-frequency (T-F) pattern of the test word is correlated with the template speech of each of the two words in the candidate pair. The T-F patterns are calculated in the Articulation Index (AI) bands, and the correlation is calculated between corresponding bands of the test and candidate word samples. The candidate word for which more AI bands show the higher correlation is chosen as the likely word. The ratio of the number of bands with higher correlation for the candidate word to the total number of bands quantifies how well the test word matches that candidate. We estimated a logistic mapping function from this ratio to intelligibility scores using speech mixed with known noise. The mapping functions were then used to estimate the intelligibility of speech mixed with unknown noise. This estimate was compared with another measure that we previously evaluated, the frequency-weighted segmental SNR, and proved to be more accurate, with a correlation between subjective and estimated intelligibility above 0.93 and a root mean square error below 0.15. Thus, it should be possible to "screen" the intelligibility in many of the noise conditions to be tested and cut down on the scale of the subjective test needed.
{"title":"Estimation of Japanese DRT intelligibility using Articulation Index Band Correlations","authors":"K. Kondo","doi":"10.1109/APSIPA.2014.7041516","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041516","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
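The band-counting step and the logistic mapping described in this abstract can be sketched as follows. This is a minimal illustration, not the author's implementation: the per-band patterns are plain lists of numbers, Pearson correlation is assumed as the correlation measure, and the logistic parameters `slope` and `midpoint` are hypothetical placeholders for values that would be fitted on speech mixed with known noise.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def band_ratio(test_bands, target_bands, rival_bands):
    """Share of bands where the test word correlates more with the target
    word of the pair than with the rival word."""
    wins = sum(
        1
        for test, target, rival in zip(test_bands, target_bands, rival_bands)
        if pearson(test, target) > pearson(test, rival)
    )
    return wins / len(test_bands)


def logistic(ratio, slope, midpoint):
    """Map a band ratio to an intelligibility estimate in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-slope * (ratio - midpoint)))


# Two toy bands: the first favours the target word, the second the rival.
test_bands = [[1, 2, 3], [3, 2, 1]]
target_bands = [[2, 4, 6], [1, 2, 3]]
rival_bands = [[3, 2, 1], [6, 4, 2]]
ratio = band_ratio(test_bands, target_bands, rival_bands)
print(ratio, logistic(ratio, slope=10.0, midpoint=0.5))
```

Counting bands rather than averaging correlations mirrors the forced choice in the listening test: each band casts one vote for one of the two words.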
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041704
Hirotaka Osawa, Wataru Kayano
Many of the home appliances that support our daily household activities are complicated, and their functions are difficult to learn. To explain these functions clearly to users, we design attachable agential triggers that render home appliances as explanatory agents. We detail how our application helps explain the use of home appliances to users. The proposed robotic arms are easier to use for pointing to a home appliance than previously used attachable arms.
{"title":"Attachable robotic arm for anthropomorphized explanation by pointing","authors":"Hirotaka Osawa, Wataru Kayano","doi":"10.1109/APSIPA.2014.7041704","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041704","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041749
Wuttichai Luangruangrong, P. Kulkasem, Suwanna Rasmequan, Annupan Rodtook, K. Chinnasarn
Diabetic retinopathy with exudates seriously impairs vision and is a cause of blindness in diabetic patients. The number of diabetic retinopathy patients is increasing, while the number of doctors cannot easily be increased in the same proportion, which places a heavy workload on doctors. Medical image processing research has shown that simply obtaining a second opinion can significantly help a physician's diagnosis. This research proposes a method to detect exudates in diabetic retinopathy images. Early detection of exudates can reduce the severity of diabetic retinopathy. The proposed method consists of 5 major steps: 1) improve image quality using contrast limited adaptive histogram equalization (CLAHE); 2) apply the object attribute thresholding (OAT) algorithm to remove non-retinal objects; 3) detect blood vessels with Frangi's algorithm based on Hessian filtering; 4) detect the retinal optic disc by combining multi-resolution analysis with the Hough transform; and 5) classify exudates in the remaining region with hierarchical fuzzy c-means clustering. The method is evaluated on DIARETDB, the retinal image database of the Lappeenranta University of Technology, where its performance is found to be adequate for exudate detection.
{"title":"Automatic exudates detection in retinal images using efficient integrated approaches","authors":"Wuttichai Luangruangrong, P. Kulkasem, Suwanna Rasmequan, Annupan Rodtook, K. Chinnasarn","doi":"10.1109/APSIPA.2014.7041749","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041749","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041618
Sakura Tsuruta, Kou Tanaka, T. Toda, Graham Neubig, S. Sakti, Satoshi Nakamura
Nonaudible murmur (NAM) is a soft whispered voice recorded through body conduction with a NAM microphone. NAM allows for silent speech communication because it lets speakers convey their message in a nonaudible voice. However, its intelligibility and naturalness are significantly degraded compared with those of natural speech, owing to acoustic changes caused by body conduction. To address this issue, statistical voice conversion (VC) methods from NAM to normal speech (NAM-to-Speech) and to a whispered voice (NAM-to-Whisper) have been proposed. These NAM enhancement methods have been reported to significantly improve the speech quality and intelligibility of NAM, with NAM-to-Whisper being more effective than NAM-to-Speech. However, it is still not obvious which method is more effective when a listener hears the enhanced speech in a noisy environment, a situation that often arises in silent speech communication. In this paper, assuming a typical situation in which NAM is uttered by a speaker in a quiet environment and conveyed to a listener in a noisy environment, we investigate which kinds of target speech are more effective for NAM enhancement. We also propose NAM enhancement methods for converting NAM to other types of voiced target speech. Experiments show that conversion into voiced speech is more effective than conversion into unvoiced speech for generating intelligible speech in noisy environments.
{"title":"An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments","authors":"Sakura Tsuruta, Kou Tanaka, T. Toda, Graham Neubig, S. Sakti, Satoshi Nakamura","doi":"10.1109/APSIPA.2014.7041618","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041618","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041820
P. Artameeyanant, Sivarit Sultornsanee, K. Chamnongthai, K. Higuchi
Analyzing the electromyogram is important in the diagnosis of neuromuscular diseases, and classification of the electromyogram signal plays a significant role in this task. Because the signals are complex and non-stationary, a complex network is an appropriate tool for extracting their features. In this paper, we propose a novel feature extraction technique that transforms the signal into a complex network via the vertical visibility algorithm. Measurements of the community structure and the distance properties of the network are examined, and the pattern of relationships among its nodes is investigated. A support vector machine is employed for classification. The proposed method classifies the signals into 3 cases (healthy, myopathy, and neuropathy) with remarkable experimental results.
{"title":"Classification of electromyogram using vertical visibility algorithm with support vector machine","authors":"P. Artameeyanant, Sivarit Sultornsanee, K. Chamnongthai, K. Higuchi","doi":"10.1109/APSIPA.2014.7041820","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041820","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
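The abstract does not spell out the vertical visibility algorithm, so the sketch below substitutes the well-known natural visibility graph as a stand-in to show how a time series becomes a network. In that construction, samples a and b are linked when every sample between them lies strictly below the straight line joining them. The function names are hypothetical, and the degree list is just one simple example of a network feature that could be passed to a classifier.

```python
def natural_visibility_edges(series):
    """Edges (a, b), a < b, of the natural visibility graph of a series.

    Stand-in technique: this is NOT the vertical visibility algorithm
    named in the abstract, whose definition is not given there.
    """
    n = len(series)
    edges = []
    for a in range(n):
        for b in range(a + 1, n):
            ya, yb = series[a], series[b]
            # Height of the line from (a, ya) to (b, yb) at position c.
            visible = all(
                series[c] < yb + (ya - yb) * (b - c) / (b - a)
                for c in range(a + 1, b)
            )
            if visible:
                edges.append((a, b))
    return edges


def degrees(n, edges):
    """Node degrees, a simple per-node feature of the resulting network."""
    counts = [0] * n
    for a, b in edges:
        counts[a] += 1
        counts[b] += 1
    return counts


signal = [3, 1, 2]
edges = natural_visibility_edges(signal)
print(edges, degrees(len(signal), edges))
```

This direct construction is quadratic in the number of samples (cubic in the worst case because of the inner check), which is acceptable for short windows of a signal but not for long recordings.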
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041675
Farid Abedan Kondori, Li Liu, Haibo Li
In recent years, the emergence of telerehabilitation systems for home-based therapy has altered healthcare systems. Telerehabilitation enables therapists to observe a patient's status via the Internet, so the patient does not have to visit a rehabilitation facility for every session. Although telerehabilitation provides great opportunities, two major issues limit its effectiveness: relegation of the patient to the home, and loss of direct supervision by the therapist. Since patients have no actual interaction with other people during the rehabilitation period, they become isolated and gradually lose their social skills. Moreover, without direct supervision by therapists, rehabilitation exercises may be performed with bad compensation strategies that lead to poor-quality recovery. To resolve these issues, we propose telelife, a new concept for future rehabilitation systems. The idea is to use media technology to create a totally new immersive media experience for rehabilitation. In telerehabilitation, patients execute exercises locally and therapists monitor their status remotely. In telelife, however, patients perform exercises remotely and therapists monitor locally. Thus, telelife not only enables rehabilitation at a distance but also improves patients' social competences and provides direct supervision by therapists. In this paper, we introduce telelife as an enhancement of telerehabilitation and investigate the technical challenges and possible methods for achieving it.
{"title":"Telelife: An immersive media experience for rehabilitation","authors":"Farid Abedan Kondori, Li Liu, Haibo Li","doi":"10.1109/APSIPA.2014.7041675","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041675","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
Pub Date: 2014-12-01 DOI: 10.1109/APSIPA.2014.7041682
Yili Xia, Kai Wang, Wenjiang Pei, D. Mandic
This paper addresses the detection of the fundamental frequency of power systems under unbalanced and distorted conditions. By using the second-order information, both the autocorrelation and the pseudo-autocorrelation, within the Clarke-transformed voltage, a novel balancing voltage transformation (BVT) is proposed to accurately detect the underlying phase angle evolution of the positive sequence component. This removes the biggest obstacle in current power systems and makes it possible to use any single-tone exponential frequency estimator on unbalanced power systems. The robustness of the proposed phase angle detection technique is illustrated for two well-known and efficient frequency estimators: a discrete Fourier transform (DFT) coefficient interpolation method [1] and the weighted linear predictor (WLP) [2]. A window technique is used to keep frequency estimation fast and computationally affordable. Simulations over a range of unbalanced conditions, including voltage dips and swells, frequency deviations, and the presence of higher-order harmonics, support the analysis.
{"title":"A balancing voltage transformation for robust frequency estimation in unbalanced power systems","authors":"Yili Xia, Kai Wang, Wenjiang Pei, D. Mandic","doi":"10.1109/APSIPA.2014.7041682","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041682","journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific"},"publicationDate":"2014-12-01"}
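As background for this abstract, the sketch below shows the standard power-invariant Clarke transform and a basic phase-increment frequency estimate for a balanced three-phase system, where the transformed voltage is a single rotating complex exponential. It does not implement the proposed BVT, which the abstract does not specify; the function names, the 5 kHz sampling rate, and the 50.5 Hz test frequency are assumptions chosen for illustration.

```python
import cmath
import math


def clarke(va, vb, vc):
    """Power-invariant Clarke transform, returned as v_alpha + j*v_beta."""
    scale = math.sqrt(2.0 / 3.0)
    v_alpha = scale * (va - 0.5 * vb - 0.5 * vc)
    v_beta = scale * (math.sqrt(3.0) / 2.0) * (vb - vc)
    return complex(v_alpha, v_beta)


def estimate_frequency(v, fs):
    """Frequency in Hz from the average phase advance per sample.

    Exact only for a single complex exponential, i.e. a balanced system.
    """
    acc = sum(v[k + 1] * v[k].conjugate() for k in range(len(v) - 1))
    return fs * cmath.phase(acc) / (2.0 * math.pi)


def balanced_three_phase(f0, fs, n):
    """Clarke-transformed samples of a balanced unit-amplitude system."""
    shift = 2.0 * math.pi / 3.0
    samples = []
    for k in range(n):
        theta = 2.0 * math.pi * f0 * k / fs
        samples.append(
            clarke(math.cos(theta), math.cos(theta - shift), math.cos(theta + shift))
        )
    return samples


v = balanced_three_phase(f0=50.5, fs=5000.0, n=200)
print(estimate_frequency(v, fs=5000.0))
```

Under unbalanced conditions the Clarke voltage also contains a counter-rotating component, so this simple estimator becomes biased; that is the situation the balancing transformation in the abstract is meant to handle.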