首页 > 最新文献

2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)最新文献

英文 中文
Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model 基于连续隐马尔可夫模型的hmm语音合成中的显式持续时间建模
K. Ogbureke, João P. Cabral, Julie Carson-Berndsen
This paper presents a novel approach to explicit duration modelling for HMM-based speech synthesis. The proposed approach is a two-step process. The first step in this process is state level phone alignment and conversion of phone durations into the number of frames. In the second step, a hidden Markov model (HMM) is trained whereby the observation is the number of frames in each state and the hidden state the phone. Finally, the duration of each state (the number of frames) is generated from the trained HMM. Hidden semi-Markov model (HSMM) is the baseline for explicit duration modelling in HMM-based speech synthesis. Both objective and perceptual evaluation on a held-out test set showed comparable results with a baseline HSMM-based speech synthesis. This duration modelling approach is computationally simpler than HSMM and produces comparable results in terms of the quality of synthetic speech.
本文提出了一种新的基于hmm的语音合成显式持续时间建模方法。提议的方法分为两步。这个过程的第一步是状态级电话对齐,并将电话持续时间转换为帧数。在第二步中,训练隐马尔可夫模型(HMM),其中观察值是每个状态下的帧数和手机的隐藏状态。最后,从训练好的HMM生成每个状态的持续时间(帧数)。隐半马尔可夫模型(HSMM)是基于隐半马尔可夫模型的语音合成中显式持续时间建模的基础。在测试集上的客观和感知评价与基于基线hsmm的语音合成结果相当。这种持续时间建模方法在计算上比HSMM简单,并且在合成语音质量方面产生可比较的结果。
{"title":"Explicit duration modelling in HMM-based speech synthesis using continuous hidden Markov Model","authors":"K. Ogbureke, João P. Cabral, Julie Carson-Berndsen","doi":"10.1109/ISSPA.2012.6310643","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310643","url":null,"abstract":"This paper presents a novel approach to explicit duration modelling for HMM-based speech synthesis. The proposed approach is a two-step process. The first step in this process is state level phone alignment and conversion of phone durations into the number of frames. In the second step, a hidden Markov model (HMM) is trained whereby the observation is the number of frames in each state and the hidden state the phone. Finally, the duration of each state (the number of frames) is generated from the trained HMM. Hidden semi-Markov model (HSMM) is the baseline for explicit duration modelling in HMM-based speech synthesis. Both objective and perceptual evaluation on a held-out test set showed comparable results with a baseline HSMM-based speech synthesis. This duration modelling approach is computationally simpler than HSMM and produces comparable results in terms of the quality of synthetic speech.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131990065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A discrete linear chirp transform (DLCT) for data compression 用于数据压缩的离散线性啁啾变换(dct)
Osama A S Alkishriwo, L. Chaparro
Compressive sensing attempts to simplify the frequency transformation and thresholding steps, commonly done in data compression, into one. Sparseness of the signal, in either time or frequency, is required for the convex optimization in compressive sensing to perform well. Although sparseness of certain signals, in either time or frequency, is guaranteed by the uncertainty principle signals composed of chirps are not however sparse in either domain. In this paper we propose an orthogonal linear-chirp transform, the discrete linear chirp transform (DLCT), to represent any signal in terms of linear chirps, with modulation and dual properties. Using the DLCT the sparseness of the signal in either time or frequency can be assessed, and if not sparse in neither of these domains, the modulation and dual properties of the DLCT provide a way to transform the signal into a sparse signal. The application of the proposed DLCT is in data compression. The transformation is illustrated by using sparse and not sparse test signals as well as actual signals.
压缩感知试图将通常在数据压缩中完成的频率变换和阈值处理步骤简化为一个步骤。信号在时间和频率上的稀疏性是压缩感知中的凸优化的必要条件。虽然不确定性原理保证了某些信号在时间和频率上的稀疏性,但由啁啾组成的信号在时间和频率上都不是稀疏的。在本文中,我们提出了一个正交线性啁啾变换,离散线性啁啾变换(dct),以表示任何信号的线性啁啾,具有调制和对偶性质。使用dct可以评估信号在时间或频率上的稀疏性,如果在这两个域中都不是稀疏的,dct的调制和对偶特性提供了一种将信号转换为稀疏信号的方法。提出的dct在数据压缩方面的应用。通过使用稀疏和非稀疏测试信号以及实际信号来说明该变换。
{"title":"A discrete linear chirp transform (DLCT) for data compression","authors":"Osama A S Alkishriwo, L. Chaparro","doi":"10.1109/ISSPA.2012.6310490","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310490","url":null,"abstract":"Compressive sensing attempts to simplify the frequency transformation and thresholding steps, commonly done in data compression, into one. Sparseness of the signal, in either time or frequency, is required for the convex optimization in compressive sensing to perform well. Although sparseness of certain signals, in either time or frequency, is guaranteed by the uncertainty principle signals composed of chirps are not however sparse in either domain. In this paper we propose an orthogonal linear-chirp transform, the discrete linear chirp transform (DLCT), to represent any signal in terms of linear chirps, with modulation and dual properties. Using the DLCT the sparseness of the signal in either time or frequency can be assessed, and if not sparse in neither of these domains, the modulation and dual properties of the DLCT provide a way to transform the signal into a sparse signal. The application of the proposed DLCT is in data compression. The transformation is illustrated by using sparse and not sparse test signals as well as actual signals.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"1038 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131513220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Feedback-free and hybrid distributed video coding using neural networks 基于神经网络的无反馈混合分布式视频编码
Isaac Nickaein, M. Rahmati, S. S. Ghidary, A. Zohrabi
Distributed Video Coding (DVC) is a new class of video coding techniques with the aim of coding the decentralized video sources. While the Stanford Wyner-Ziv codec is a well-known architecture in DVC literature, one of its main drawbacks is the presence of a feedback channel from the decoder to the encoder. This feedback channel makes the use of the codec impractical in some applications. Since the only application of the feedback channel is in requesting more parity bits from the encoder, it could be omitted if the encoder estimates the required parity bits and sends them at once. In this paper, a new method of bitrate estimation using a neural network trained by a new set of features is proposed. In addition, a Hybrid mode is proposed that reduces computational complexity at the decoder in a conventional Wyner-Ziv codec.
分布式视频编码(DVC)是一种新的视频编码技术,其目的是对分散的视频源进行编码。虽然斯坦福Wyner-Ziv编解码器在DVC文献中是一个著名的架构,但它的主要缺点之一是从解码器到编码器的反馈通道的存在。这种反馈通道使得编解码器的使用在某些应用中不切实际。由于反馈通道的唯一应用是从编码器请求更多的奇偶校验位,如果编码器估计所需的奇偶校验位并立即发送它们,则可以省略反馈通道。本文提出了一种利用一组新的特征训练神经网络进行比特率估计的新方法。此外,提出了一种混合模式,降低了传统Wyner-Ziv编解码器解码器的计算复杂度。
{"title":"Feedback-free and hybrid distributed video coding using neural networks","authors":"Isaac Nickaein, M. Rahmati, S. S. Ghidary, A. Zohrabi","doi":"10.1109/ISSPA.2012.6310608","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310608","url":null,"abstract":"Distributed Video Coding (DVC) is a new class of video coding techniques with the aim of coding the decentralized video sources. While the Stanford Wyner-Ziv codec is a well-known architecture in DVC literature, one of its main drawbacks is the presence of a feedback channel from the decoder to the encoder. This feedback channel makes the use of the codec impractical in some applications. Since the only application of the feedback channel is in requesting more parity bits from the encoder, it could be omitted if the encoder estimates the required parity bits and sends them at once. In this paper, a new method of bitrate estimation using a neural network trained by a new set of features is proposed. In addition, a Hybrid mode is proposed that reduces computational complexity at the decoder in a conventional Wyner-Ziv codec.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134002834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Kalman filter approach to packet loss replacement in presence of additive noise 一种卡尔曼滤波方法用于加性噪声下的丢包替换
S. Miralavi, S. Ghorshi, Aidin Tahaei
A major problem in real-time packet-based communication systems, is misrouted or delayed packet which results in degraded perceived voice quality. If packets are not available on time, the packets are considered lost. The easiest solution in a network terminal receiver is to replace silence for the duration of lost speech segments. In a high quality communication system, to avoid degradation in speech quality due to packet loss, a suitable method or algorithm is needed to replace the missing segments of speech. In this paper, we introduce an adaptive filter for replacement of lost speech segment. In this method Kalman filter as a state-space based method will be used to predict the clean speech signal in presence of additive noise. The evaluation results show that Kalman filter has lower MSE compared to other methods in presence of White Gaussian Noise and background noise.
在基于分组的实时通信系统中,一个主要问题是分组路由错误或延迟导致感知语音质量下降。如果数据包没有及时可用,则认为数据包丢失。在网络终端接收器中,最简单的解决方案是在丢失的语音段期间替换沉默。在高质量的通信系统中,为了避免丢包导致的语音质量下降,需要一种合适的方法或算法来替换缺失的语音片段。本文介绍了一种自适应滤波器,用于替换丢失的语音片段。该方法将卡尔曼滤波作为一种基于状态空间的方法来预测存在加性噪声的干净语音信号。评价结果表明,在存在高斯白噪声和背景噪声的情况下,卡尔曼滤波具有较低的MSE。
{"title":"A Kalman filter approach to packet loss replacement in presence of additive noise","authors":"S. Miralavi, S. Ghorshi, Aidin Tahaei","doi":"10.1109/ISSPA.2012.6310566","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310566","url":null,"abstract":"A major problem in real-time packet-based communication systems, is misrouted or delayed packet which results in degraded perceived voice quality. If packets are not available on time, the packets are considered lost. The easiest solution in a network terminal receiver is to replace silence for the duration of lost speech segments. In a high quality communication system, to avoid degradation in speech quality due to packet loss, a suitable method or algorithm is needed to replace the missing segments of speech. In this paper, we introduce an adaptive filter for replacement of lost speech segment. In this method Kalman filter as a state-space based method will be used to predict the clean speech signal in presence of additive noise. The evaluation results show that Kalman filter has lower MSE compared to other methods in presence of White Gaussian Noise and background noise.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"268 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132909200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design of a neuromuscular disorders diagnostic system using human movement analysis 基于人体运动分析的神经肌肉疾病诊断系统的设计
C. O’Reilly, R. Plamondon
This communication summarizes the outcome of our research program on the design of a diagnostic system for neuromuscular disorders based on the analysis of human movement using the Kinematic Theory of Rapid Human Movements. Herein, this design problem is split in sub-problems which are then described. The solutions adopted at each design step are explained. As an example of application, typical results obtained so far for the assessment of the most important modifiable risk factors of brain stroke (diabetes, hypertension, hypercholesterolemia, obesity, cardiac problems, and cigarette smoking) are reported by the means of the area under the receiver operating characteristic curve (AUC).
本文总结了我们的研究项目的结果,该项目设计了一个基于人体快速运动运动学理论对人体运动分析的神经肌肉疾病诊断系统。在这里,这个设计问题被分解成子问题,然后对子问题进行描述。说明了在每个设计步骤中采用的解决方案。作为应用的一个例子,通过接受者工作特征曲线下面积(AUC)的方法报道了迄今为止获得的用于评估脑中风最重要的可改变危险因素(糖尿病、高血压、高胆固醇血症、肥胖、心脏问题和吸烟)的典型结果。
{"title":"Design of a neuromuscular disorders diagnostic system using human movement analysis","authors":"C. O’Reilly, R. Plamondon","doi":"10.1109/ISSPA.2012.6310660","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310660","url":null,"abstract":"This communication summarizes the outcome of our research program on the design of a diagnostic system for neuromuscular disorders based on the analysis of human movement using the Kinematic Theory of Rapid Human Movements. Herein, this design problem is split in sub-problems which are then described. The solutions adopted at each design step are explained. As an example of application, typical results obtained so far for the assessment of the most important modifiable risk factors of brain stroke (diabetes, hypertension, hypercholesterolemia, obesity, cardiac problems, and cigarette smoking) are reported by the means of the area under the receiver operating characteristic curve (AUC).","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"273 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121800623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 41
Blind source separation towards decentralized modal identification using compressive sampling 基于压缩采样的分散模态识别盲源分离
A. Sadhu, Bo Hu, S. Narasimhan
Wireless sensing technology has gained significant attention in the field of structural health monitoring (SHM). Various decentralized modal identification methods have been developed employing wireless sensors. However, one of themajor bottlenecks - especially dealing with long-term SHM - is the large volume of transmitted data. To overcome this problem, we present compressed sensing as a data reduction preprocessing tool within the framework of blind source separation. The results of source separation are ultimately used for modal identification of linear structures under ambient vibrations. When used together with sparsifying time-frequency decompositions, we show that accurate modal identification results are possible with high compression ratios. The main novelty in the method proposed here is in the application of compressive sensing for decentralized modal identification of civil structures.
无线传感技术在结构健康监测领域受到广泛关注。利用无线传感器开发了各种分散的模态识别方法。然而,主要的瓶颈之一——尤其是处理长期SHM——是传输的大量数据。为了克服这个问题,我们提出了压缩感知作为盲源分离框架下的数据约简预处理工具。源分离的结果最终用于环境振动下线性结构的模态识别。当与稀疏时频分解结合使用时,我们表明在高压缩比的情况下可以获得准确的模态识别结果。该方法的主要新颖之处在于将压缩感知应用于土木结构的分散模态识别。
{"title":"Blind source separation towards decentralized modal identification using compressive sampling","authors":"A. Sadhu, Bo Hu, S. Narasimhan","doi":"10.1109/ISSPA.2012.6310463","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310463","url":null,"abstract":"Wireless sensing technology has gained significant attention in the field of structural health monitoring (SHM). Various decentralized modal identification methods have been developed employing wireless sensors. However, one of themajor bottlenecks - especially dealing with long-term SHM - is the large volume of transmitted data. To overcome this problem, we present compressed sensing as a data reduction preprocessing tool within the framework of blind source separation. The results of source separation are ultimately used for modal identification of linear structures under ambient vibrations. When used together with sparsifying time-frequency decompositions, we show that accurate modal identification results are possible with high compression ratios. The main novelty in the method proposed here is in the application of compressive sensing for decentralized modal identification of civil structures.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121044197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Application of multipoint auscultation for heart sound diagnostic system (MAHDS) 多点听诊在心音诊断系统中的应用
Hadrina Sh-Hussain, S. Salleh, A. K. Ariff, Osama Alhamdani, T. T. Swee, A. M. Noor, H. Oemar, Khalid Yusoff
Humans are different in many ways: fat or thin, young or old, sick or healthy; they may differ in auscultation sites which may vary according to the patient's anatomy. Emphasis must be placed on the characteristics of heart sound based on its intensity which greatly depends on the location of the stethoscope to its pericardium. Each one of these areas will emphasize certain characteristics components of the heart sound. Grouping of the first heart sound (lub) is called the S1 features while the second heart sound (dub) is called the S2 features, the systolic or diastolic features are important factor to determine the types of murmurs. To this end, studies have been limited to reflect on the development and evaluation methods in order to detect the various components constituting signal of the heart sound at one specific auscultation point. The principle area of interest in this paper is, however placing the stethoscope at the semi lunar valve called aortic as position one and pulmonary as position two which will provide better quality of the S2 sound. The S1 heart sound can be heard more clearly in the atroventricle (AV) where the mitral valve as position three and tricuspid valve as position four. Comparative experiments with respect to MFCC feature, different number of HMM states and different number of gaussian mixtures were investigated to measure the influence of these factors on the classification performance at the four locations of auscultation of the heart sound. Interestingly, a five-state model outperformed the four-state model which was supposed to model the four basic components of the heart sounds. It can be said, a five-state average over all Gaussian mixtures model and at the four locations provide the best overall performance of 90.1% accuracy.
人类在很多方面都是不同的:胖或瘦,年轻或年老,生病或健康;听诊部位可能因患者的解剖结构而异。重点必须放在基于其强度的心音特征上,这在很大程度上取决于听诊器对心包的位置。这些区域中的每一个都会强调心音的某些特征组成部分。第一心音(lub)组称为S1特征,第二心音(dub)组称为S2特征,其中收缩期或舒张期特征是确定杂音类型的重要因素。为此,研究仅限于反思在某一听诊点检测构成心音信号的各种成分的开发和评价方法。然而,本文感兴趣的主要领域是,将听诊器放置在半月瓣(称为主动脉瓣的位置1和肺动脉瓣的位置2)上,这样可以提供更好的S2声音质量。在心室(AV)可以更清楚地听到S1心音,其中二尖瓣位于第3位,三尖瓣位于第4位。通过MFCC特征、不同数目的HMM状态和不同数目的高斯混合的对比实验,考察这些因素对心音听诊4个位置分类性能的影响。有趣的是,五状态模型比四状态模型表现得更好,四状态模型被认为是对心音的四个基本组成部分进行建模。可以说,在所有高斯混合模型和四个位置上的五状态平均值提供了90.1%的最佳整体性能。
{"title":"Application of multipoint auscultation for heart sound diagnostic system (MAHDS)","authors":"Hadrina Sh-Hussain, S. Salleh, A. K. Ariff, Osama Alhamdani, T. T. Swee, A. M. Noor, H. Oemar, Khalid Yusoff","doi":"10.1109/ISSPA.2012.6310669","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310669","url":null,"abstract":"Humans are different in many ways: fat or thin, young or old, sick or healthy; they may differ in auscultation sites which may vary according to the patient's anatomy. Emphasis must be placed on the characteristics of heart sound based on its intensity which greatly depends on the location of the stethoscope to its pericardium. Each one of these areas will emphasize certain characteristics components of the heart sound. Grouping of the first heart sound (lub) is called the S1 features while the second heart sound (dub) is called the S2 features, the systolic or diastolic features are important factor to determine the types of murmurs. To this end, studies have been limited to reflect on the development and evaluation methods in order to detect the various components constituting signal of the heart sound at one specific auscultation point. The principle area of interest in this paper is, however placing the stethoscope at the semi lunar valve called aortic as position one and pulmonary as position two which will provide better quality of the S2 sound. The S1 heart sound can be heard more clearly in the atroventricle (AV) where the mitral valve as position three and tricuspid valve as position four. Comparative experiments with respect to MFCC feature, different number of HMM states and different number of gaussian mixtures were investigated to measure the influence of these factors on the classification performance at the four locations of auscultation of the heart sound. Interestingly, a five-state model outperformed the four-state model which was supposed to model the four basic components of the heart sounds. It can be said, a five-state average over all Gaussian mixtures model and at the four locations provide the best overall performance of 90.1% accuracy.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116128789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
The SIMCA algorithm for processing ground penetrating radar data and its use in landmine detection 探地雷达数据处理的SIMCA算法及其在地雷探测中的应用
A. Sengodan, W. Cockshott
The main challenge of ground penetrating radar (GPR) based land mine detection is to have an accurate image analysis method that is capable of reducing false alarms. However an accurate image relies on having sufficient spatial resolution in the received signal. But because the diameter of an AP mine can be as low as 2cm and many soils have very high attenuations at frequencies above 3GHz, the accurate detection of landmines is accomplished using advanced algorithms. Using image reconstruction and by carrying out the system level analysis of the issues involved with recognition of landmines allows the landmine detection problem to be solved. The SIMCA ('SIMulated Correlation Algorithm') is a novel and accurate landmine detection tool that carries out correlation between a simulated GPR trace and a clutter1 removed original GPR trace. This correlation is performed using the MATLAB® processing environment. The authors tried using convolution and correlation. But in this paper the correlated results are presented because they produced better results. Validation of the results from the algorithm was done by an expert GPR user and 4 other general users who predict the location of landmines. These predicted results are compared with the ground truth data.
基于探地雷达(GPR)的地雷探测面临的主要挑战是拥有一种能够减少误报的准确图像分析方法。然而,准确的图像依赖于在接收信号中具有足够的空间分辨率。但是,由于AP地雷的直径可以低至2厘米,而且许多土壤在3GHz以上的频率上具有非常高的衰减,因此使用先进的算法可以实现对地雷的精确探测。利用图像重建和对地雷识别所涉及的问题进行系统级分析,可以解决地雷探测问题。SIMCA(“模拟相关算法”)是一种新型和精确的地雷探测工具,它在模拟GPR轨迹和去除杂波的原始GPR轨迹之间进行相关。这种关联是使用MATLAB®处理环境执行的。作者尝试使用卷积和相关。但本文给出了相关结果,因为它们产生了更好的结果。一名探地雷达专家用户和另外4名预测地雷位置的一般用户对算法结果进行了验证。这些预测结果与地面真实数据进行了比较。
{"title":"The SIMCA algorithm for processing ground penetrating radar data and its use in landmine detection","authors":"A. Sengodan, W. Cockshott","doi":"10.1109/ISSPA.2012.6310699","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310699","url":null,"abstract":"The main challenge of ground penetrating radar (GPR) based land mine detection is to have an accurate image analysis method that is capable of reducing false alarms. However an accurate image relies on having sufficient spatial resolution in the received signal. But because the diameter of an AP mine can be as low as 2cm and many soils have very high attenuations at frequencies above 3GHz, the accurate detection of landmines is accomplished using advanced algorithms. Using image reconstruction and by carrying out the system level analysis of the issues involved with recognition of landmines allows the landmine detection problem to be solved. The SIMCA ('SIMulated Correlation Algorithm') is a novel and accurate landmine detection tool that carries out correlation between a simulated GPR trace and a clutter1 removed original GPR trace. This correlation is performed using the MATLAB® processing environment. The authors tried using convolution and correlation. But in this paper the correlated results are presented because they produced better results. Validation of the results from the algorithm was done by an expert GPR user and 4 other general users who predict the location of landmines. These predicted results are compared with the ground truth data.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116512175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hyperspectral band selection based on graph clustering 基于图聚类的高光谱波段选择
R. Hedjam, M. Cheriet
In this paper we present a new method for hyperspectral band selection problem. The principle is to create a band adjacency graph (BAG) where the nodes represent the bands and the edges represent the similarity weights between the bands. The Markov Clustering Process (abbreviated MCL process) defines a sequence of stochastic matrices by alternation of two operators on the associated affinity matrix to form distinct clusters of high correlated bands. Each cluster is represented by one band and the representative bands will form the new data cube to be used in subsequent processing. The proposed algorithm is tested on a real dataset and compared against state-of-art. The results are promising.
本文提出了一种新的高光谱波段选择方法。其原理是创建一个带邻接图(BAG),其中节点表示带,边表示带之间的相似度权重。马尔可夫聚类过程(简称MCL过程)通过在关联亲和矩阵上交替两个算子来定义随机矩阵序列,从而形成高相关带的不同簇。每个集群由一个波段表示,这些代表性波段将形成新的数据立方体,用于后续处理。在实际数据集上对该算法进行了测试,并与现有算法进行了比较。结果是有希望的。
{"title":"Hyperspectral band selection based on graph clustering","authors":"R. Hedjam, M. Cheriet","doi":"10.1109/ISSPA.2012.6310665","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310665","url":null,"abstract":"In this paper we present a new method for hyperspectral band selection problem. The principle is to create a band adjacency graph (BAG) where the nodes represent the bands and the edges represent the similarity weights between the bands. The Markov Clustering Process (abbreviated MCL process) defines a sequence of stochastic matrices by alternation of two operators on the associated affinity matrix to form distinct clusters of high correlated bands. Each cluster is represented by one band and the representative bands will form the new data cube to be used in subsequent processing. The proposed algorithm is tested on a real dataset and compared against state-of-art. The results are promising.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116931776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Learning optimal warping window size of DTW for time series classification 学习时间序列分类DTW的最优翘曲窗大小
Qian Chen, Guyu Hu, Fang-lin Gu, Peng Xiang
The dynamic time warping (DTW) is a classic similarity measure which can handle the time warping issue in similarity computation of time series. And the DTW with constrained warping window is the most common and practical form of DTW. In this paper, the traditional learning method for optimal warping window of DTW is systematically analyzed. Then the time distance to measure the time deviation between two time series is introduced. Finally a new learning method for optimal warping window size based on DTW and time distance is proposed which can improve DTW classification accuracy with little additional computation. Experimental data show that the optimal DTW with best warping window get better classification accuracy when the new learning method is employed. Additionally, the classification accuracy is better than that of ERP and LCSS, and is close to that of TWED.
动态时间翘曲(DTW)是一种经典的相似性度量方法,可以处理时间序列相似性计算中的时间翘曲问题。而带约束翘曲窗口的DTW是最常见、最实用的DTW形式。本文系统地分析了DTW最优翘曲窗的传统学习方法。然后引入时间距离来度量两个时间序列之间的时间偏差。最后提出了一种基于DTW和时间距离的最优翘曲窗大小学习方法,该方法可以在较少的额外计算量下提高DTW分类精度。实验数据表明,当采用新的学习方法时,具有最佳翘曲窗口的最优DTW具有更好的分类精度。分类精度优于ERP和LCSS,接近TWED的分类精度。
{"title":"Learning optimal warping window size of DTW for time series classification","authors":"Qian Chen, Guyu Hu, Fang-lin Gu, Peng Xiang","doi":"10.1109/ISSPA.2012.6310488","DOIUrl":"https://doi.org/10.1109/ISSPA.2012.6310488","url":null,"abstract":"The dynamic time warping (DTW) is a classic similarity measure which can handle the time warping issue in similarity computation of time series. And the DTW with constrained warping window is the most common and practical form of DTW. In this paper, the traditional learning method for optimal warping window of DTW is systematically analyzed. Then the time distance to measure the time deviation between two time series is introduced. Finally a new learning method for optimal warping window size based on DTW and time distance is proposed which can improve DTW classification accuracy with little additional computation. Experimental data show that the optimal DTW with best warping window get better classification accuracy when the new learning method is employed. Additionally, the classification accuracy is better than that of ERP and LCSS, and is close to that of TWED.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121187727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
期刊
2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1