首页 > 最新文献

IET Signal Process.最新文献

英文 中文
Fully automated unsupervised artefact removal in multichannel electroencephalogram using wavelet-independent component analysis with density-based spatial clustering of application with noise 基于小波独立分量分析的多通道脑电图全自动无监督伪影去除与噪声应用的密度空间聚类
Pub Date : 2021-06-12 DOI: 10.1049/SIL2.12058
Chong Yeh Sai, N. Mokhtar, M. Iwahashi, P. Cumming, H. Arof
Faculty Grant University of Malaya, Grant/Award Number: GPF062A‐2018; Ministry of Higher Education, Malaysia, Grant/Award Number: UM.C/ HIR/MOHE/ENG/16; Universiti Malaya, Grant/ Award Number: PG260‐2015B; JSPS KAKENHI, Grant/Award Number: JP21K11934 Abstract Electroencephalography (EEG) is a method for recording electrical activities arising from the cortical surface of the brain, which has found wide applications not just in clinical medicine, but also in neuroscience research and studies of Brain‐Computer Interface (BCI). However, EEG recordings often suffer from distortions due to artefactual components that degrade the true EEG signals. Artefactual components are any unwanted signals recorded in the EEG spectrum that originate from sources other than the neurophysiological activity of the human brain. Examples of the origin of artefactual components include eye blinking, facial or scalp muscles activities, and electrode slippage. Techniques for automated artefact removal such as Wavelet Transform and Independent Component Analysis (ICA) have been used to remove or reduce the effect of artefactual components on the EEG signals. However, detecting or identifying the signal artefacts to be removed presents a great challenge, as EEG signal properties vary between individuals and age groups. Techniques that rely on some arbitrarily defined threshold often fail to identify accurately the signal artefacts in a given dataset. In this study, a method is proposed using unsupervised machine learning coupled with Wavelet‐ICA to remove EEG artefacts. Using Density‐Based Spatial Clustering of Application with Noise (DBSCAN), a classification accuracy of 97.9% is achieved in identifying artefactual components. DBSCAN achieved excellent and robust performance in identifying artefactual components during the Wavelet‐ICA process, especially in consideration of the low‐density nature of typical artefactual signals. This new hybrid method provides a scalable and unsupervised solution for automated artefact removal that should be applicable for a wide range of EEG data types.
马来亚大学教师资助,资助/奖励编号:GPF062A‐2018;马来西亚高等教育部,资助/奖励编号:UM.C/ HIR/MOHE/ENG/16;马来亚大学,资助/奖励编号:PG260‐2015B;摘要脑电图(EEG)是一种记录大脑皮层表面电活动的方法,不仅在临床医学中广泛应用,而且在神经科学研究和脑机接口(BCI)研究中也有广泛的应用。然而,由于人工成分降低了真实的EEG信号,因此EEG记录经常受到失真的影响。人造成分是脑电图频谱中记录的任何不需要的信号,这些信号来自人脑神经生理活动以外的来源。人造成分的来源包括眨眼、面部或头皮肌肉活动和电极滑动。小波变换和独立分量分析(ICA)等人工信号自动去除技术已被用于去除或减少人工信号对脑电信号的影响。然而,检测或识别要去除的信号伪影是一个很大的挑战,因为脑电图信号的特性在个体和年龄组之间是不同的。依赖于一些任意定义的阈值的技术通常无法准确识别给定数据集中的信号伪影。在这项研究中,提出了一种使用无监督机器学习与小波ICA相结合的方法来去除EEG伪影。使用基于密度的噪声应用空间聚类(DBSCAN),识别人工成分的分类准确率达到97.9%。在小波- ICA过程中,DBSCAN在识别伪信号方面取得了优异的鲁棒性,特别是考虑到典型伪信号的低密度特性。这种新的混合方法为自动去除伪影提供了一种可扩展的无监督解决方案,适用于广泛的EEG数据类型。
{"title":"Fully automated unsupervised artefact removal in multichannel electroencephalogram using wavelet-independent component analysis with density-based spatial clustering of application with noise","authors":"Chong Yeh Sai, N. Mokhtar, M. Iwahashi, P. Cumming, H. Arof","doi":"10.1049/SIL2.12058","DOIUrl":"https://doi.org/10.1049/SIL2.12058","url":null,"abstract":"Faculty Grant University of Malaya, Grant/Award Number: GPF062A‐2018; Ministry of Higher Education, Malaysia, Grant/Award Number: UM.C/ HIR/MOHE/ENG/16; Universiti Malaya, Grant/ Award Number: PG260‐2015B; JSPS KAKENHI, Grant/Award Number: JP21K11934 Abstract Electroencephalography (EEG) is a method for recording electrical activities arising from the cortical surface of the brain, which has found wide applications not just in clinical medicine, but also in neuroscience research and studies of Brain‐Computer Interface (BCI). However, EEG recordings often suffer from distortions due to artefactual components that degrade the true EEG signals. Artefactual components are any unwanted signals recorded in the EEG spectrum that originate from sources other than the neurophysiological activity of the human brain. Examples of the origin of artefactual components include eye blinking, facial or scalp muscles activities, and electrode slippage. Techniques for automated artefact removal such as Wavelet Transform and Independent Component Analysis (ICA) have been used to remove or reduce the effect of artefactual components on the EEG signals. However, detecting or identifying the signal artefacts to be removed presents a great challenge, as EEG signal properties vary between individuals and age groups. Techniques that rely on some arbitrarily defined threshold often fail to identify accurately the signal artefacts in a given dataset. In this study, a method is proposed using unsupervised machine learning coupled with Wavelet‐ICA to remove EEG artefacts. Using Density‐Based Spatial Clustering of Application with Noise (DBSCAN), a classification accuracy of 97.9% is achieved in identifying artefactual components. DBSCAN achieved excellent and robust performance in identifying artefactual components during the Wavelet‐ICA process, especially in consideration of the low‐density nature of typical artefactual signals. This new hybrid method provides a scalable and unsupervised solution for automated artefact removal that should be applicable for a wide range of EEG data types.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133917986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A computationally efficient non-iterative four-parameter sine fitting method 一种计算效率高的非迭代四参数正弦拟合方法
Pub Date : 2021-06-11 DOI: 10.1049/SIL2.12061
B. Renczes, V. Pálfi
National Research Development and Innovation Fund Abstract A computationally efficient four‐parameter least squares (LS) sine fitting method in the time domain is presented here. Unlike the most widespread procedure defined in the relevant IEEE standard, the proposed fitting is non‐iterative. This is achieved by the second‐order approximation of the cost function (CF) around the actual frequency of the sinusoidal excitation. The approximation reduces the four‐parameter non‐linear fitting problem to a defined set of three‐parameter linear fitting problems. Therefore, the computational demand can be predicted precisely, which is an essential aspect of real‐ life applications. Furthermore, the proposed method is shown to have increased numerical stability. Finally, measurements and computer simulations are carried out to demonstrate the reduced computational demand, while preserving the accuracy compared with the algorithm proposed in the IEEE standard.
摘要提出了一种计算效率高的时域四参数最小二乘(LS)正弦拟合方法。与相关IEEE标准中定义的最广泛的程序不同,所提出的拟合是非迭代的。这是通过成本函数(CF)在正弦激励的实际频率周围的二阶近似来实现的。该近似方法将四参数非线性拟合问题简化为三参数线性拟合问题的定义集。因此,计算需求可以精确地预测,这是现实生活应用的一个重要方面。此外,所提出的方法具有较高的数值稳定性。最后,进行了测量和计算机模拟,以证明与IEEE标准中提出的算法相比,该算法在保持精度的同时减少了计算量。
{"title":"A computationally efficient non-iterative four-parameter sine fitting method","authors":"B. Renczes, V. Pálfi","doi":"10.1049/SIL2.12061","DOIUrl":"https://doi.org/10.1049/SIL2.12061","url":null,"abstract":"National Research Development and Innovation Fund Abstract A computationally efficient four‐parameter least squares (LS) sine fitting method in the time domain is presented here. Unlike the most widespread procedure defined in the relevant IEEE standard, the proposed fitting is non‐iterative. This is achieved by the second‐order approximation of the cost function (CF) around the actual frequency of the sinusoidal excitation. The approximation reduces the four‐parameter non‐linear fitting problem to a defined set of three‐parameter linear fitting problems. Therefore, the computational demand can be predicted precisely, which is an essential aspect of real‐ life applications. Furthermore, the proposed method is shown to have increased numerical stability. Finally, measurements and computer simulations are carried out to demonstrate the reduced computational demand, while preserving the accuracy compared with the algorithm proposed in the IEEE standard.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133793954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unobtrusive human activity classification based on combined time-range and time-frequency domain signatures using ultrawideband radar 基于超宽带雷达时频域联合特征的非显眼人类活动分类
Pub Date : 2021-06-03 DOI: 10.1049/SIL2.12060
Mohamad Mostafa, S. Chamaani
Alexander von Humboldt‐Stiftung Abstract In this proposed approach to unobtrusive human activity classification, a two‐stage machine learning–based algorithm was applied to backscattered ultrawideband radar signals. First, a preprocessing step was applied for noise and clutter suppression. Then, feature extraction and a combination of time‐frequency (TF) and time‐range (TR) domains were used to extract the features of human activities. Then, feature analysis was performed to determine robust features relative to this kind of classification and reduce the dimensionality of the feature vector. Subsequently, different recognition algorithms were applied to group activities as fall or non‐fall and categorise their types. Finally, a performance study was used to choose the higher accuracy algorithm. The ensemble bagged tree and fine K‐nearest neighbour methods showed the best performance. The results show that the two‐stage classification was more accurate than the one‐stage. Finally, it was observed that the proposed approach using a combination of TR and TF domains with two‐stage recognition outperformed reference approaches mentioned in the literature, with average accuracies of 95.8% for eight‐activities classification and 96.9% in distinguishing between fall and non‐fall activities with efficient computational complexity.
在这个提出的不引人注目的人类活动分类方法中,一个基于两阶段机器学习的算法被应用于后向散射超宽带雷达信号。首先,对噪声和杂波进行抑制预处理。然后,利用特征提取和时频域(TF)和时程域(TR)相结合的方法提取人类活动特征。然后,进行特征分析,确定相对于这种分类的鲁棒特征,并降低特征向量的维数。随后,将不同的识别算法应用于跌倒或非跌倒的群体活动,并对其类型进行分类。最后,通过性能研究选择精度较高的算法。综合袋树法和精细K近邻法表现出最好的性能。结果表明,两阶段分类比一阶段分类更准确。最后,我们观察到,采用TR和TF结构域结合两阶段识别的方法优于文献中提到的参考方法,八种活动分类的平均准确率为95.8%,区分跌倒和非跌倒活动的平均准确率为96.9%,具有高效的计算复杂度。
{"title":"Unobtrusive human activity classification based on combined time-range and time-frequency domain signatures using ultrawideband radar","authors":"Mohamad Mostafa, S. Chamaani","doi":"10.1049/SIL2.12060","DOIUrl":"https://doi.org/10.1049/SIL2.12060","url":null,"abstract":"Alexander von Humboldt‐Stiftung Abstract In this proposed approach to unobtrusive human activity classification, a two‐stage machine learning–based algorithm was applied to backscattered ultrawideband radar signals. First, a preprocessing step was applied for noise and clutter suppression. Then, feature extraction and a combination of time‐frequency (TF) and time‐range (TR) domains were used to extract the features of human activities. Then, feature analysis was performed to determine robust features relative to this kind of classification and reduce the dimensionality of the feature vector. Subsequently, different recognition algorithms were applied to group activities as fall or non‐fall and categorise their types. Finally, a performance study was used to choose the higher accuracy algorithm. The ensemble bagged tree and fine K‐nearest neighbour methods showed the best performance. The results show that the two‐stage classification was more accurate than the one‐stage. Finally, it was observed that the proposed approach using a combination of TR and TF domains with two‐stage recognition outperformed reference approaches mentioned in the literature, with average accuracies of 95.8% for eight‐activities classification and 96.9% in distinguishing between fall and non‐fall activities with efficient computational complexity.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"545 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115249962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Arabic speech recognition using end-to-end deep learning 使用端到端深度学习的阿拉伯语语音识别
Pub Date : 2021-06-02 DOI: 10.1049/SIL2.12057
Hamzah A. Alsayadi, A. Abdelhamid, I. Hegazy, Z. Fayed
Arabic automatic speech recognition (ASR) methods with diacritics have the ability to be integrated with other systems better than Arabic ASR methods without diacritics. In this work, the application of state ‐ of ‐ the ‐ art end ‐ to ‐ end deep learning approaches is inves-tigated to build a robust diacritised Arabic ASR. These approaches are based on the Mel ‐ Frequency Cepstral Coefficients and the log Mel ‐ Scale Filter Bank energies as acoustic features. To the best of our knowledge, end ‐ to ‐ end deep learning approach has not been used in the task of diacritised Arabic automatic speech recognition. To fill this gap, this work presents a new CTC ‐ based ASR, CNN ‐ LSTM, and an attention ‐ based end ‐ to ‐ end approach for improving diacritisedArabic ASR. In addition, a word ‐ based language model is employed to achieve better results. The end ‐ to ‐ end approaches applied in this work are based on state ‐ of ‐ the ‐ art frameworks, namely ESPnet and Espresso. Training and testing of these frameworks are performed based on the Standard Arabic Single Speaker Corpus (SASSC), which contains 7 h of modern standard Arabic speech. Experimental results show that the CNN ‐ LSTM
带变音符的阿拉伯语自动语音识别(ASR)方法比不带变音符的阿拉伯语自动语音识别方法具有更好的与其他系统集成的能力。在这项工作中,研究了最先进的端到端深度学习方法的应用,以建立一个鲁棒的变音符阿拉伯语ASR。这些方法是基于Mel - Frequency倒谱系数和对数Mel - Scale滤波器组能量作为声学特征。据我们所知,端到端深度学习方法尚未用于变音符阿拉伯语自动语音识别任务。为了填补这一空白,本研究提出了一种新的基于CTC的ASR, CNN - LSTM,以及一种基于注意力的端到端方法,用于改进变音符基础ASR。此外,为了达到更好的效果,采用了基于词的语言模型。在这项工作中应用的端到端方法是基于最先进的框架,即ESPnet和Espresso。这些框架的训练和测试是基于标准阿拉伯语单语语料库(SASSC)进行的,该语料库包含7小时的现代标准阿拉伯语语音。实验结果表明,CNN - LSTM
{"title":"Arabic speech recognition using end-to-end deep learning","authors":"Hamzah A. Alsayadi, A. Abdelhamid, I. Hegazy, Z. Fayed","doi":"10.1049/SIL2.12057","DOIUrl":"https://doi.org/10.1049/SIL2.12057","url":null,"abstract":"Arabic automatic speech recognition (ASR) methods with diacritics have the ability to be integrated with other systems better than Arabic ASR methods without diacritics. In this work, the application of state ‐ of ‐ the ‐ art end ‐ to ‐ end deep learning approaches is inves-tigated to build a robust diacritised Arabic ASR. These approaches are based on the Mel ‐ Frequency Cepstral Coefficients and the log Mel ‐ Scale Filter Bank energies as acoustic features. To the best of our knowledge, end ‐ to ‐ end deep learning approach has not been used in the task of diacritised Arabic automatic speech recognition. To fill this gap, this work presents a new CTC ‐ based ASR, CNN ‐ LSTM, and an attention ‐ based end ‐ to ‐ end approach for improving diacritisedArabic ASR. In addition, a word ‐ based language model is employed to achieve better results. The end ‐ to ‐ end approaches applied in this work are based on state ‐ of ‐ the ‐ art frameworks, namely ESPnet and Espresso. Training and testing of these frameworks are performed based on the Standard Arabic Single Speaker Corpus (SASSC), which contains 7 h of modern standard Arabic speech. Experimental results show that the CNN ‐ LSTM","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129945317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Sensor fusion with high-order moments constraints using projection-based neural network 基于投影神经网络的高阶矩约束传感器融合
Pub Date : 2021-05-22 DOI: 10.1049/SIL2.12046
Y. Alipouri, Reza Rafati Bonab, Le Zhong
Yousef Alipouri, Department of Mechanical Engineering, University of Alberta, 9211‐116 Street NW, Edmonton, AB T6G 1H9, Canada. Email: alipouri@ualberta.ca Abstract The existing sensor fusion methods mainly follow two approaches, including Gaussian and Non‐Gaussian‐based sensor fusion approaches. In the first approach, fusion weights are determined based on the second moment. This approach is unable to account for high‐order moments; thus, it is not accurate for non‐Gaussian sensors. In the second approach, the fusion weights are determined using distribution functions of sensor data. Though this method is more accurate than Gaussian‐based sensor fusion, it is a sophisticated method as it requires all moments information of each sensor, which is either not available or at least hard to be identified. Here, we propose an alternative way to determine the fusion weights by a limited number of n (>2) moment information of data. The proposed method makes trades off between accuracy and complexity. The other problem, which has not been studied in the literature, is existence of constraints on moments. The proposed method can address this problem as well. To do this, a projection‐based neural network‐based optimization method is used to calculate the optimal fusion weights that satisfy moment constraints. A practical application of the proposed sensor fusion method on predicting occupancy for heating, ventilation, and air conditioning (HVAC) is conducted.
Yousef Alipouri,阿尔伯塔大学机械工程系,加拿大埃德蒙顿市NW街9211‐116号,AB T6G 1H9。摘要现有的传感器融合方法主要有基于高斯和基于非高斯的传感器融合方法。在第一种方法中,基于第二矩确定融合权重。这种方法无法解释高阶矩;因此,它对非高斯传感器是不准确的。在第二种方法中,利用传感器数据的分布函数确定融合权值。虽然该方法比基于高斯的传感器融合更精确,但它需要每个传感器的所有矩信息,这些信息要么不可用,要么很难识别,因此是一种复杂的方法。在这里,我们提出了一种替代方法,通过有限数量的n(>2)个数据的矩信息来确定融合权重。所提出的方法在准确性和复杂性之间进行了权衡。另一个尚未在文献中研究的问题是矩约束的存在性。所提出的方法也可以解决这个问题。为此,采用基于投影的神经网络优化方法计算满足矩约束的最优融合权值。将所提出的传感器融合方法应用于暖通空调(HVAC)入住率预测。
{"title":"Sensor fusion with high-order moments constraints using projection-based neural network","authors":"Y. Alipouri, Reza Rafati Bonab, Le Zhong","doi":"10.1049/SIL2.12046","DOIUrl":"https://doi.org/10.1049/SIL2.12046","url":null,"abstract":"Yousef Alipouri, Department of Mechanical Engineering, University of Alberta, 9211‐116 Street NW, Edmonton, AB T6G 1H9, Canada. Email: alipouri@ualberta.ca Abstract The existing sensor fusion methods mainly follow two approaches, including Gaussian and Non‐Gaussian‐based sensor fusion approaches. In the first approach, fusion weights are determined based on the second moment. This approach is unable to account for high‐order moments; thus, it is not accurate for non‐Gaussian sensors. In the second approach, the fusion weights are determined using distribution functions of sensor data. Though this method is more accurate than Gaussian‐based sensor fusion, it is a sophisticated method as it requires all moments information of each sensor, which is either not available or at least hard to be identified. Here, we propose an alternative way to determine the fusion weights by a limited number of n (>2) moment information of data. The proposed method makes trades off between accuracy and complexity. The other problem, which has not been studied in the literature, is existence of constraints on moments. The proposed method can address this problem as well. To do this, a projection‐based neural network‐based optimization method is used to calculate the optimal fusion weights that satisfy moment constraints. A practical application of the proposed sensor fusion method on predicting occupancy for heating, ventilation, and air conditioning (HVAC) is conducted.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130830344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved performance of gas turbine diagnostics using new noise-removal techniques 使用新的降噪技术提高燃气轮机诊断性能
Pub Date : 2021-05-03 DOI: 10.1049/SIL2.12042
Mohsen Ensafjoo, M. Safizadeh
Mir Saeed Safizadeh, Associate Professor, School of Mechanical Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846‐13114, Iran. Email: safizadeh@iust.ac.ir Abstract Fault detection and identification (FDI) systems are responsible for detecting and identifying errors as fast as possible with high reliability. These systems should be robust against noise and avoid false warnings. Herein, the perspective of using wavelet filters for noise reduction in FDI systems has been investigated. To achieve that, a wavelet filter and a wavelet‐hybrid filter are presented and compared in noise reduction with conventional filters, such as linear filters (finite impulse response (FIR) and infinite impulse response), median filter, and FIR‐median hybrid filter (SWFMH). The comparison is conducted in two steps: (a) noise reduction of a noisy sample signal from a gas turbine and (b) increasing the fault detection accuracy of a gas turbine FDI system in the presence of noisy data. In step one, a conventional noisy sample signal of a gas turbine is presented, and the performances of different filters in noise reduction of the signal have been studied. In step two, considering that excessive filtering can result in the loss of useful information for an FDI system's diagnostics, the performances of an FDI system coupled with different filters have been evaluated. For this purpose, an FDI system utilising an adaptive neuro‐fuzzy inference system and gas path analysis has been designed. It is demonstrated that, in some cases, the wavelet filters have a lower denoising capability for a noisy sample signal, but when used together with the FDI system, they have better performance. Therefore, wavelet filters are better suited for use in FDI systems.
Mir Saeed Safizadeh,伊朗科技大学机械工程学院副教授,德黑兰Narmak, 16846‐13114摘要FDI (Fault detection and identification)系统负责以高可靠性,以最快的速度检测和识别错误。这些系统应该具有抗噪声和避免错误警告的鲁棒性。本文研究了在FDI系统中使用小波滤波器进行降噪的前景。为了实现这一目标,提出了小波滤波器和小波混合滤波器,并将其与传统滤波器(如线性滤波器(有限脉冲响应(FIR)和无限脉冲响应)、中值滤波器和FIR中值混合滤波器(SWFMH))在降噪方面进行了比较。比较分为两个步骤:(a)对燃气轮机噪声样本信号进行降噪,(b)提高燃气轮机FDI系统在噪声数据存在下的故障检测精度。第一步,给出了燃气轮机常规噪声样本信号,研究了不同滤波器对该信号的降噪效果。在第二步中,考虑到过度滤波可能导致FDI系统诊断有用信息的丢失,对耦合不同滤波器的FDI系统的性能进行了评估。为此,设计了一种利用自适应神经模糊推理系统和气路分析的FDI系统。研究表明,在某些情况下,小波滤波器对噪声样本信号的去噪能力较低,但与FDI系统一起使用时,它们具有更好的性能。因此,小波滤波器更适合用于FDI系统。
{"title":"Improved performance of gas turbine diagnostics using new noise-removal techniques","authors":"Mohsen Ensafjoo, M. Safizadeh","doi":"10.1049/SIL2.12042","DOIUrl":"https://doi.org/10.1049/SIL2.12042","url":null,"abstract":"Mir Saeed Safizadeh, Associate Professor, School of Mechanical Engineering, Iran University of Science and Technology, Narmak, Tehran, 16846‐13114, Iran. Email: safizadeh@iust.ac.ir Abstract Fault detection and identification (FDI) systems are responsible for detecting and identifying errors as fast as possible with high reliability. These systems should be robust against noise and avoid false warnings. Herein, the perspective of using wavelet filters for noise reduction in FDI systems has been investigated. To achieve that, a wavelet filter and a wavelet‐hybrid filter are presented and compared in noise reduction with conventional filters, such as linear filters (finite impulse response (FIR) and infinite impulse response), median filter, and FIR‐median hybrid filter (SWFMH). The comparison is conducted in two steps: (a) noise reduction of a noisy sample signal from a gas turbine and (b) increasing the fault detection accuracy of a gas turbine FDI system in the presence of noisy data. In step one, a conventional noisy sample signal of a gas turbine is presented, and the performances of different filters in noise reduction of the signal have been studied. In step two, considering that excessive filtering can result in the loss of useful information for an FDI system's diagnostics, the performances of an FDI system coupled with different filters have been evaluated. For this purpose, an FDI system utilising an adaptive neuro‐fuzzy inference system and gas path analysis has been designed. It is demonstrated that, in some cases, the wavelet filters have a lower denoising capability for a noisy sample signal, but when used together with the FDI system, they have better performance. Therefore, wavelet filters are better suited for use in FDI systems.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"315 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"119183774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Cross refinement network with edge detection for salient object detection 基于边缘检测的交叉细化网络显著目标检测
Pub Date : 2021-04-26 DOI: 10.1049/sil2.12041
Junjiang Xiang, Xiao Hu, Jiayu Ding, Xiang Tan, Jiaxin Yang
National Natural Science Foundation of China China, Grant/Award Number: 62076075 Abstract Salient object detection aims to identify the most attractive objects from images. However, their boundaries are typically of poor quality when predicted using available methods. One or multiple objects may also be left undetected if the image contains multiple objects. To solve these problems, this article proposes the novel cross refinement network, which consists of a Res2Net‐based backbone network; a fusion network equipped with four convolutional block attention modules and four edge‐salient cross units; and a detection network with an edge enhancement unit and a residual refinement network (RNN). For RNN training, the rough salient maps generated using the DUTS‐TR dataset are treated as a special training dataset. Compared to existing methods, the proposed network can effectively detect all objects as well as improve the boundaries of the detected objects by performing experiments on five benchmark datasets. Based on the experimental results, the proposed network outperforms existing methods both objectively and subjectively.
摘要显著目标检测旨在从图像中识别出最吸引人的物体。然而,当使用现有方法预测时,它们的边界通常质量较差。如果图像包含多个对象,一个或多个对象也可能未被检测到。为了解决这些问题,本文提出了一种新型的交叉细化网络,该网络由基于Res2Net的骨干网组成;一个包含四个卷积块注意力模块和四个边缘突出交叉单元的融合网络;以及具有边缘增强单元和残差细化网络(RNN)的检测网络。对于RNN训练,使用DUTS‐TR数据集生成的粗糙显著图被视为特殊的训练数据集。通过在5个基准数据集上的实验,与现有方法相比,本文提出的网络可以有效地检测所有目标,并改善检测目标的边界。实验结果表明,该网络在客观上和主观上都优于现有方法。
{"title":"Cross refinement network with edge detection for salient object detection","authors":"Junjiang Xiang, Xiao Hu, Jiayu Ding, Xiang Tan, Jiaxin Yang","doi":"10.1049/sil2.12041","DOIUrl":"https://doi.org/10.1049/sil2.12041","url":null,"abstract":"National Natural Science Foundation of China China, Grant/Award Number: 62076075 Abstract Salient object detection aims to identify the most attractive objects from images. However, their boundaries are typically of poor quality when predicted using available methods. One or multiple objects may also be left undetected if the image contains multiple objects. To solve these problems, this article proposes the novel cross refinement network, which consists of a Res2Net‐based backbone network; a fusion network equipped with four convolutional block attention modules and four edge‐salient cross units; and a detection network with an edge enhancement unit and a residual refinement network (RNN). For RNN training, the rough salient maps generated using the DUTS‐TR dataset are treated as a special training dataset. Compared to existing methods, the proposed network can effectively detect all objects as well as improve the boundaries of the detected objects by performing experiments on five benchmark datasets. Based on the experimental results, the proposed network outperforms existing methods both objectively and subjectively.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"118750351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Distributed multiple speaker tracking based on time delay estimation in microphone array network 麦克风阵列网络中基于时延估计的分布式多说话人跟踪
Pub Date : 2020-12-01 DOI: 10.1049/iet-spr.2019.0613
Rong Wang, Zhe Chen, F. Yin
Multiple speaker tracking in distributed microphone array (DMA) network is a challenging task. A critical issue for multiple speaker scenarios is to distinguish the ambiguous observation and associate it to the corresponding speaker, especially under reverberant and noisy environments. To address the problem, a distributed multiple speaker tracking method based on time delay estimation in DMA is proposed in this study. Specifically, the time delay estimated by the generalised crosscorrelation function is treated as an observation. In order to distinguish the observation for each speaker, the possible time delays, refer to as candidates, are extracted based on data association technique. Considering the ambient influence, a time delay estimation strategy is designed to calculate the time delay for each speaker from the candidates. Finally, only the reliable time delays in DMA are propagated throughout the whole network by diffusion fusion algorithm and used for updating the speakers' state within the distributed Kalman filter framework. The proposed approach can track multiple speakers successfully in a non-centralised manner under reverberant and noisy environments. Simulation results indicate that, compared with other methods, the proposed method can achieve a smaller root mean square error for multiple speaker tracking, especially in adverse conditions.
分布式麦克风阵列(DMA)网络中的多扬声器跟踪是一项具有挑战性的任务。多扬声器场景的一个关键问题是区分模糊观察并将其与相应的扬声器联系起来,特别是在混响和噪声环境下。为了解决这一问题,本文提出了一种基于时延估计的分布式多说话人跟踪方法。具体地说,由广义互相关函数估计的时间延迟被视为观测值。为了区分每个说话人的观察结果,基于数据关联技术提取可能的时间延迟,称为候选时间延迟。考虑周围环境的影响,设计了一种时延估计策略,从候选发言者中计算每个发言者的时延。最后,通过扩散融合算法将DMA中可靠的时延传播到整个网络,并在分布式卡尔曼滤波框架内用于更新说话人的状态。该方法可以在混响和噪声环境下,以非集中的方式成功地跟踪多个说话人。仿真结果表明,与其他方法相比,该方法可以实现较小的多说话人跟踪均方根误差,特别是在不利条件下。
{"title":"Distributed multiple speaker tracking based on time delay estimation in microphone array network","authors":"Rong Wang, Zhe Chen, F. Yin","doi":"10.1049/iet-spr.2019.0613","DOIUrl":"https://doi.org/10.1049/iet-spr.2019.0613","url":null,"abstract":"Multiple speaker tracking in distributed microphone array (DMA) network is a challenging task. A critical issue for multiple speaker scenarios is to distinguish the ambiguous observation and associate it to the corresponding speaker, especially under reverberant and noisy environments. To address the problem, a distributed multiple speaker tracking method based on time delay estimation in DMA is proposed in this study. Specifically, the time delay estimated by the generalised crosscorrelation function is treated as an observation. In order to distinguish the observation for each speaker, the possible time delays, refer to as candidates, are extracted based on data association technique. Considering the ambient influence, a time delay estimation strategy is designed to calculate the time delay for each speaker from the candidates. Finally, only the reliable time delays in DMA are propagated throughout the whole network by diffusion fusion algorithm and used for updating the speakers' state within the distributed Kalman filter framework. The proposed approach can track multiple speakers successfully in a non-centralised manner under reverberant and noisy environments. Simulation results indicate that, compared with other methods, the proposed method can achieve a smaller root mean square error for multiple speaker tracking, especially in adverse conditions.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133640540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Near orthogonal discrete quaternion Fourier transform components via an optimal frequency rescaling approach 近正交离散四元数傅立叶变换分量通过最优频率重标方法
Pub Date : 2020-12-01 DOI: 10.1049/iet-spr.2020.0199
Lingyue Hu, B. Ling, C. Y. Ho, Guoheng Huang
: The quaternion-valued signals consist of four signal components. The discrete quaternion Fourier transform is to map these four signal components in the time domain to that in the frequency domain. These four signal components in the frequency domain are called the discrete quaternion Fourier transform components. There are a total of 16 inner products among any two discrete quaternion Fourier transform components. The total orthogonal error among the discrete quaternion Fourier transform components is defined based on these 16 inner products. This study aims to find the optimal quaternion number in the discrete quaternion Fourier transforms so that the total orthogonal errors among the discrete quaternion Fourier transform components are minimised. It is worth noting that finding the optimal quaternion number in the discrete quaternion Fourier transform is equivalent to finding the optimal rescaling factors. Since the discrete quaternion Fourier transform components are expressed in terms of the high-order polynomials of the trigonometric functions of the rescaling factors, this optimisation problem is non-convex. To address this problem, a two-stage approach is employed for finding the solution to the optimisation problem. The comparison results show that the authors proposed method outperforms the existing methods in terms of achieving the low total orthogonal error among the discrete quaternion Fourier transform components.
四元数信号由四个信号分量组成。离散四元数傅里叶变换就是将这四个信号分量在时域映射到频域。这四个信号分量在频域中被称为离散四元数傅立叶变换分量。在任意两个离散四元数傅里叶变换分量之间共有16个内积。基于这16个内积,定义了离散四元数傅里叶变换分量间的总正交误差。本研究旨在找出离散四元数傅里叶变换中最优的四元数个数,使离散四元数傅里叶变换分量间的总正交误差最小化。值得注意的是,在离散四元数傅里叶变换中找到最优的四元数等于找到最优的重标因子。由于离散四元数傅里叶变换分量是用重标度因子的三角函数的高阶多项式来表示的,所以这个优化问题是非凸的。为了解决这个问题,我们采用了两阶段的方法来寻找优化问题的解决方案。对比结果表明,本文提出的方法在实现离散四元数傅里叶变换分量的低总正交误差方面优于现有方法。
{"title":"Near orthogonal discrete quaternion Fourier transform components via an optimal frequency rescaling approach","authors":"Lingyue Hu, B. Ling, C. Y. Ho, Guoheng Huang","doi":"10.1049/iet-spr.2020.0199","DOIUrl":"https://doi.org/10.1049/iet-spr.2020.0199","url":null,"abstract":": The quaternion-valued signals consist of four signal components. The discrete quaternion Fourier transform is to map these four signal components in the time domain to that in the frequency domain. These four signal components in the frequency domain are called the discrete quaternion Fourier transform components. There are a total of 16 inner products among any two discrete quaternion Fourier transform components. The total orthogonal error among the discrete quaternion Fourier transform components is defined based on these 16 inner products. This study aims to find the optimal quaternion number in the discrete quaternion Fourier transforms so that the total orthogonal errors among the discrete quaternion Fourier transform components are minimised. It is worth noting that finding the optimal quaternion number in the discrete quaternion Fourier transform is equivalent to finding the optimal rescaling factors. Since the discrete quaternion Fourier transform components are expressed in terms of the high-order polynomials of the trigonometric functions of the rescaling factors, this optimisation problem is non-convex. To address this problem, a two-stage approach is employed for finding the solution to the optimisation problem. The comparison results show that the authors proposed method outperforms the existing methods in terms of achieving the low total orthogonal error among the discrete quaternion Fourier transform components.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122076936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Joint empirical mode decomposition, exponential function estimation and L 1 norm approach for estimating mean value of photoplethysmogram and blood glucose level 联合经验模态分解、指数函数估计和l1范数法估计光容积图和血糖水平的平均值
Pub Date : 2020-12-01 DOI: 10.1049/iet-spr.2020.0096
Xueling Zhou, B. Ling, Zikang Tian, Yiu-Wai Ho, K. Teo
{"title":"Joint empirical mode decomposition, exponential function estimation and L 1 norm approach for estimating mean value of photoplethysmogram and blood glucose level","authors":"Xueling Zhou, B. Ling, Zikang Tian, Yiu-Wai Ho, K. Teo","doi":"10.1049/iet-spr.2020.0096","DOIUrl":"https://doi.org/10.1049/iet-spr.2020.0096","url":null,"abstract":"","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128966338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
IET Signal Process.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1