首页 > 最新文献

2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)最新文献

英文 中文
A Fair Model is not Fair in a Biased Environment 公平的模式在有偏见的环境中是不公平的
Y. Sato, S. Maeda, M. Akasaka, M. Nishigaki, Tetsushi Ohki
Facial images contain sensitive attributes such as skin color, and the elimination of them from the input in the face recognition is not easy. In addition, the input data includes the influence of the environment in which the system is actually used, so the interaction between sensitive attributes and the environment may make it inherently difficult for the facial feature extractor to extract facial features. Therefore, studies on the fairness of face recognition should consider the fairness of environmental factors. Common datasets used to evaluate the fairness of face recognition includes a variety of environmental factors, and the fairness evaluated by these datasets are usually the fairness in a typical shooting environment. However, a dataset that includes only extremely biased environmental factors potentially results in less equity among attributes. We construct a dataset with pseudo-biased environmental factors by dynamically changing environmental factors such as brightness in the test data. The results also show that the biased environmental factors deteriorate the fairness inter-attribute. Also, we showed that the distinguished attributes in terms of fairness in a biased environment vary based on the architecture of the model and the training dataset.
人脸图像包含皮肤颜色等敏感属性,在人脸识别中从输入中消除这些敏感属性并不容易。此外,输入数据包含系统实际使用环境的影响,因此敏感属性与环境之间的相互作用可能会使面部特征提取器在提取面部特征时存在固有的困难。因此,对人脸识别公平性的研究应考虑环境因素的公平性。通常用于评价人脸识别公平性的数据集包含多种环境因素,这些数据集评价的公平性通常是典型射击环境下的公平性。然而,如果数据集只包含极端偏颇的环境因素,则可能导致属性之间的公平性降低。我们通过动态改变测试数据中的环境因子(如亮度)来构建具有伪偏差环境因子的数据集。结果还表明,环境因素的偏倚会使公平间属性恶化。此外,我们还表明,在有偏见的环境中,公平性的区分属性根据模型的架构和训练数据集而变化。
{"title":"A Fair Model is not Fair in a Biased Environment","authors":"Y. Sato, S. Maeda, M. Akasaka, M. Nishigaki, Tetsushi Ohki","doi":"10.23919/APSIPAASC55919.2022.9980134","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980134","url":null,"abstract":"Facial images contain sensitive attributes such as skin color, and the elimination of them from the input in the face recognition is not easy. In addition, the input data includes the influence of the environment in which the system is actually used, so the interaction between sensitive attributes and the environment may make it inherently difficult for the facial feature extractor to extract facial features. Therefore, studies on the fairness of face recognition should consider the fairness of environmental factors. Common datasets used to evaluate the fairness of face recognition includes a variety of environmental factors, and the fairness evaluated by these datasets are usually the fairness in a typical shooting environment. However, a dataset that includes only extremely biased environmental factors potentially results in less equity among attributes. We construct a dataset with pseudo-biased environmental factors by dynamically changing environmental factors such as brightness in the test data. The results also show that the biased environmental factors deteriorate the fairness inter-attribute. Also, we showed that the distinguished attributes in terms of fairness in a biased environment vary based on the architecture of the model and the training dataset.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"54 35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123806519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual Prototypical Network for Robust Few-shot Image Classification 鲁棒少拍图像分类的双原型网络
Qi Song, Zebin Peng, Luchen Ji, Xiaochen Yang, Xiaoxu Li
Deep neural networks have outperformed humans on some image recognition and classification tasks. However, with the emergence of various novel classes, it remains a chal-lenge to continuously expand the learning capability of such networks from a limited number of labeled samples. Metric-based approaches have been playing a key role in few-shot image classification, but most of them measure the distance between samples in the metric space using only a single metric function. In this paper, we propose a Dual Prototypical Network (DPN) to improve the test-time robustness of the classical prototypical network. The proposed method not only focuses on the distance of the original features, but also adds perturbation noise to the image and calculates the distance of noisy features. By enforcing the model to predict well under both metrics, more representative and robust class prototypes are learned and thus lead to better generalization performance. We validate our method on three fine-grained datasets in both clean and noisy settings.
深度神经网络在一些图像识别和分类任务上的表现超过了人类。然而,随着各种新颖类的出现,如何从有限的标记样本中不断扩展网络的学习能力仍然是一个挑战。基于度量的方法在少拍图像分类中发挥了关键作用,但大多数方法仅使用单个度量函数来测量度量空间中样本之间的距离。本文提出了一种双原型网络(Dual Prototypical Network, DPN)来提高经典原型网络的测试时间鲁棒性。该方法不仅关注原始特征的距离,而且在图像中加入扰动噪声并计算噪声特征的距离。通过强制模型在两个指标下进行预测,可以学习到更具代表性和健壮性的类原型,从而获得更好的泛化性能。我们在三个细粒度的数据集上验证了我们的方法,包括干净和嘈杂的设置。
{"title":"Dual Prototypical Network for Robust Few-shot Image Classification","authors":"Qi Song, Zebin Peng, Luchen Ji, Xiaochen Yang, Xiaoxu Li","doi":"10.23919/APSIPAASC55919.2022.9979898","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979898","url":null,"abstract":"Deep neural networks have outperformed humans on some image recognition and classification tasks. However, with the emergence of various novel classes, it remains a chal-lenge to continuously expand the learning capability of such networks from a limited number of labeled samples. Metric-based approaches have been playing a key role in few-shot image classification, but most of them measure the distance between samples in the metric space using only a single metric function. In this paper, we propose a Dual Prototypical Network (DPN) to improve the test-time robustness of the classical prototypical network. The proposed method not only focuses on the distance of the original features, but also adds perturbation noise to the image and calculates the distance of noisy features. By enforcing the model to predict well under both metrics, more representative and robust class prototypes are learned and thus lead to better generalization performance. We validate our method on three fine-grained datasets in both clean and noisy settings.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122603981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unified Compression and Watermarking Scheme for MT-BTC Images MT-BTC图像的统一压缩和水印方案
Jing-Ming Guo, Sankarasrinivasan Seshathiri
As most multimedia is compressed for optimal storage or transmission, the watermarking during compression is more appropriate and demanding. Multi-tone block truncation coding image (MT-BTC) is a latest and superior version of halftone based block truncation coding images. In this paper, a novel watermarking strategy is proposed for MT-BTC images using the adaptive dither array selection (ADAS) and inter-tone shifting. ADAS method utilize dither array constructed using two different gaussian filters based on the Human Visual System (HVS) model to embed the watermark information. Further, various configuration of dither array such as actual, conjugate, transpose and transpose conjugate is used to embed more data. Moreover, a new approach termed inter-tone shifting is also proposed to improve the decoding rate and security. The decoding scheme is performed using the pattern similarity on the dithered watermarked image. For result validation, a 2,4,6 and 8-Tone watermark image is embedded in the 4-Tone MTBTC image. From extensive analysis on decoding rate and robustness, it has been validated that the proposed scheme outperforms many halftone watermarking and consistent with conventional BTC methods.
由于大多数多媒体都是为了优化存储或传输而进行压缩的,因此压缩过程中的水印更加合适,要求也更高。多色调分块截断编码图像(MT-BTC)是基于半色调分块截断编码图像的最新、优越版本。本文提出了一种基于自适应抖动阵列选择(ADAS)和色调间移位的MT-BTC图像水印策略。ADAS方法利用基于人类视觉系统(HVS)模型的两种不同高斯滤波器构建的抖动阵列来嵌入水印信息。此外,还采用了实际抖动阵列、共轭抖动阵列、转置抖动阵列和转置共轭抖动阵列等多种构型来嵌入更多的数据。此外,为了提高译码率和安全性,还提出了一种新的译码方法——音间移位。利用抖动水印图像的模式相似度进行解码。为了验证结果,在4- tone mbittc图像中嵌入2,4,6和8-Tone水印图像。通过对译码率和鲁棒性的广泛分析,验证了该方案优于许多半色调水印,与传统的BTC方法一致。
{"title":"A Unified Compression and Watermarking Scheme for MT-BTC Images","authors":"Jing-Ming Guo, Sankarasrinivasan Seshathiri","doi":"10.23919/APSIPAASC55919.2022.9979994","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979994","url":null,"abstract":"As most multimedia is compressed for optimal storage or transmission, the watermarking during compression is more appropriate and demanding. Multi-tone block truncation coding image (MT-BTC) is a latest and superior version of halftone based block truncation coding images. In this paper, a novel watermarking strategy is proposed for MT-BTC images using the adaptive dither array selection (ADAS) and inter-tone shifting. ADAS method utilize dither array constructed using two different gaussian filters based on the Human Visual System (HVS) model to embed the watermark information. Further, various configuration of dither array such as actual, conjugate, transpose and transpose conjugate is used to embed more data. Moreover, a new approach termed inter-tone shifting is also proposed to improve the decoding rate and security. The decoding scheme is performed using the pattern similarity on the dithered watermarked image. For result validation, a 2,4,6 and 8-Tone watermark image is embedded in the 4-Tone MTBTC image. From extensive analysis on decoding rate and robustness, it has been validated that the proposed scheme outperforms many halftone watermarking and consistent with conventional BTC methods.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122884414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Speech Dereverberation Based on Adaptive Weighted Prediction Error Algorithm with Eigenvector Extraction 基于特征向量提取的自适应加权预测误差算法的鲁棒语音去噪
Yitong Chen, Wen Zhang
Due to its satisfactory performance and no need for room impulse response information, the adaptive weighted prediction error (AWPE) algorithm is promising for speech dereverberation in practice. However, the robustness of AWPE to additive noise is low. To alleviate this problem, this paper proposes a variant of the AWPE algorithm that is based on eigen-decomposition of the signal auto-correlation matrix to construct the reference signal. By using the dominant eigenvector as the reference signal, a linear prediction filter is designed which has a better performance to predict the late reverberation even when the additive noise level is high. To reduce the computational complexity of the standard eigen-decomposition operation in the proposed AWPE variant, an online eigenvector extraction algorithm based on a fixed-point iteration algorithm is presented. Simulations are conducted to validate the effectiveness and robustness of the proposed algorithms over the standard AWPE algorithm.
自适应加权预测误差(AWPE)算法由于其令人满意的性能和不需要房间脉冲响应信息,在实际应用中具有较好的语音去噪效果。然而,AWPE对加性噪声的鲁棒性较低。为了解决这个问题,本文提出了一种基于信号自相关矩阵的特征分解来构造参考信号的AWPE算法的变体。利用优势特征向量作为参考信号,设计了一种即使在加性噪声水平较高时也能较好预测后期混响的线性预测滤波器。为了降低AWPE变体中标准特征分解运算的计算复杂度,提出了一种基于不动点迭代算法的特征向量在线提取算法。通过仿真验证了所提算法相对于标准AWPE算法的有效性和鲁棒性。
{"title":"Robust Speech Dereverberation Based on Adaptive Weighted Prediction Error Algorithm with Eigenvector Extraction","authors":"Yitong Chen, Wen Zhang","doi":"10.23919/APSIPAASC55919.2022.9979829","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979829","url":null,"abstract":"Due to its satisfactory performance and no need for room impulse response information, the adaptive weighted prediction error (AWPE) algorithm is promising for speech dereverberation in practice. However, the robustness of AWPE to additive noise is low. To alleviate this problem, this paper proposes a variant of the AWPE algorithm that is based on eigen-decomposition of the signal auto-correlation matrix to construct the reference signal. By using the dominant eigenvector as the reference signal, a linear prediction filter is designed which has a better performance to predict the late reverberation even when the additive noise level is high. To reduce the computational complexity of the standard eigen-decomposition operation in the proposed AWPE variant, an online eigenvector extraction algorithm based on a fixed-point iteration algorithm is presented. Simulations are conducted to validate the effectiveness and robustness of the proposed algorithms over the standard AWPE algorithm.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121241582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fast Converge Spectral Modulation Sensitive Active Noise Control System 一种快速收敛频谱调制灵敏主动噪声控制系统
Kah-Meng Cheong, Yih-Liang Shen, T. Chi
Psychoacoustic active noise control (PANC) systems have been proposed to improve the noise reduction performance of active noise control (ANC) systems by considering hearing properties. PANC systems are usually implemented in the filtered@ $mathbf{x}$ least mean square (FxLMS) architecture. In this paper, we propose a PANC system using a simple and stable fast affine projection (FAP) algorithm in the filtered-x architecture, namely the filtered-x conjugate gradient fast affine projection (FxCGFAP) PANC. The proposed FxCGFAP PANC system converges faster than typical FxLMS ANC and PANC systems. The proposed PANC system considers not only the sensitivity of human hearing to frequency but also the sensitivity to spectral modulation. Objective and subjective evaluations have been conducted. The evaluation results show that the proposed system outperforms the other three systems in terms of objective loudness scores and subjective ratings. The proposed system has been implemented on the Tensilica HiFi3 DSP platform with the fixed-point data format.
心理声主动噪声控制(PANC)系统通过考虑听觉特性来提高主动噪声控制(ANC)系统的降噪性能。PANC系统通常在过滤@ $mathbf{x}$最小均方(FxLMS)架构中实现。本文提出了一种基于滤波-x结构的快速仿射投影(FAP)算法的PANC系统,即滤波-x共轭梯度快速仿射投影(FxCGFAP) PANC。提出的FxCGFAP PANC系统比典型的FxLMS ANC和PANC系统收敛速度更快。该系统不仅考虑了人的听觉对频率的敏感性,而且考虑了对频谱调制的敏感性。进行了客观和主观评价。评价结果表明,该系统在客观响度评分和主观评分方面均优于其他三种系统。该系统已在Tensilica HiFi3 DSP平台上实现,采用定点数据格式。
{"title":"A Fast Converge Spectral Modulation Sensitive Active Noise Control System","authors":"Kah-Meng Cheong, Yih-Liang Shen, T. Chi","doi":"10.23919/APSIPAASC55919.2022.9979983","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979983","url":null,"abstract":"Psychoacoustic active noise control (PANC) systems have been proposed to improve the noise reduction performance of active noise control (ANC) systems by considering hearing properties. PANC systems are usually implemented in the filtered@ $mathbf{x}$ least mean square (FxLMS) architecture. In this paper, we propose a PANC system using a simple and stable fast affine projection (FAP) algorithm in the filtered-x architecture, namely the filtered-x conjugate gradient fast affine projection (FxCGFAP) PANC. The proposed FxCGFAP PANC system converges faster than typical FxLMS ANC and PANC systems. The proposed PANC system considers not only the sensitivity of human hearing to frequency but also the sensitivity to spectral modulation. Objective and subjective evaluations have been conducted. The evaluation results show that the proposed system outperforms the other three systems in terms of objective loudness scores and subjective ratings. The proposed system has been implemented on the Tensilica HiFi3 DSP platform with the fixed-point data format.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127731112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Unfolding-Aided Sum-Product Algorithm for Error Correction of CRC Coded Short Message 基于深度展开辅助的CRC编码短信纠错和积算法
Qilin Zhang, S. Ibi, Takumi Takahashi, H. Iwai
This paper proposes a deep unfolding-aided sum-product algorithm (SPA) for error correction decoding of cyclic redundancy check (CRC) coded short message. SPA is a practical decoding algorithm for linear codes without requiring enormous computational complexity. However, if the SPA is used as it is for CRC codes, belief correlation and outliers will be induced in the iterative decoding process, resulting in lousy correction capability. To compensate for this drawback, we design a SPA-based decoding process for CRC code that incorporates a data-driven design based on deep learning and learning optimization of in-ternal trainable parameters. Considering the operation principle of soft-decision decoder, a novel loss function based on a weighted average of negentropy, which is a key measure to evaluate the Gaussianity, and BCE of the decoder output is proposed. Numerical results show that the proposed algorithm improves the bit error rate (BER) performance with deep unfolding and negentropy-aware loss function.
提出了一种用于循环冗余校验(CRC)编码短信纠错解码的深度展开辅助和积算法(SPA)。SPA是一种实用的线性码译码算法,不需要大量的计算复杂度。但是,如果对CRC码使用SPA,在迭代解码过程中会产生信念相关和离群值,导致校正能力差。为了弥补这一缺点,我们为CRC代码设计了一个基于spa的解码过程,该过程结合了基于深度学习和内部可训练参数学习优化的数据驱动设计。针对软判决译码器的工作原理,提出了一种基于负熵加权平均的损失函数,作为评价译码器输出高斯性和BCE的关键指标。数值结果表明,该算法通过深度展开和负熵感知损失函数改善了误码率性能。
{"title":"Deep Unfolding-Aided Sum-Product Algorithm for Error Correction of CRC Coded Short Message","authors":"Qilin Zhang, S. Ibi, Takumi Takahashi, H. Iwai","doi":"10.23919/APSIPAASC55919.2022.9979875","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979875","url":null,"abstract":"This paper proposes a deep unfolding-aided sum-product algorithm (SPA) for error correction decoding of cyclic redundancy check (CRC) coded short message. SPA is a practical decoding algorithm for linear codes without requiring enormous computational complexity. However, if the SPA is used as it is for CRC codes, belief correlation and outliers will be induced in the iterative decoding process, resulting in lousy correction capability. To compensate for this drawback, we design a SPA-based decoding process for CRC code that incorporates a data-driven design based on deep learning and learning optimization of in-ternal trainable parameters. Considering the operation principle of soft-decision decoder, a novel loss function based on a weighted average of negentropy, which is a key measure to evaluate the Gaussianity, and BCE of the decoder output is proposed. Numerical results show that the proposed algorithm improves the bit error rate (BER) performance with deep unfolding and negentropy-aware loss function.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"342 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132632371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Signal Completion Algorithm with Cyclic Convolutional Smoothing 基于循环卷积平滑的快速信号补全算法
Hiromu Takayama, Tatsuya Yokota
Recently, signal completion methods using delay-embedding transforms (DT) have been actively studied. Since the DT is an operation to transform a signal into a Hankel matrix, the high computational cost associated with the increase in data size is an issue. In this study, we consider modeling smooth signals based on inverse delay-embedding instead of delay-embedding. We propose a new algorithm that incorporates the properties of the delay-embedding-based methods while reducing the computational cost. The proposed algorithm takes advantage of the inverse delay-embedding being a cyclic convolution, and the computational complexity can be reduced to $mathcal{O}(NlogN)$ by transforming the optimization problem to Fourier space. Numerical experiments with typical signals and audio data show the effectiveness of the proposed algorithm in signal declipping and completion problems.
近年来,基于延迟嵌入变换(DT)的信号补全方法得到了积极的研究。由于DT是将信号转换为汉克尔矩阵的操作,因此与数据大小增加相关的高计算成本是一个问题。在本研究中,我们考虑基于逆延迟嵌入而不是延迟嵌入来建模平滑信号。我们提出了一种新的算法,它结合了基于延迟嵌入的方法的特性,同时降低了计算成本。该算法利用了逆延迟嵌入是一个循环卷积的优点,通过将优化问题转化为傅里叶空间,将计算复杂度降低到$mathcal{O}(NlogN)$。典型信号和音频数据的数值实验表明了该算法在信号去噪和补全问题中的有效性。
{"title":"Fast Signal Completion Algorithm with Cyclic Convolutional Smoothing","authors":"Hiromu Takayama, Tatsuya Yokota","doi":"10.23919/APSIPAASC55919.2022.9980284","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980284","url":null,"abstract":"Recently, signal completion methods using delay-embedding transforms (DT) have been actively studied. Since the DT is an operation to transform a signal into a Hankel matrix, the high computational cost associated with the increase in data size is an issue. In this study, we consider modeling smooth signals based on inverse delay-embedding instead of delay-embedding. We propose a new algorithm that incorporates the properties of the delay-embedding-based methods while reducing the computational cost. The proposed algorithm takes advantage of the inverse delay-embedding being a cyclic convolution, and the computational complexity can be reduced to $mathcal{O}(NlogN)$ by transforming the optimization problem to Fourier space. Numerical experiments with typical signals and audio data show the effectiveness of the proposed algorithm in signal declipping and completion problems.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133511797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Adaptive Denoising Auto-Encoder Networks for ECG Noise Cancellation via Time-Frequency Domain 基于时频域心电降噪的深度自适应自编码器网络
Amir Mohammadisrab, Poorya Aghaomidi, Jalil Mazloum, M. Akbarzadeh, M. Orooji, N. Mokari, H. Yanikomeroglu
In this paper, we study the performance of a deep adaptive denoising auto-encoder network (DeepADAENet) for electrocardiogram (ECG) signal noise cancelation in the time-frequency domain for practical use cases. In order to achieve a higher resolution in distinguishing the noise from valuable data, the fractional Stockwell transform (FrST) is exploited to convert the ECG to the time-frequency image. The magnitude of the time-frequency version of the ECG is noise-canceled using DeepADAENet. Then, inverse FrST is utilized to return the denoised time-frequency ECG into the time domain. Furthermore, we use the MIT-BID Apnea-ECG database (APNEA-ECG) for preparing the dataset due to various physiologies and records compared with other ECG databases. Moreover, muscle artifacts (MA), baseline wander (BW), and electrode motion (EM) from the MIT-BID Noise Stress Test Database (NSTDB) are utilized to make noisy this clean dataset. The ECG signals recorded by non-clinical devices contain more noise than clinical recording. Accordingly, by changing the coefficient and frequency of noise resources, we attempt to close the simulated noisy signal to reality. Results reveal the excellent performance of DeepADAENet compared with similar work in terms of signal-to-noise ratio (SNR), root mean square error (RMSE), and percent root mean square difference (PRD).
在本文中,我们研究了一种深度自适应降噪自编码器网络(DeepADAENet)在实际用例中用于心电图(ECG)信号噪声消除的时频域性能。为了从有价值的数据中获得更高的分辨噪声,利用分数阶斯托克韦尔变换(FrST)将心电信号转换为时频图像。使用DeepADAENet对ECG的时频幅值进行噪声消除。然后利用逆first st将去噪后的时频心电信号返回到时域。此外,与其他ECG数据库相比,我们使用MIT-BID呼吸暂停-ECG数据库(Apnea-ECG)来准备数据集,因为它具有各种生理和记录。此外,利用MIT-BID噪声压力测试数据库(NSTDB)中的肌肉伪像(MA)、基线漂移(BW)和电极运动(EM)对这个干净的数据集进行噪声处理。非临床设备记录的心电信号比临床记录的噪声更大。因此,我们试图通过改变噪声源的系数和频率,使模拟的噪声信号接近真实。结果表明,与同类算法相比,DeepADAENet在信噪比(SNR)、均方根误差(RMSE)和均方根差(PRD)百分比方面表现优异。
{"title":"Deep Adaptive Denoising Auto-Encoder Networks for ECG Noise Cancellation via Time-Frequency Domain","authors":"Amir Mohammadisrab, Poorya Aghaomidi, Jalil Mazloum, M. Akbarzadeh, M. Orooji, N. Mokari, H. Yanikomeroglu","doi":"10.23919/APSIPAASC55919.2022.9980058","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980058","url":null,"abstract":"In this paper, we study the performance of a deep adaptive denoising auto-encoder network (DeepADAENet) for electrocardiogram (ECG) signal noise cancelation in the time-frequency domain for practical use cases. In order to achieve a higher resolution in distinguishing the noise from valuable data, the fractional Stockwell transform (FrST) is exploited to convert the ECG to the time-frequency image. The magnitude of the time-frequency version of the ECG is noise-canceled using DeepADAENet. Then, inverse FrST is utilized to return the denoised time-frequency ECG into the time domain. Furthermore, we use the MIT-BID Apnea-ECG database (APNEA-ECG) for preparing the dataset due to various physiologies and records compared with other ECG databases. Moreover, muscle artifacts (MA), baseline wander (BW), and electrode motion (EM) from the MIT-BID Noise Stress Test Database (NSTDB) are utilized to make noisy this clean dataset. The ECG signals recorded by non-clinical devices contain more noise than clinical recording. Accordingly, by changing the coefficient and frequency of noise resources, we attempt to close the simulated noisy signal to reality. Results reveal the excellent performance of DeepADAENet compared with similar work in terms of signal-to-noise ratio (SNR), root mean square error (RMSE), and percent root mean square difference (PRD).","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134038951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Bidirectional Motion Estimation Using Feature Refinement for HDR Imaging 基于HDR成像特征细化的增强双向运动估计
An Gia Vien, Truong Thanh Nhat Mai, Seonghyun Park, Gahyeong Kim, Chul Lee
We propose a high dynamic range (HDR) image synthesis algorithm based on enhanced bidirectional motion estimation using feature refinement. First, we extract multiscale features from input low dynamic range (LDR) images and then estimate accurate motion vector fields between them in a coarse-to-fine manner via progressive refinement. Then, we estimate adaptive local kernels to merge only valid information in the spatio-exposed neighboring pixels for synthesis. Finally, we refine the initially merged image by exploiting global information to further improve synthesis performance. Experimental results show that the proposed algorithm outperforms state-of-the-art algorithms in quantitative and qualitative comparisons.
我们提出了一种基于增强双向运动估计的高动态范围(HDR)图像合成算法。首先,我们从输入的低动态范围(LDR)图像中提取多尺度特征,然后通过逐步细化,以粗到精的方式估计出它们之间精确的运动向量场。然后,我们估计自适应局部核,只合并空间暴露的相邻像素中的有效信息进行合成。最后,利用全局信息对初始合并图像进行细化,进一步提高合成性能。实验结果表明,该算法在定量和定性比较方面都优于现有算法。
{"title":"Enhanced Bidirectional Motion Estimation Using Feature Refinement for HDR Imaging","authors":"An Gia Vien, Truong Thanh Nhat Mai, Seonghyun Park, Gahyeong Kim, Chul Lee","doi":"10.23919/APSIPAASC55919.2022.9980026","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980026","url":null,"abstract":"We propose a high dynamic range (HDR) image synthesis algorithm based on enhanced bidirectional motion estimation using feature refinement. First, we extract multiscale features from input low dynamic range (LDR) images and then estimate accurate motion vector fields between them in a coarse-to-fine manner via progressive refinement. Then, we estimate adaptive local kernels to merge only valid information in the spatio-exposed neighboring pixels for synthesis. Finally, we refine the initially merged image by exploiting global information to further improve synthesis performance. Experimental results show that the proposed algorithm outperforms state-of-the-art algorithms in quantitative and qualitative comparisons.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133801581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vibration measurement using spatial shifting coherent digital holography 利用空间位移相干数字全息测量振动
Long Ngo, Q. Pham
In this research, we proposed a new digital coherent holographic configuration for accurately measuring the three-dimensional (3D) vibration of the object. The vibration was indirectly measured by the displacement of the three mirrors attached on the object. The hologram recorded by the camera consisting of 6 sub-holograms can be separated by Fourier transform and appropriated spatial band-pass filters. Three phase sets extracted from 3 sub-holograms of the reference mirror and 3 object mirrors were used to calculate the displacement of the object in 3D directions. The relation between the displacement of the object and the phases of the sub-holograms was related to the wavelength of the light source, therefore this allows observing the vibration of the object with nano-scale accuracy in z direction and much smaller than the pixel size of the camera accuracy in x and y directions.
在这项研究中,我们提出了一种新的数字相干全息配置,用于精确测量物体的三维振动。振动是通过附着在物体上的三个反射镜的位移间接测量的。由6个子全息图组成的相机记录的全息图可以通过傅里叶变换和适当的空间带通滤波器进行分离。从参考镜和3个目标镜的3个子全息图中提取3个相位集,计算目标在三维方向上的位移。物体的位移与子全息图的相位之间的关系与光源的波长有关,因此可以在z方向上以纳米级精度观察物体的振动,而在x和y方向上则远远小于相机精度的像素尺寸。
{"title":"Vibration measurement using spatial shifting coherent digital holography","authors":"Long Ngo, Q. Pham","doi":"10.23919/APSIPAASC55919.2022.9980262","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980262","url":null,"abstract":"In this research, we proposed a new digital coherent holographic configuration for accurately measuring the three-dimensional (3D) vibration of the object. The vibration was indirectly measured by the displacement of the three mirrors attached on the object. The hologram recorded by the camera consisting of 6 sub-holograms can be separated by Fourier transform and appropriated spatial band-pass filters. Three phase sets extracted from 3 sub-holograms of the reference mirror and 3 object mirrors were used to calculate the displacement of the object in 3D directions. The relation between the displacement of the object and the phases of the sub-holograms was related to the wavelength of the light source, therefore this allows observing the vibration of the object with nano-scale accuracy in z direction and much smaller than the pixel size of the camera accuracy in x and y directions.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133853087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1