首页 > 最新文献

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific最新文献

英文 中文
HMM-based Thai speech synthesis using unsupervised stress context labeling 基于hmm的泰语语音合成方法的无监督重音语境标注
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi
This paper describes an approach to HMM-based Thai speech synthesis using stress context. It has been shown that context related to stressed/unstressed syllable information (stress context) significantly improves the tone correctness of the synthetic speech, but there is a problem of requiring a manual context labeling process in tone modeling. To reduce costs for the stress context labeling, we propose an unsupervised technique for automatic labeling based on the characteristics of Thai stressed syllables, namely, having high FO movement and long duration. In the proposed technique, we use log FO variance and duration of each syllable to classify it into one of stress-related context classes. Objective and subjective evaluation results show that the proposed context labeling gives comparable performance to that conducted carefully by a human in terms of tone naturalness of synthetic speech.
本文描述了一种利用重音上下文的基于hmm的泰语语音合成方法。研究表明,与重读/非重读音节信息相关的语境(重音语境)显著提高了合成语音的音调正确性,但在声调建模中存在需要人工语境标注过程的问题。为了降低重音上下文标注的成本,本文基于泰语重音音节高FO运动和长持续时间的特点,提出了一种无监督的自动标注技术。在提出的技术中,我们使用每个音节的对数方差和持续时间将其分类到一个与重音相关的上下文类中。客观评价和主观评价结果表明,本文提出的上下文标注方法在合成语音的音调自然度方面与人工标注效果相当。
{"title":"HMM-based Thai speech synthesis using unsupervised stress context labeling","authors":"Decha Moungsri, Tomoki Koriyama, Takao Kobayashi","doi":"10.1109/APSIPA.2014.7041599","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041599","url":null,"abstract":"This paper describes an approach to HMM-based Thai speech synthesis using stress context. It has been shown that context related to stressed/unstressed syllable information (stress context) significantly improves the tone correctness of the synthetic speech, but there is a problem of requiring a manual context labeling process in tone modeling. To reduce costs for the stress context labeling, we propose an unsupervised technique for automatic labeling based on the characteristics of Thai stressed syllables, namely, having high FO movement and long duration. In the proposed technique, we use log FO variance and duration of each syllable to classify it into one of stress-related context classes. Objective and subjective evaluation results show that the proposed context labeling gives comparable performance to that conducted carefully by a human in terms of tone naturalness of synthetic speech.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116881436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Phase-correction-system (PCS) design utilizing successively linearized optimization 采用连续线性化优化的相位校正系统设计
N. Ito
In this paper, we first derive a new phase-error function for designing all-pass phase-correction-system (PCS) that is needed in digital communication systems and other signal-processing systems. Based on the new phase-error function, we propose a linearized optimization scheme for linearizing the non-linear optimization problem as a successively linearized optimization problem. An illustrative example is given to validate the proposed successively linearized optimization scheme.
本文首先推导了一种新的相位误差函数,用于设计数字通信系统和其他信号处理系统中所需要的全通相位校正系统。基于新的相位误差函数,提出了一种线性化优化方案,将非线性优化问题线性化为连续线性化优化问题。通过实例验证了所提出的连续线性化优化方案的有效性。
{"title":"Phase-correction-system (PCS) design utilizing successively linearized optimization","authors":"N. Ito","doi":"10.1109/APSIPA.2014.7041513","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041513","url":null,"abstract":"In this paper, we first derive a new phase-error function for designing all-pass phase-correction-system (PCS) that is needed in digital communication systems and other signal-processing systems. Based on the new phase-error function, we propose a linearized optimization scheme for linearizing the non-linear optimization problem as a successively linearized optimization problem. An illustrative example is given to validate the proposed successively linearized optimization scheme.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117040612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A comparative study of preprocessing methods in the parametric loudspeaker 参数化扬声器预处理方法的比较研究
Chuang Shi, Y. Kajikawa
The parametric loudspeaker is a directional sound reproduction device making use of the parametric sound generation. A sound beam is formed as a result of nonlinear interactions between ultrasonic beams. The parametric loudspeaker is advantageous in transmitting an equally narrow sound beam from a smaller emitter as compared to the conventional loudspeaker. Due to this advantage, parametric loudspeakers are readily applied in a variety of sound field control applications, such as creation of personal listening spots, spatial audio reproduction, and active noise control. However, there is a long concerned drawback of the parametric loudspeaker, whereby harmonic and intermodulation distortions are byproducts of the parametric sound generation. Hence, a comparative study of six preprocessing methods, including two proposed methods from this paper, is carried out. Harmonic and intermodulation distortions are demonstrated by experiments.
参数扬声器是一种利用参数声音产生的定向声音再现装置。声束是超声波束之间非线性相互作用的结果。与传统扬声器相比,参数扬声器在从较小的发射器发射同样窄的声束方面具有优势。由于这一优势,参数扬声器很容易应用于各种声场控制应用,例如创建个人收听点,空间音频再现和主动噪声控制。然而,参数扬声器有一个长期关注的缺点,即谐波和互调失真是参数声音产生的副产品。因此,对六种预处理方法进行了比较研究,其中包括本文提出的两种预处理方法。实验证明了谐波和互调失真。
{"title":"A comparative study of preprocessing methods in the parametric loudspeaker","authors":"Chuang Shi, Y. Kajikawa","doi":"10.1109/APSIPA.2014.7041678","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041678","url":null,"abstract":"The parametric loudspeaker is a directional sound reproduction device making use of the parametric sound generation. A sound beam is formed as a result of nonlinear interactions between ultrasonic beams. The parametric loudspeaker is advantageous in transmitting an equally narrow sound beam from a smaller emitter as compared to the conventional loudspeaker. Due to this advantage, parametric loudspeakers are readily applied in a variety of sound field control applications, such as creation of personal listening spots, spatial audio reproduction, and active noise control. However, there is a long concerned drawback of the parametric loudspeaker, whereby harmonic and intermodulation distortions are byproducts of the parametric sound generation. Hence, a comparative study of six preprocessing methods, including two proposed methods from this paper, is carried out. Harmonic and intermodulation distortions are demonstrated by experiments.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116371630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Computational cost analysis and implementation of accelerated iterative shrinkage smoothing 加速迭代收缩平滑的计算成本分析与实现
Dabwitso Kasauka, Hiroshi Tsutsui, H. Okuhata, Y. Miyanaga
In this paper, we present a computational cost analysis result of accelerated iterative shrinkage smoothing algorithm, which is one of promising image smoothing algorithms with sufficient smoothing quality results and reduced processing time. The main motivation of this cost analysis is to provide a base for efficient hardware implementation. We implemented it in a lower-level programming language with OpenCV library as opposed to the MATLAB implementation. The resolution dependency of the processing time is also illustrated.
本文给出了加速迭代收缩平滑算法的计算成本分析结果,该算法是一种很有前途的图像平滑算法,具有足够的平滑质量,减少了处理时间。这种成本分析的主要动机是为有效的硬件实现提供基础。我们使用底层编程语言OpenCV库来实现它,而不是MATLAB实现。分析了处理时间对分辨率的依赖性。
{"title":"Computational cost analysis and implementation of accelerated iterative shrinkage smoothing","authors":"Dabwitso Kasauka, Hiroshi Tsutsui, H. Okuhata, Y. Miyanaga","doi":"10.1109/APSIPA.2014.7041808","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041808","url":null,"abstract":"In this paper, we present a computational cost analysis result of accelerated iterative shrinkage smoothing algorithm, which is one of promising image smoothing algorithms with sufficient smoothing quality results and reduced processing time. The main motivation of this cost analysis is to provide a base for efficient hardware implementation. We implemented it in a lower-level programming language with OpenCV library as opposed to the MATLAB implementation. The resolution dependency of the processing time is also illustrated.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125737872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Chaotic encoder-decoder on FPGA for crypto system 用于加密系统的FPGA混沌编解码器
Chanathip Roeksukrungrueang, Xaysamone Dittaphong, K. Khongsomboon, Nounchan Panyanouyong, S. Chivapreecha
An implementation of chaotic encoder-decoder on FPGA will be proposed in this paper. Overflow non-linearity by using 2's complement number in digital filter causes the phenomenon called "Chaos" in digital filter. An 1ER filter can be used to chaotic encoder while an FIR filter is used to chaotic decoder. Filter coefficients of both encoder and decoder can be compared to the secret key in private-key crypto system. However, if filter coefficients of chaotic decoder are not same as filter coefficients of chaotic encoder, ciphertext cannot decrypt to get original plaintext. Both chaotic encoder and decoder will be implemented on FPGA to demonstrate the hardware prototype of chaotic crypto system.
本文提出了一种基于FPGA的混沌编解码器的实现方法。数字滤波器中利用2的补数产生的溢流非线性导致了数字滤波器中的“混沌”现象。1ER滤波器可用于混沌编码器,FIR滤波器可用于混沌解码器。在私钥加密系统中,编码器和解码器的滤波系数都可以与密钥进行比较。但是,如果混沌解码器的滤波系数与混沌编码器的滤波系数不一致,密文就无法解密得到原始明文。混沌编码器和解码器都将在FPGA上实现,以演示混沌密码系统的硬件原型。
{"title":"Chaotic encoder-decoder on FPGA for crypto system","authors":"Chanathip Roeksukrungrueang, Xaysamone Dittaphong, K. Khongsomboon, Nounchan Panyanouyong, S. Chivapreecha","doi":"10.1109/APSIPA.2014.7041740","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041740","url":null,"abstract":"An implementation of chaotic encoder-decoder on FPGA will be proposed in this paper. Overflow non-linearity by using 2's complement number in digital filter causes the phenomenon called \"Chaos\" in digital filter. An 1ER filter can be used to chaotic encoder while an FIR filter is used to chaotic decoder. Filter coefficients of both encoder and decoder can be compared to the secret key in private-key crypto system. However, if filter coefficients of chaotic decoder are not same as filter coefficients of chaotic encoder, ciphertext cannot decrypt to get original plaintext. Both chaotic encoder and decoder will be implemented on FPGA to demonstrate the hardware prototype of chaotic crypto system.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126111111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
2D semi-NMF of scale-frequency map for environmental sound classification 用于环境声音分类的二维比例-频率图半nmf
Wen-Chi Hsieh, Chin-Wen Ho, Viet-Hang Duong, Yuan-Shan Lee, Jia-Ching Wang
This paper introduces a novel two dimensional feature extraction method for environmental sound classification, based on two dimensional semi-nonnegative matrix factorization (2D Semi-NMF) of scale-frequency maps. We first extract scale-frequency maps (SFMs) from the input signals, and this feature is considered preserving scale and frequency characteristics of signals. Second, a 2D Semi-NMF method is applied on SFMs to get more information of the input signals. We use the combinational coefficients extracted from 2D Semi-NMF for classification. Experimental results on an 8 class environmental sound database show that 2D Semi-NMF has better classification accuracy than traditional ID NMF and 2D NMF Also, applying 2D Semi-NMF on SFMs will get slightly improvement than SFMs features alone.
提出了一种新的基于比例-频率映射的二维半非负矩阵分解(2D Semi-NMF)的环境声分类二维特征提取方法。我们首先从输入信号中提取比例-频率映射(SFMs),并认为该特征保留了信号的比例和频率特征。其次,将二维半nmf方法应用于SFMs,以获得更多的输入信号信息。我们使用从2D Semi-NMF中提取的组合系数进行分类。在8类环境声数据库上的实验结果表明,二维半NMF比传统的ID NMF和二维NMF具有更好的分类精度,并且在SFMs上应用二维半NMF比单独使用SFMs特征有略高的提高。
{"title":"2D semi-NMF of scale-frequency map for environmental sound classification","authors":"Wen-Chi Hsieh, Chin-Wen Ho, Viet-Hang Duong, Yuan-Shan Lee, Jia-Ching Wang","doi":"10.1109/APSIPA.2014.7041681","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041681","url":null,"abstract":"This paper introduces a novel two dimensional feature extraction method for environmental sound classification, based on two dimensional semi-nonnegative matrix factorization (2D Semi-NMF) of scale-frequency maps. We first extract scale-frequency maps (SFMs) from the input signals, and this feature is considered preserving scale and frequency characteristics of signals. Second, a 2D Semi-NMF method is applied on SFMs to get more information of the input signals. We use the combinational coefficients extracted from 2D Semi-NMF for classification. Experimental results on an 8 class environmental sound database show that 2D Semi-NMF has better classification accuracy than traditional ID NMF and 2D NMF Also, applying 2D Semi-NMF on SFMs will get slightly improvement than SFMs features alone.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124661536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Physical layer security using multi-band transmission considering channel selection for cognitive radio networks 考虑信道选择的认知无线网络多频带传输物理层安全
Akinari Ida, T. Fujii
In this paper, we consider a secrecy transmission scheme based on physical layer, employing multi-band transmitters with dynamic power allocations and channel selections under the circumstance of spectrum sharing cognitive radio networks. Here, we apply a physical layer security utilizing multi-band transmitter for distributing each confidential message of each receiver over multiple frequency channels and for decreasing leakage. Moreover, we aim to improve the secrecy capacity by using a channel selection method based on the channel condition of each user and primary usage. By using computer simulations, we could verify that the proposed method improves the performance in terms of secrecy capacity against eavesdropping compared with methods using single-band transmission under primary user coexisting environment.
本文在频谱共享的认知无线网络环境下,考虑了一种基于物理层的保密传输方案,该方案采用动态功率分配和信道选择的多波段发射机。在这里,我们应用了一种物理层安全,利用多波段发射器在多个频率通道上分发每个接收器的每个机密消息,并减少泄漏。此外,我们还采用了一种基于每个用户的信道条件和主要用途的信道选择方法来提高保密能力。通过计算机仿真,我们可以验证,与主用户共存环境下的单波段传输方法相比,所提出的方法在防窃听的保密能力方面提高了性能。
{"title":"Physical layer security using multi-band transmission considering channel selection for cognitive radio networks","authors":"Akinari Ida, T. Fujii","doi":"10.1109/APSIPA.2014.7041730","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041730","url":null,"abstract":"In this paper, we consider a secrecy transmission scheme based on physical layer, employing multi-band transmitters with dynamic power allocations and channel selections under the circumstance of spectrum sharing cognitive radio networks. Here, we apply a physical layer security utilizing multi-band transmitter for distributing each confidential message of each receiver over multiple frequency channels and for decreasing leakage. Moreover, we aim to improve the secrecy capacity by using a channel selection method based on the channel condition of each user and primary usage. By using computer simulations, we could verify that the proposed method improves the performance in terms of secrecy capacity against eavesdropping compared with methods using single-band transmission under primary user coexisting environment.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124837755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A reversible data hiding based on adaptive prediction technique and histogram shifting 基于自适应预测技术和直方图移位的可逆数据隐藏
R. Liu, R. Ni, Yao Zhao
Reversible data hiding recovers the original image from the stego-image without distortion after data extraction. In this paper, we propose a novel reversible data hiding method based on adaptive prediction techniques and histogram shifting. Because most natural images always contain edges, it is not suitable to predict these pixels using existing prediction methods. For more precise prediction, two prediction methods are adaptively used to calculate prediction error according to the characteristic of a pixel. As a result, two prediction error histograms are built. One is for pixels located at edges, and the other is for the rest pixels. Data are embedded in the image by using histogram shifting method. In addition, a new sorting method is applied to histogram shifting, which considers the differences of all pixel pairs in the neighborhood and better reflects the correlation among pixels. Through the sorting method, the prediction errors with small absolute values are arranged in the front and more embeddable pixels are preferentially processed. Therefore, the number of shifting pixels is decreased if the peaks in the histograms are all dealt with or the capacity is satisfied, which is beneficial to distortion reduction. Experimental results demonstrate that the proposed method acquires greater capacity and higher quality compared with other state-of-the-art schemes.
可逆数据隐藏是在数据提取后不失真地从隐写图像中恢复原始图像。本文提出了一种基于自适应预测技术和直方图移位的可逆数据隐藏方法。由于大多数自然图像总是包含边缘,使用现有的预测方法不适合预测这些像素。为了提高预测精度,根据像素的特性,采用两种预测方法自适应计算预测误差。结果,建立了两个预测误差直方图。一个用于位于边缘的像素,另一个用于其余像素。采用直方图移位法将数据嵌入到图像中。此外,在直方图移位中应用了一种新的排序方法,该方法考虑了邻域内所有像素对的差异,更好地反映了像素间的相关性。通过排序方法,将绝对值较小的预测误差排在前面,优先处理可嵌入像素较多的预测误差。因此,如果对直方图中的峰值全部处理或容量满足,则可以减少移位像素的数量,有利于减少失真。实验结果表明,与其他先进的方法相比,该方法具有更大的容量和更高的质量。
{"title":"A reversible data hiding based on adaptive prediction technique and histogram shifting","authors":"R. Liu, R. Ni, Yao Zhao","doi":"10.1109/APSIPA.2014.7041698","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041698","url":null,"abstract":"Reversible data hiding recovers the original image from the stego-image without distortion after data extraction. In this paper, we propose a novel reversible data hiding method based on adaptive prediction techniques and histogram shifting. Because most natural images always contain edges, it is not suitable to predict these pixels using existing prediction methods. For more precise prediction, two prediction methods are adaptively used to calculate prediction error according to the characteristic of a pixel. As a result, two prediction error histograms are built. One is for pixels located at edges, and the other is for the rest pixels. Data are embedded in the image by using histogram shifting method. In addition, a new sorting method is applied to histogram shifting, which considers the differences of all pixel pairs in the neighborhood and better reflects the correlation among pixels. Through the sorting method, the prediction errors with small absolute values are arranged in the front and more embeddable pixels are preferentially processed. Therefore, the number of shifting pixels is decreased if the peaks in the histograms are all dealt with or the capacity is satisfied, which is beneficial to distortion reduction. Experimental results demonstrate that the proposed method acquires greater capacity and higher quality compared with other state-of-the-art schemes.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129771516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Upsampling of low-resolution depth map with enhancing depth discontinuity regions 增强深度不连续区域的低分辨率深度图上采样
Y. Kang, Yo-Sung Ho
In this paper, we present an upsampling method of low-resolution depth maps with enhancing depth discontinuities using color segment information. After we supply the initial depth measurement considering the corresponding color segment information, we define an energy function for depth map upsampling based on the depth measurement, color values, and color segments. Then, we obtain high-resolution depth maps by by belief propagation optimization. Experimental results show that the proposed method outperforms other approaches for depth map upsampling in terms of the bad pixel rate and mean absolute error.
本文提出了一种利用颜色段信息增强深度不连续的低分辨率深度图上采样方法。在提供了初始深度测量值并考虑了相应的颜色段信息后,我们基于深度测量值、颜色值和颜色段定义了深度图上采样的能量函数。然后,通过信念传播优化得到高分辨率深度图。实验结果表明,该方法在坏像率和平均绝对误差方面优于其他深度图上采样方法。
{"title":"Upsampling of low-resolution depth map with enhancing depth discontinuity regions","authors":"Y. Kang, Yo-Sung Ho","doi":"10.1109/APSIPA.2014.7041739","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041739","url":null,"abstract":"In this paper, we present an upsampling method of low-resolution depth maps with enhancing depth discontinuities using color segment information. After we supply the initial depth measurement considering the corresponding color segment information, we define an energy function for depth map upsampling based on the depth measurement, color values, and color segments. Then, we obtain high-resolution depth maps by by belief propagation optimization. Experimental results show that the proposed method outperforms other approaches for depth map upsampling in terms of the bad pixel rate and mean absolute error.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127236924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi frame size feature extraction for acoustic event detection 针对声事件检测的多帧特征提取
Liqun Peng, Deshun Yang, Xiaoou Chen
This paper addresses the problem of detection and recognition of impulsive sounds in surveillance system, such as door slams, footsteps, glass breaks, gunshots and human screams. We build an acoustic event dataset of about 1k sound clips and a ground truth dataset of a surveillance system. We investigate the influence of different frame size in audio feature extraction when classify acoustic events and our result show that the classification accuracy differs from different audio frame sizes. Based on this result, we propose an approach to integrate multi frame size features to generate a new feature set, which can achieve better performance. We build an abnormal acoustic event detection system for surveillance using this feature and adopt a smoothing post process. The experiments show the effectiveness of our proposed approach.
本文研究了在监控系统中对诸如关门声、脚步声、玻璃破碎声、枪声和人的尖叫声等脉冲声音的检测和识别问题。我们建立了一个大约1k个声音片段的声学事件数据集和一个监视系统的地面真相数据集。研究了不同帧长的音频特征提取对声学事件分类的影响,结果表明不同帧长的音频特征提取的分类精度不同。在此基础上,我们提出了一种整合多帧大小特征来生成新的特征集的方法,可以获得更好的性能。我们利用这一特征构建了一个用于监视的异常声事件检测系统,并采用平滑后置处理。实验证明了该方法的有效性。
{"title":"Multi frame size feature extraction for acoustic event detection","authors":"Liqun Peng, Deshun Yang, Xiaoou Chen","doi":"10.1109/APSIPA.2014.7041574","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041574","url":null,"abstract":"This paper addresses the problem of detection and recognition of impulsive sounds in surveillance system, such as door slams, footsteps, glass breaks, gunshots and human screams. We build an acoustic event dataset of about 1k sound clips and a ground truth dataset of a surveillance system. We investigate the influence of different frame size in audio feature extraction when classify acoustic events and our result show that the classification accuracy differs from different audio frame sizes. Based on this result, we propose an approach to integrate multi frame size features to generate a new feature set, which can achieve better performance. We build an abnormal acoustic event detection system for surveillance using this feature and adopt a smoothing post process. The experiments show the effectiveness of our proposed approach.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122377207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1