首页 > 最新文献

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)最新文献

英文 中文
Retinex-based signal enhancement for image dark regions 基于视黄醇的图像暗区信号增强
Hirotaka Tanaka, Yuji Waizumi, T. Kasezawa
To improve text detection and text recognition in natural scenes, it is important to improve legibility by removing the effects of illumination on the image. Therefore, in this study, we present a signal enhancement method for the dark regions in the image. In addition, we propose a procedure to reduce computational complexity of the bilateral filter that is used to estimate the illumination in our method. Since our main aim is to provide the preprocessing of text detection and recognition in natural scenes, we have proposed a simple processing in which only the signals of the dark regions in the image are enhanced. Experimental results show that our method is effective in conserving the naturalness of the image and improving the legibility of texts in natural scenes. Furthermore, by using our proposed procedure, bilateral filter computation loads can be reduced by about 30%.
为了改善自然场景中的文本检测和文本识别,通过消除光照对图像的影响来提高易读性是很重要的。因此,在本研究中,我们提出了一种对图像中的暗区进行信号增强的方法。此外,我们提出了一个程序,以减少计算复杂性的双边滤波器,用于估计我们的方法中的照明。由于我们的主要目的是提供自然场景中文本检测和识别的预处理,因此我们提出了一种简单的处理方法,其中仅增强图像中暗区域的信号。实验结果表明,该方法能够有效地保持自然场景下图像的自然度,提高文本的易读性。此外,采用我们提出的方法,双边滤波器的计算量可以减少约30%。
{"title":"Retinex-based signal enhancement for image dark regions","authors":"Hirotaka Tanaka, Yuji Waizumi, T. Kasezawa","doi":"10.1109/ICSIPA.2017.8120607","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120607","url":null,"abstract":"To improve text detection and text recognition in natural scenes, it is important to improve legibility by removing the effects of illumination on the image. Therefore, in this study, we present a signal enhancement method for the dark regions in the image. In addition, we propose a procedure to reduce computational complexity of the bilateral filter that is used to estimate the illumination in our method. Since our main aim is to provide the preprocessing of text detection and recognition in natural scenes, we have proposed a simple processing in which only the signals of the dark regions in the image are enhanced. Experimental results show that our method is effective in conserving the naturalness of the image and improving the legibility of texts in natural scenes. Furthermore, by using our proposed procedure, bilateral filter computation loads can be reduced by about 30%.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"145 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133454753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Improvement of stereo corresponding algorithm based on sum of absolute differences and edge preserving filter 基于绝对差和和边缘保持滤波的立体对应算法改进
R. A. Hamzah, M. S. Hamid, Ahmad Fauzan Kadmin, S. Ghani
This paper proposes a new stereo corresponding algorithm which uses local-based. The Sum of Absolute Differences (SAD) algorithm produces accurate results on the disparity map for the textured regions. However, this algorithm is sensitive to low texture areas and high noise on images with high different brightness and contrast of images. To get over these problems, the proposed algorithm utilizes edge-preserving filter which is known as Bilateral Filter (BF). The BF kernel well-recovered low texture areas which is able to reduce noise and sharpen the images. Additionally, BF is strong against the distortions due to high brightness and contrast. The proposed work in this paper produces accurate results and performs much better compared to some established algorithms based on the quantitative and qualitative measurements using standard stereo benchmarking evaluation from the Middlebury.
本文提出了一种新的基于局部的立体对应算法。绝对差和(Sum of Absolute difference, SAD)算法对纹理区域的视差图产生准确的结果。然而,该算法对亮度和对比度差异较大的图像的低纹理区域和高噪声敏感。为了克服这些问题,该算法采用了边缘保持滤波器,即双边滤波器(BF)。该算法能够很好地恢复低纹理区域,从而降低图像的噪声,提高图像的锐化程度。此外,由于高亮度和高对比度,BF具有很强的抗畸变能力。本文提出的工作产生了准确的结果,并且与一些基于使用Middlebury标准立体基准评估的定量和定性测量的既定算法相比,性能要好得多。
{"title":"Improvement of stereo corresponding algorithm based on sum of absolute differences and edge preserving filter","authors":"R. A. Hamzah, M. S. Hamid, Ahmad Fauzan Kadmin, S. Ghani","doi":"10.1109/ICSIPA.2017.8120610","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120610","url":null,"abstract":"This paper proposes a new stereo corresponding algorithm which uses local-based. The Sum of Absolute Differences (SAD) algorithm produces accurate results on the disparity map for the textured regions. However, this algorithm is sensitive to low texture areas and high noise on images with high different brightness and contrast of images. To get over these problems, the proposed algorithm utilizes edge-preserving filter which is known as Bilateral Filter (BF). The BF kernel well-recovered low texture areas which is able to reduce noise and sharpen the images. Additionally, BF is strong against the distortions due to high brightness and contrast. The proposed work in this paper produces accurate results and performs much better compared to some established algorithms based on the quantitative and qualitative measurements using standard stereo benchmarking evaluation from the Middlebury.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131463971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Hybrid neural network and regression tree ensemble pruned by simulated annealing for virtual flow metering application 混合神经网络与模拟退火剪枝回归树集成在虚拟流量计量中的应用
Tareq Aziz AL-Qutami, R. Ibrahim, I. Ismail
Virtual flow metering (VFM) is an attractive and cost-effective solution to meet the rising multiphase flow monitoring demands in the petroleum industry. It can also augment and backup physical multiphase flow metering. In this study, a heterogeneous ensemble of neural networks and regression trees is proposed to develop a VFM model utilizing bootstrapping and parameter perturbation to generate diversity among learners. The ensemble is pruned using simulated annealing optimization to further ensure accuracy and reduce ensemble complexity. The proposed VFM model is validated using five years well-test data from eight production wells. Results show improved performance over homogeneous ensemble techniques. Average errors achieved are 1.5%, 6.5%, and 4.7% for gas, oil, and, water flow rate estimations. The developed VFM provides accurate flow rate estimations across a wide range of gas volume fractions and water cuts and is anticipated to be a step forward towards the vision of completely integrated operations.
虚拟流量测量(VFM)是满足石油工业中不断增长的多相流监测需求的一种有吸引力且经济高效的解决方案。它还可以补充和备份物理多相流计量。在这项研究中,提出了一个神经网络和回归树的异构集成来开发一个VFM模型,利用自举和参数摄动来产生学习者之间的多样性。采用模拟退火优化对集合进行剪枝,进一步保证了集合的精度,降低了集合的复杂度。利用8口生产井的5年试井数据验证了所提出的VFM模型。结果表明,性能优于均匀集成技术。对于天然气、石油和水的流量估计,平均误差分别为1.5%、6.5%和4.7%。开发的VFM可以在大范围的气体体积分数和含水率下提供准确的流量估计,预计将是实现完全集成作业的重要一步。
{"title":"Hybrid neural network and regression tree ensemble pruned by simulated annealing for virtual flow metering application","authors":"Tareq Aziz AL-Qutami, R. Ibrahim, I. Ismail","doi":"10.1109/ICSIPA.2017.8120626","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120626","url":null,"abstract":"Virtual flow metering (VFM) is an attractive and cost-effective solution to meet the rising multiphase flow monitoring demands in the petroleum industry. It can also augment and backup physical multiphase flow metering. In this study, a heterogeneous ensemble of neural networks and regression trees is proposed to develop a VFM model utilizing bootstrapping and parameter perturbation to generate diversity among learners. The ensemble is pruned using simulated annealing optimization to further ensure accuracy and reduce ensemble complexity. The proposed VFM model is validated using five years well-test data from eight production wells. Results show improved performance over homogeneous ensemble techniques. Average errors achieved are 1.5%, 6.5%, and 4.7% for gas, oil, and, water flow rate estimations. The developed VFM provides accurate flow rate estimations across a wide range of gas volume fractions and water cuts and is anticipated to be a step forward towards the vision of completely integrated operations.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114191973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Hybrid DWT and MFCC feature warping for noisy forensic speaker verification in room reverberation 混合DWT和MFCC特征翘曲在室内混响嘈杂的法医扬声器验证
Ahmed Kamil Hasan Al-Ali, B. Senadji, V. Chandran
The robustness of speaker verification systems is often degraded in real forensic applications, which contain environmental noise and reverberation. Reverberation results in mismatched conditions between enrolment and test speech signals. In this work, we investigate the effectiveness of combining features of discrete wavelet transform (DWT) and feature-warped mel frequency cepstral coefficients (MFCCs) to improve the performance of speaker verification under conditions of reverberation and environmental noises. State of the art intermediate vector (i-vector) and probabilistic linear discriminant analysis (PLDA) were used as a classifier. The algorithm was evaluated by convolving the impulse room response with enrolment speech from an Australian forensic voice comparison database. The test speech signals were combined with car, street, and home noises from the QUT-NOISE database at signal to noise ratios (SNR) ranging from −10 dB to 10 dB. Experimental results indicate that the algorithm achieves a reduction in average equal error rate (EER) ranging from 17.10% to 51.86% over traditional MFCC features when reverberated enrolment data and the test speech signals are corrupted with car, street and home noises at SNRs ranging from −10 dB to 10 dB.
在包含环境噪声和混响的实际司法应用中,说话人验证系统的鲁棒性经常下降。混响导致入学和测试语音信号不匹配。在这项工作中,我们研究了将离散小波变换(DWT)和特征扭曲的mel频率倒谱系数(MFCCs)相结合的有效性,以改善混响和环境噪声条件下的说话人验证性能。使用最先进的中间向量(i-vector)和概率线性判别分析(PLDA)作为分类器。该算法通过将脉冲房间响应与来自澳大利亚法医语音比较数据库的入学演讲进行卷积来评估。测试语音信号与来自QUT-NOISE数据库的汽车、街道和家庭噪声结合在一起,信噪比(SNR)范围为−10 dB至10 dB。实验结果表明,当混响注册数据和测试语音信号被汽车、街道和家庭噪声(信噪比为- 10 dB ~ 10 dB)干扰时,该算法比传统的MFCC算法平均误差率(EER)降低了17.10% ~ 51.86%。
{"title":"Hybrid DWT and MFCC feature warping for noisy forensic speaker verification in room reverberation","authors":"Ahmed Kamil Hasan Al-Ali, B. Senadji, V. Chandran","doi":"10.1109/ICSIPA.2017.8120650","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120650","url":null,"abstract":"The robustness of speaker verification systems is often degraded in real forensic applications, which contain environmental noise and reverberation. Reverberation results in mismatched conditions between enrolment and test speech signals. In this work, we investigate the effectiveness of combining features of discrete wavelet transform (DWT) and feature-warped mel frequency cepstral coefficients (MFCCs) to improve the performance of speaker verification under conditions of reverberation and environmental noises. State of the art intermediate vector (i-vector) and probabilistic linear discriminant analysis (PLDA) were used as a classifier. The algorithm was evaluated by convolving the impulse room response with enrolment speech from an Australian forensic voice comparison database. The test speech signals were combined with car, street, and home noises from the QUT-NOISE database at signal to noise ratios (SNR) ranging from −10 dB to 10 dB. Experimental results indicate that the algorithm achieves a reduction in average equal error rate (EER) ranging from 17.10% to 51.86% over traditional MFCC features when reverberated enrolment data and the test speech signals are corrupted with car, street and home noises at SNRs ranging from −10 dB to 10 dB.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114431443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Extraction of geometric shape of repetitive elements with application to traceology 重复元素几何形状的提取及其在痕迹学中的应用
M. Vagac, M. Melichercík, Michaela Samuelcikova
Detection of repetitive patterns in images is subject of several research papers. The majority of them deals with detection of lattice patterns of repetitive elements. However, there are many situations, when element's repetition doesn't follow any particular pattern. In this paper we focus on the following two objectives. Firstly, our algorithm detects repetitive elements regardless of their relative positions. Secondly, the algorithm extracts the shape of repetitive element. Main contribution of this paper is the proposed algorithm with ability to extract shape of not only regulary repeating elements.
图像中重复模式的检测是几篇研究论文的主题。它们中的大多数处理重复元素的晶格模式的检测。然而,在许多情况下,元素的重复并不遵循任何特定的模式。在本文中,我们主要关注以下两个目标。首先,我们的算法检测重复元素,而不考虑它们的相对位置。其次,提取重复元素的形状;本文的主要贡献在于提出的算法不仅能够提取规则重复元素的形状,而且能够提取规则重复元素的形状。
{"title":"Extraction of geometric shape of repetitive elements with application to traceology","authors":"M. Vagac, M. Melichercík, Michaela Samuelcikova","doi":"10.1109/ICSIPA.2017.8120593","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120593","url":null,"abstract":"Detection of repetitive patterns in images is subject of several research papers. The majority of them deals with detection of lattice patterns of repetitive elements. However, there are many situations, when element's repetition doesn't follow any particular pattern. In this paper we focus on the following two objectives. Firstly, our algorithm detects repetitive elements regardless of their relative positions. Secondly, the algorithm extracts the shape of repetitive element. Main contribution of this paper is the proposed algorithm with ability to extract shape of not only regulary repeating elements.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124493267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust breast tumor detection via shrinkage covariance matrix estimation 基于收缩协方差矩阵估计的鲁棒乳腺肿瘤检测
Li-Der Fang, Wen-Hsien Fang, D. Chang, Yie-Tarng Chen
Microwave imaging (MWI) is a promising imaging modality for breast tumor detection. One challenge faced by the ultra-wideband (UWB) radar-based breast cancer detection is the estimation of clutter-plus-noise covariance matrix. To render a more accurate covariance matrix estimate when the number of samples is not large, this paper presents a new covariance matrix estimate using the shrinkage method. The parameters of the proposed shrinkage-based covariance matrix are cast as a modified semi-definite programming (SDP) problem based on the minimum mean-squared error (MMSE) criterion. Moreover, to reduce the computational overhead, we also incorporate the compressive sensing (CS) technique with the above scheme for UWB breast tumor detection. The performance of the Capon beamformer based on the new reconstructed covariance matrix is tested under multistatic scenario by a 2-D numerical breast analysis model. Simulations show that the proposed approach possesses a better target identification capability and improves the signal-to-clutter-noise ratio (SCNR) than the existing counterparts.
微波成像(MWI)是一种很有前途的乳腺肿瘤检测成像方式。基于超宽带(UWB)雷达的乳腺癌检测面临的一个挑战是杂波加噪声协方差矩阵的估计。为了在样本数量不大的情况下给出更准确的协方差矩阵估计,本文提出了一种新的用收缩法估计协方差矩阵的方法。将提出的基于收缩的协方差矩阵参数转换为基于最小均方误差(MMSE)准则的改进半确定规划问题。此外,为了减少计算开销,我们还将压缩感知(CS)技术与上述方案结合用于超宽带乳腺肿瘤检测。利用二维数值乳房分析模型,对基于新重构协方差矩阵的Capon波束形成器在多静态场景下的性能进行了测试。仿真结果表明,该方法比现有方法具有更好的目标识别能力,提高了信噪比(SCNR)。
{"title":"Robust breast tumor detection via shrinkage covariance matrix estimation","authors":"Li-Der Fang, Wen-Hsien Fang, D. Chang, Yie-Tarng Chen","doi":"10.1109/ICSIPA.2017.8120584","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120584","url":null,"abstract":"Microwave imaging (MWI) is a promising imaging modality for breast tumor detection. One challenge faced by the ultra-wideband (UWB) radar-based breast cancer detection is the estimation of clutter-plus-noise covariance matrix. To render a more accurate covariance matrix estimate when the number of samples is not large, this paper presents a new covariance matrix estimate using the shrinkage method. The parameters of the proposed shrinkage-based covariance matrix are cast as a modified semi-definite programming (SDP) problem based on the minimum mean-squared error (MMSE) criterion. Moreover, to reduce the computational overhead, we also incorporate the compressive sensing (CS) technique with the above scheme for UWB breast tumor detection. The performance of the Capon beamformer based on the new reconstructed covariance matrix is tested under multistatic scenario by a 2-D numerical breast analysis model. Simulations show that the proposed approach possesses a better target identification capability and improves the signal-to-clutter-noise ratio (SCNR) than the existing counterparts.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"319 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122987527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-von-neumann heap for better streaming, capturing and storing of raw 8K video data 非诺伊曼堆更好的流,捕获和存储原始8K视频数据
Mohamed Shaafiee, R. Logeswaran
The advent of 8K and better resolutions of video pose problems for the capture and storage of data by these standards. The contemporary alternative is to compromise on quality and use various (often lossy) compression techniques to reduce the bandwidth required to move this data. This paper proposes a novel method for handling large volumes of video data without compromising its quality through space saving techniques such as chroma subsampling. A proposed implementation is also presented. The method is shown to be capable of handling the capture and storage of raw 8K video data as well as supports better video streaming.
8K和更高分辨率视频的出现给这些标准的数据捕获和存储带来了问题。当前的替代方案是在质量上妥协,并使用各种(通常是有损的)压缩技术来减少移动这些数据所需的带宽。本文提出了一种通过色度子采样等节省空间的技术来处理大量视频数据而不影响其质量的新方法。提出了一种实现方案。该方法能够处理原始8K视频数据的捕获和存储,并支持更好的视频流。
{"title":"Non-von-neumann heap for better streaming, capturing and storing of raw 8K video data","authors":"Mohamed Shaafiee, R. Logeswaran","doi":"10.1109/ICSIPA.2017.8120657","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120657","url":null,"abstract":"The advent of 8K and better resolutions of video pose problems for the capture and storage of data by these standards. The contemporary alternative is to compromise on quality and use various (often lossy) compression techniques to reduce the bandwidth required to move this data. This paper proposes a novel method for handling large volumes of video data without compromising its quality through space saving techniques such as chroma subsampling. A proposed implementation is also presented. The method is shown to be capable of handling the capture and storage of raw 8K video data as well as supports better video streaming.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132356780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perspective projection for decoding of QR codes posted on cylinders 透视投影,用于解码张贴在圆柱体上的QR码
K. Lay, M. Zhou
Nowadays, QR (quick-response) codes have become part of our daily life. In many applications, QR codes are posted (i.e. pasted or printed) on cylinders. Then, the QR image as captured by a camera would be distorted. In this paper, we try to tackle the decoding of QR codes in such a situation. It is based on perspective projection (PP), which is specified by a camera matrix (cM), with the assistance of cross ratio (CR). In the proposed scheme, the mathematics involved is neat, and the computation is fast. Experimental results show that the proposed scheme is effective, in the sense that with the aid of it many failed decoding attempts became successful.
如今,QR(快速响应)码已经成为我们日常生活的一部分。在许多应用中,QR码被张贴(即粘贴或打印)在圆柱体上。然后,相机捕捉到的QR图像会被扭曲。本文试图解决这种情况下的QR码解码问题。它基于透视投影(PP),由相机矩阵(cM)指定,并借助于交叉比(CR)。该方案数学计算简洁,计算速度快。实验结果表明,该方案是有效的,许多失败的译码尝试都成功了。
{"title":"Perspective projection for decoding of QR codes posted on cylinders","authors":"K. Lay, M. Zhou","doi":"10.1109/ICSIPA.2017.8120576","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120576","url":null,"abstract":"Nowadays, QR (quick-response) codes have become part of our daily life. In many applications, QR codes are posted (i.e. pasted or printed) on cylinders. Then, the QR image as captured by a camera would be distorted. In this paper, we try to tackle the decoding of QR codes in such a situation. It is based on perspective projection (PP), which is specified by a camera matrix (cM), with the assistance of cross ratio (CR). In the proposed scheme, the mathematics involved is neat, and the computation is fast. Experimental results show that the proposed scheme is effective, in the sense that with the aid of it many failed decoding attempts became successful.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128799687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A study of training free salient object detection methods in presence of noise 噪声条件下无训练显著目标检测方法的研究
Nitin Kumar, Maheep Singh, Surendra Singh, Abhimanyu Kumar
Salient Object Detection (SOD) has received much attention from the research community due to its increasing applications in the areas such as object detection and recognition, image editing, image and video compression, video summarization and so on. Most of the SOD methods are proposed in literature presuming that the digital images in which salient objects is to be detected, are free from any kind of artifact. SOD in the presence of noise has received much less attention from research community. In this paper, we study and analyze popular salient object detection methods in the presence of Gaussian, Salt and Pepper and Speckle Noises. Extensive experiments are performed on two publicly available SOD datasets viz. MSRA5K and DUT OMRON. The performance of the methods are evaluated in terms of Precision, Recall and F-measure. It is found that Context Aware Saliency Detection (CA) method gives maximum Precision while Graph Based Visual Saliency (GB) gives maximum Recall and F-measure on both the datasets in presence of any of the three noises.
显著目标检测(SOD)在目标检测与识别、图像编辑、图像与视频压缩、视频摘要等领域的应用越来越广泛,受到了研究界的广泛关注。文献中提出的大多数SOD方法都假设要检测的显着物体的数字图像没有任何类型的伪影。超氧化物歧化酶在噪声环境下的研究较少受到学术界的关注。在本文中,我们研究和分析了在高斯噪声、椒盐噪声和散斑噪声存在下流行的显著目标检测方法。在两个公开可用的SOD数据集即MSRA5K和DUT OMRON上进行了广泛的实验。从精密度、召回率和F-measure三个方面对方法的性能进行了评价。发现上下文感知显著性检测(CA)方法给出了最大的精度,而基于图的视觉显著性检测(GB)方法在存在三种噪声中的任何一种情况下都给出了最大的召回率和F-measure。
{"title":"A study of training free salient object detection methods in presence of noise","authors":"Nitin Kumar, Maheep Singh, Surendra Singh, Abhimanyu Kumar","doi":"10.1109/ICSIPA.2017.8120637","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120637","url":null,"abstract":"Salient Object Detection (SOD) has received much attention from the research community due to its increasing applications in the areas such as object detection and recognition, image editing, image and video compression, video summarization and so on. Most of the SOD methods are proposed in literature presuming that the digital images in which salient objects is to be detected, are free from any kind of artifact. SOD in the presence of noise has received much less attention from research community. In this paper, we study and analyze popular salient object detection methods in the presence of Gaussian, Salt and Pepper and Speckle Noises. Extensive experiments are performed on two publicly available SOD datasets viz. MSRA5K and DUT OMRON. The performance of the methods are evaluated in terms of Precision, Recall and F-measure. It is found that Context Aware Saliency Detection (CA) method gives maximum Precision while Graph Based Visual Saliency (GB) gives maximum Recall and F-measure on both the datasets in presence of any of the three noises.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125071922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Video synopsis generation using spatio-temporal groups 基于时空分组的视频摘要生成
Arif Ahmed, D. P. Dogra, S. Kar, Renuka Patnaik, S. Lee, Heeseung Choi, Ig-Jae Kim
Millions of surveillance cameras operate at 24×7 generating huge amount of visual data for processing. However, retrieval of important activities from such a large data can be time consuming. Thus, researchers are working on finding solutions to present hours of visual data in a compressed, but meaningful way. Video synopsis is one of the ways to represent activities using relatively shorter duration clips. So far, two main approaches have been used by researchers to address this problem, namely synopsis by tracking moving objects and synopsis by clustering moving objects. Synopses outputs, mainly depend on tracking, segmenting, and shifting of moving objects temporally as well as spatially. In many situations, tracking fails, thus produces multiple trajectories of the same object. Due to this, the object may appear and disappear multiple times within the same synopsis output, which is misleading. This also leads to discontinuity and often can be confusing to the viewer of the synopsis. In this paper, we present a new approach for generating compressed video synopsis by grouping tracklets of moving objects. Grouping helps to generate a synopsis where chronologically related objects appear together with meaningful spatio-temporal relation. Our proposed method produces continuous, but a less confusing synopses when tested on publicly available dataset videos as well as in-house dataset videos.
数以百万计的监控摄像头在24×7上运行,产生大量的视觉数据供处理。但是,从如此大的数据中检索重要的活动可能非常耗时。因此,研究人员正在努力寻找以压缩但有意义的方式呈现数小时视觉数据的解决方案。视频摘要是使用持续时间相对较短的片段表示活动的方法之一。到目前为止,研究人员主要采用两种方法来解决这一问题,即通过跟踪运动物体来进行摘要和通过聚类运动物体来进行摘要。synoses输出,主要依赖于对运动对象的时间和空间的跟踪、分割和移动。在许多情况下,跟踪失败,从而产生同一目标的多个轨迹。因此,该对象可能在同一概要输出中多次出现和消失,这是误导性的。这也会导致不连续性,经常会让观众感到困惑。本文提出了一种通过对运动目标的轨迹进行分组来生成压缩视频摘要的新方法。分组有助于生成一个大纲,其中按时间顺序相关的对象与有意义的时空关系一起出现。当对公开可用的数据集视频和内部数据集视频进行测试时,我们提出的方法产生连续的,但较少令人困惑的概要。
{"title":"Video synopsis generation using spatio-temporal groups","authors":"Arif Ahmed, D. P. Dogra, S. Kar, Renuka Patnaik, S. Lee, Heeseung Choi, Ig-Jae Kim","doi":"10.1109/ICSIPA.2017.8120666","DOIUrl":"https://doi.org/10.1109/ICSIPA.2017.8120666","url":null,"abstract":"Millions of surveillance cameras operate at 24×7 generating huge amount of visual data for processing. However, retrieval of important activities from such a large data can be time consuming. Thus, researchers are working on finding solutions to present hours of visual data in a compressed, but meaningful way. Video synopsis is one of the ways to represent activities using relatively shorter duration clips. So far, two main approaches have been used by researchers to address this problem, namely synopsis by tracking moving objects and synopsis by clustering moving objects. Synopses outputs, mainly depend on tracking, segmenting, and shifting of moving objects temporally as well as spatially. In many situations, tracking fails, thus produces multiple trajectories of the same object. Due to this, the object may appear and disappear multiple times within the same synopsis output, which is misleading. This also leads to discontinuity and often can be confusing to the viewer of the synopsis. In this paper, we present a new approach for generating compressed video synopsis by grouping tracklets of moving objects. Grouping helps to generate a synopsis where chronologically related objects appear together with meaningful spatio-temporal relation. Our proposed method produces continuous, but a less confusing synopses when tested on publicly available dataset videos as well as in-house dataset videos.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121800895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1