2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA): Latest Papers

Case study: Deployment of the 2D NoC on 3D for the generation of large emulation platforms
V. Fresse, Zhiwei Ge, Junyan Tan, F. Rousseau
The evaluation of Network-on-Chip (NoC) architectures is a current problem in System-on-Chip (SoC) design. Emulation on FPGA (Field Programmable Gate Array) is used to cover all possible NoC solutions in a reduced exploration time. Emulation requires a multi-FPGA platform because a large NoC needs substantial resources and cannot be handled by a single FPGA. At the same time, the SoC community is exploring 3D technology for the next generation of large SoCs with 3D NoCs, which makes emulation more complex. This paper presents a case study of the deployment of a 2D NoC structure to 3D. A design flow is proposed for the automatic generation of a NoC targeting 3D on multi-FPGA platforms. The flow integrates emulation blocks used for the validation and exploration of the NoC. With this automated tool, the designer can evaluate and explore the NoC architecture and extract its performance regardless of the multi-component platform. The results given in this paper indicate that an adapted partitioning of the NoC can improve communication performance.
DOI: 10.1109/RSP.2012.6380686 (https://doi.org/10.1109/RSP.2012.6380686) | Published: 2012-12-13
Citations: 3
Spherical coordinates framed RGB color space dichromatic reflection model based image segmentation: Application to wildland fires' outlines extraction
V. Amarger, D. Ramík, C. Sabourin, K. Madani, Ramón Moreno, L. Rossi, M. Graña
Wildland fires represent a major risk for many countries around the world. For efficient firefighting, modeling and predicting fire-front propagation is a crucial need. However, wildland fires involve complex dynamics, and the mathematical modeling of such complex systems requires reliable information extraction from real situations, which is far from a trivial task. Artificial vision and image processing offer appealing potential for reliably extracting the required information. In this paper we focus on the segmentation of flames and fires, addressing the open problem stated above. The segmentation approach we propose is based on the dichromatic reflection model reformulated on a spherical interpretation of the RGB color space.
DOI: 10.1109/IPTA.2012.6469529 (https://doi.org/10.1109/IPTA.2012.6469529) | Published: 2012-10-15
Citations: 1
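As a rough illustration of the spherical interpretation of RGB mentioned in the abstract above, the sketch below converts an RGB triple to a magnitude and two angles. The angle convention and the names `rgb_to_spherical` and `same_chromaticity` are assumptions made for illustration; the paper's exact formulation may differ.

```python
import math

def rgb_to_spherical(r, g, b):
    """Convert an RGB triple to (magnitude, theta, phi).

    Assumed convention (not taken from the paper):
      magnitude = Euclidean norm of (r, g, b), related to intensity
      theta     = polar angle measured from the B axis, in radians
      phi       = azimuth in the R-G plane, in radians
    """
    magnitude = math.sqrt(r * r + g * g + b * b)
    if magnitude == 0:
        return 0.0, 0.0, 0.0  # black has no defined direction
    theta = math.acos(b / magnitude)
    phi = math.atan2(g, r)
    return magnitude, theta, phi

def same_chromaticity(c1, c2, tol=1e-6):
    """Two colours that differ only in brightness share both angles."""
    _, t1, p1 = rgb_to_spherical(*c1)
    _, t2, p2 = rgb_to_spherical(*c2)
    return abs(t1 - t2) < tol and abs(p1 - p2) < tol
```

Scaling a colour by a positive factor changes only the magnitude and leaves both angles unchanged, which is why an angular representation can separate chromaticity from intensity.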
Image processing and vision for the study and the modeling of spreading fires
L. Rossi, M. Akhloufi, A. Pieri, Jean-Louis Rossi, T. Molinier
This paper presents work in image processing and computer vision aimed at measuring the geometrical characteristics of spreading fires. Such measurements help in understanding the phenomena that occur during propagation and in improving and/or validating mathematical models by comparing experimental data with numerical results. Developing a vision-based metrological system for wildland fires requires overcoming several scientific obstacles, which are also presented in this paper.
DOI: 10.1109/IPTA.2012.6469512 (https://doi.org/10.1109/IPTA.2012.6469512) | Published: 2012-10-15
Citations: 2
A combining approach for 2D face recognition application on IV2 database
Nefissa Khiari Hili, S. Lelandais, Christophe Montagne, K. Hamrouni
Dealing with 2D face recognition under unconstrained conditions is often difficult. The objective of this study is to develop an original method that overcomes such obstacles. The proposed approach combines a holistic method, Principal Component Analysis (PCA), with a local method, the Steerable Pyramid (SP). All tests were run on the IV2 database, which exhibits challenging variability, with 3500 to 5000 comparisons per experiment involving 315 different people. The protocol followed was established in the first evaluation campaign on 2D face images using the multimodal IV2 database. Comparison with five submitted algorithms, such as PCA, LDA and LDA/Gabor, gives satisfactory results.
DOI: 10.1109/IPTA.2012.6469507 (https://doi.org/10.1109/IPTA.2012.6469507) | Published: 2012-10-15
Citations: 0
Dynamic registration of cardiac US and CT data using Fourier descriptors and Dynamic Time Warping
F. Tavard, A. Simon, Alfredo I. Hernández, J. Betancur, E. Donal, M. Garreau
Cardiac Resynchronization Therapy (CRT) can be optimized by fusing anatomical, functional and electrical information in a unified framework in order to identify the most effective pacing sites. The aim of this work is to register dynamic CT images with ultrasound (US) images acquired in 2D speckle tracking mode. The proposed registration approach is based on a contour-to-surface scheme decomposed into four steps: (1) temporal synchronization of the data; (2) segmentation of the left ventricle endocardial surface from the 3D+t CT data; (3) segmentation and tracking of the myocardium from the 2D+t US data; (4) registration of the US contour with the CT surface. The originality of the method lies in the use of Fourier descriptors and Dynamic Time Warping (DTW) to handle different spatial and temporal resolutions as well as dissimilar cardiac rhythms between CT and US data. An evaluation on simulated data is described, along with results obtained on three patient databases.
DOI: 10.1109/IPTA.2012.6469516 (https://doi.org/10.1109/IPTA.2012.6469516) | Published: 2012-10-01
Citations: 6
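The abstract above relies on Dynamic Time Warping to align sequences that evolve at different rates. The following is a minimal sketch of the textbook DTW recurrence on scalar sequences; the function name `dtw_distance` is hypothetical, and the scalar samples stand in for whatever features (for example, Fourier descriptors) the authors actually compare.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping cost between two 1D sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local distance between samples
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]
```

Because samples may be repeated during alignment, two sequences that trace the same shape at different speeds get a low cost even when their lengths differ, which is the property needed when two acquisitions follow different rhythms.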
Exploiting a scene calibration mechanism for depth estimation
Ali Musa Kazmi, N. I. Rao, Fawad Fazal, Muhammad Faisal Khan
Depth estimation from one or more images is most often addressed in computer vision using binocular cues, motion parallax or monocular cues. In this paper, we exploit a scene calibration mechanism for estimating depth from a single image, with emphasis on motorways. The approach uses the linear perspective depth cue to recover the distance of vehicles from a given image. Assuming that linear perspective is amply available in structured environments, the proposed approach computes a 1D projective transformation across the ground plane that maps imaged distances to the corresponding real-world distances. Once the homography matrix for the 1D projective transformation is available, it can be applied to any point to compute its straight-line distance from the reference point. Experimental results show that the proposed approach is computationally efficient and delivers suitably accurate depth estimates; it has therefore been applied to detect speeding vehicles.
DOI: 10.1109/IPTA.2012.6469572 (https://doi.org/10.1109/IPTA.2012.6469572) | Published: 2012-10-01
Citations: 0
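The 1D projective transformation described in the abstract above can be sketched as follows. A 1D homography has three degrees of freedom, so three known correspondences between image positions and real-world distances suffice to fit it. The parameterization X = (a*x + b) / (c*x + 1) and the names `fit_1d_homography` and `image_to_world` are assumptions for illustration, not the authors' implementation.

```python
def fit_1d_homography(pairs):
    """Fit X = (a*x + b) / (c*x + 1) from three (x, X) correspondences.

    x is a position along a line in the image, X the matching real-world
    distance. Each pair gives one linear equation: a*x + b - c*x*X = X.
    """
    rows = [[x, 1.0, -x * X] for x, X in pairs]
    rhs = [X for _, X in pairs]

    def det3(m):
        return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
                - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
                + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

    d = det3(rows)
    if abs(d) < 1e-12:
        raise ValueError("degenerate correspondences")
    # Cramer's rule: replace column k with the right-hand side
    solution = []
    for k in range(3):
        mk = [row[:] for row in rows]
        for i in range(3):
            mk[i][k] = rhs[i]
        solution.append(det3(mk) / d)
    return tuple(solution)  # (a, b, c)

def image_to_world(x, params):
    """Map an image position to a real-world distance from the reference."""
    a, b, c = params
    return (a * x + b) / (c * x + 1.0)
```

Under this sketch, a vehicle's speed could be estimated by mapping its image position in two frames with `image_to_world` and dividing the difference in distance by the time between the frames.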
Extending the interaction area for view-invariant 3D gesture recognition
M. Caon, J. Tscherrig, Yong Yue, Omar Abou Khaled, E. Mugellini
This paper presents a non-intrusive approach for view-invariant hand gesture recognition. The representation of a gesture changes dynamically with the camera viewpoint, so a difference in the user's position between the training phase and the evaluation phase can severely compromise recognition. The proposed approach calibrates two Microsoft Kinect depth cameras to allow 3D modeling of dynamic hand movements. Gestures are modeled as 3D trajectories, and classification is based on Hidden Markov Models. The approach is trained on data from one viewpoint and tested on data from other, very different viewpoints, with an angular variation of 180°. The average recognition rate is always higher than 94%. Because this is similar to the recognition rate obtained when training and testing on gestures from the same viewpoint, the approach is indeed view-invariant. Comparing these results with those obtained with a single depth camera demonstrates that using two calibrated cameras is crucial.
DOI: 10.1109/IPTA.2012.6469542 (https://doi.org/10.1109/IPTA.2012.6469542) | Published: 2012-10-01
Citations: 5
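Classification with Hidden Markov Models, as mentioned in the abstract above, is commonly done by scoring an observation sequence against one model per gesture and picking the most likely one. The sketch below shows this with the standard forward algorithm on discrete symbols. It assumes the 3D trajectories have already been quantized into symbol indices; the names `forward_likelihood` and `classify` are hypothetical and the paper's actual features and model topology are not reproduced.

```python
def forward_likelihood(obs, start, trans, emit):
    """Return P(obs | model) for a discrete HMM via the forward algorithm.

    start[s]    = probability of starting in state s
    trans[p][s] = probability of moving from state p to state s
    emit[s][o]  = probability of emitting symbol o in state s
    """
    n_states = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][o]
            for s in range(n_states)
        ]
    return sum(alpha)

def classify(obs, models):
    """Pick the gesture whose HMM gives the sequence the highest likelihood.

    models maps a gesture name to its (start, trans, emit) parameters.
    """
    return max(models, key=lambda name: forward_likelihood(obs, *models[name]))
```

For long sequences the products of probabilities underflow, so a practical version would rescale `alpha` at each step or work in log space.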
No-reference quality metric for watermarked images based on combining of objective metrics using neural network
M. Gaata, W. Puech, Sattar Sadkhn, Saad Hasson
In this paper, a new no-reference image quality metric is proposed to automatically estimate the quality of watermarked images by combining objective metrics with a neural network. The aim is to predict the subjective quality score, known as the mean opinion score (MOS), obtained from human observers. In practice, our metric consists of three stages. First, a filtering process is applied to the watermarked image to generate a filtered image. Second, the watermarked image and its filtered version are used to compute the objective metrics that serve as input to a neural network. Third, these metrics are combined using the neural network model. The output of this neural network is a single value corresponding to the MOS. Experimental results show that combining objective metrics through the neural network does accurately predict the perceived quality of watermarked images.
DOI: 10.1109/IPTA.2012.6469513 (https://doi.org/10.1109/IPTA.2012.6469513) | Published: 2012-10-01
Citations: 16
The detection of naval vessels by fusion of edge and color background models
P. Holtzhausen, V. Crnojevic, B. Herbst
The detection of naval vessels in open water is a difficult challenge, with many applications ranging from harbor monitoring to proximity security systems for ocean-going ships. Background modelling is an effective method of detecting candidate targets, and we describe a framework that improves operational robustness through the interaction of two different Mixture of Gaussians (MoG) models. The results are then processed by a tracking system that accurately detects moving vessels. The algorithm performs well in varying environmental conditions and has good real-time performance characteristics.
DOI: 10.1109/IPTA.2012.6469522 (https://doi.org/10.1109/IPTA.2012.6469522) | Published: 2012-10-01
Citations: 1
Integrating unsupervised and supervised clustering methods on a GPU platform for fast image segmentation
A. Faro, D. Giordano, S. Palazzo
The aim of the paper is to demonstrate how integrating unsupervised and supervised parallel neural clustering methods on a GPU platform enables fast image segmentation with a satisfactory compromise between preserving the topology of the original image and minimizing the quantization error, also known as clustering accuracy. To this end, an unsupervised parallel clustering method is proposed, inspired by the Extended SOM (ESOM) and powered by a Learning Vector Quantization (LVQ)-like algorithm. Its parallel supervised version is then presented to further reduce the quantization error when proper prototypes of the desired clusters are known. Finally, the GPU implementations of both methods are illustrated to show how they can support time-critical tasks such as real-time surveillance, interactive medical diagnosis, and control of dynamical systems. The performance of the GPU implementation is discussed with the help of small examples and realistic processing tasks.
DOI: 10.1109/IPTA.2012.6469568 (https://doi.org/10.1109/IPTA.2012.6469568) | Published: 2012-10-01
Citations: 5
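For readers unfamiliar with Learning Vector Quantization, which the abstract above names as the basis of its clustering method, the sketch below shows one step of the textbook LVQ1 rule in sequential form. It is not the authors' parallel GPU algorithm or their ESOM-inspired variant; the names `nearest` and `lvq1_step` and the learning rate `lr` are illustrative assumptions.

```python
def nearest(prototypes, x):
    """Index of the prototype closest to x (squared Euclidean distance)."""
    return min(
        range(len(prototypes)),
        key=lambda i: sum((p - v) ** 2 for p, v in zip(prototypes[i][0], x)),
    )

def lvq1_step(prototypes, x, label, lr=0.1):
    """One LVQ1 update: attract the winner if labels match, repel otherwise.

    prototypes is a list of (vector, label) pairs; a new list is returned
    and the input list is left unchanged.
    """
    w = nearest(prototypes, x)
    vec, proto_label = prototypes[w]
    sign = 1.0 if proto_label == label else -1.0
    updated = [p + sign * lr * (v - p) for p, v in zip(vec, x)]
    return prototypes[:w] + [(updated, proto_label)] + prototypes[w + 1:]
```

In an image segmentation setting, `x` would typically be a pixel's feature vector and each prototype a cluster centre. The winner search is independent for every pixel, which is one reason this family of methods maps naturally onto parallel hardware.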