Latest papers from the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

Multi-fiber reconstruction from DW-MRI using a continuous mixture of von Mises-Fisher distributions
Ritwik K. Kumar, Angelos Barmpoutis, B. Vemuri, P. Carney, T. Mareci
In this paper we propose a method for reconstructing the Diffusion Weighted Magnetic Resonance (DW-MR) signal at each lattice point using a novel continuous mixture of von Mises-Fisher distribution functions. Unlike most existing methods, neither does this model assume a fixed functional form for the MR signal attenuation (e.g. 2nd or 4th order tensor) nor does it arbitrarily fix important mixture parameters like the number of components. We show that this continuous mixture has a closed form expression and leads to a linear system which can be easily solved. Through extensive experimentation with synthetic data we show that this technique outperforms various other state-of-the-art techniques in resolving fiber crossings. Finally, we demonstrate the effectiveness of this method using real DW-MRI data from rat brain and optic chiasm.
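As a rough illustration of the building block named in the abstract, the sketch below evaluates the standard von Mises-Fisher density on the unit sphere in 3D. The abstract does not give the paper's continuous mixture or its closed form, so `finite_mixture_pdf` is only a hypothetical finite stand-in with hand-picked weights, and all function and variable names here are assumptions made for illustration.

```python
import math

def vmf_pdf(x, mu, kappa):
    """Density of a von Mises-Fisher distribution on the unit sphere in 3D.

    x and mu are unit 3-vectors; kappa >= 0 is the concentration parameter.
    """
    if kappa < 0:
        raise ValueError("kappa must be non-negative")
    if kappa < 1e-8:
        return 1.0 / (4.0 * math.pi)  # kappa -> 0 gives the uniform density
    dot = sum(a * b for a, b in zip(x, mu))
    # kappa / (4*pi*sinh(kappa)) * exp(kappa*dot), rewritten to avoid overflow
    norm = kappa / (2.0 * math.pi * (1.0 - math.exp(-2.0 * kappa)))
    return norm * math.exp(kappa * (dot - 1.0))

def finite_mixture_pdf(x, components):
    """Weighted sum of vMF densities (illustrative stand-in, not the paper's model).

    components is a list of (weight, mu, kappa) tuples whose weights sum to 1.
    """
    return sum(w * vmf_pdf(x, mu, k) for w, mu, k in components)

# Hypothetical example: two equally weighted lobes at right angles,
# loosely mimicking a fiber crossing.
crossing = [(0.5, (1.0, 0.0, 0.0), 20.0), (0.5, (0.0, 1.0, 0.0), 20.0)]
on_fiber = finite_mixture_pdf((1.0, 0.0, 0.0), crossing)
off_fiber = finite_mixture_pdf((0.0, 0.0, 1.0), crossing)
```

In this toy setting the mixture density is much higher along either lobe direction than perpendicular to both, which is the qualitative behaviour a multi-fiber model needs at a crossing.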
DOI: https://doi.org/10.1109/CVPRW.2008.4562991
Published: 2008-07-15
Citations: 25
Effective image database search via dimensionality reduction
A. Dahl, H. Aanæs
Image search using the bag-of-words image representation is investigated further in this paper. This approach has shown promising results for large-scale image collections, making it relevant for Internet applications. The steps involved in the bag-of-words approach are feature extraction, vocabulary building, and searching with a query image. It is important to keep the computational cost low through all steps. In this paper we focus on the efficiency of the technique. To do that, we substantially reduce the dimensionality of the features by the use of PCA and addition of color. Building of the visual vocabulary is typically done using k-means. We investigate a clustering algorithm based on the leader-follower principle (LF-clustering), in which the number of clusters is not fixed. The adaptive nature of LF-clustering is shown to improve the quality of the visual vocabulary. In the query step, features from the query image are assigned to the visual vocabulary. The dimensionality reduction enables us to do exact feature labeling using a kD-tree, instead of the approximate approaches normally used. Despite the dimensionality reduction to between 6 and 15 dimensions, we obtain improved results compared to the traditional bag-of-words approach based on 128-dimensional SIFT features and k-means clustering.
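The leader-follower idea mentioned in the abstract can be sketched as follows. This is the generic textbook form of the algorithm, not necessarily the variant used in the paper: the distance threshold, the update rate, and the function name `leader_follower` are all assumptions made for illustration.

```python
import math

def leader_follower(points, threshold, rate=0.1):
    """Cluster points in one pass without fixing the number of clusters.

    Each point either pulls its nearest center toward itself (if that center
    is within `threshold`) or starts a new cluster of its own.
    """
    centers = []
    for p in points:
        if centers:
            # Index of the nearest existing center
            j = min(range(len(centers)), key=lambda i: math.dist(p, centers[i]))
            if math.dist(p, centers[j]) <= threshold:
                # Follower step: move the center a fraction of the way to p
                centers[j] = [c + rate * (x - c) for c, x in zip(centers[j], p)]
                continue
        # Leader step: no center is close enough, so p founds a new cluster
        centers.append(list(p))
    return centers

# Hypothetical low-dimensional descriptors forming two well-separated groups
descriptors = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
vocabulary = leader_follower(descriptors, threshold=1.0)
```

Unlike k-means, the number of visual words here is an outcome of the threshold rather than an input, which is the adaptive property the abstract refers to.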
DOI: https://doi.org/10.1109/CVPRW.2008.4562957
Published: 2008-06-23
Citations: 13
Interleaved pixel lookup for embedded computer vision
Kota Yamaguchi, Yoshihiro Watanabe, T. Komuro, M. Ishikawa
This paper describes an in-depth investigation and implementation of interleaved memory for pixel lookup operations in computer vision. Pixel lookup, mapping between coordinates and pixels, is a common operation in computer vision, but is also a potential bottleneck due to formidable bandwidth requirements for real-time operation. We focus on the acceleration of pixel lookup operations through parallelizing memory banks by interleaving. The key to applying interleaving for pixel lookup is 2D block data partitioning and support for unaligned access. With this optimization of interleaving, pixel lookup operations can output a block of pixels at once without major overhead for unaligned access. An example implementation of our optimized interleaved memory for affine motion tracking shows that the pixel lookup operations can achieve 12.8 Gbps for random lookup of a 4x4 size block of 8-bit pixels under 100 MHz operation. Interleaving can be a cost-effective solution for fast pixel lookup in embedded computer vision.
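A small sketch can make the interleaving idea and the quoted throughput concrete. The bank mapping below is a common 2D interleaving scheme chosen for illustration; the abstract does not specify the paper's actual layout, so `bank_of`, `banks_for_window`, and the assumption of one block delivered per clock cycle are all hypothetical.

```python
BLOCK = 4  # block edge length in pixels, taken from the 4x4 block in the abstract

def bank_of(x, y, block=BLOCK):
    """Map pixel (x, y) to one of block*block memory banks by 2D interleaving."""
    return (y % block) * block + (x % block)

def banks_for_window(x0, y0, block=BLOCK):
    """Banks touched by a block-sized window whose top-left corner is (x0, y0)."""
    return [bank_of(x0 + dx, y0 + dy, block)
            for dy in range(block) for dx in range(block)]

def throughput_gbps(block=BLOCK, bits_per_pixel=8, clock_mhz=100):
    """Peak throughput, assuming one full block is read per clock cycle."""
    bits_per_cycle = block * block * bits_per_pixel
    return bits_per_cycle * clock_mhz / 1000.0

# An unaligned window still hits every bank exactly once, so all 16 pixels
# can be fetched in parallel without bank conflicts.
unaligned = sorted(banks_for_window(3, 5))
peak = throughput_gbps()
```

With these assumptions, 16 pixels of 8 bits each give 128 bits per cycle, and 128 bits at 100 MHz works out to the 12.8 Gbps figure stated in the abstract.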
DOI: https://doi.org/10.1109/CVPRW.2008.4563152
Published: 2008-06-23
Citations: 1
Real-time estimation of human attention field in LWIR and color surveillance videos
A. Leykin, R. Hammoud
Knowing the visual attention field of a monitored subject is of great value for many applications including surveillance and marketing. This paper proposes first tracking people's bodies and then estimating the visual attention field for each person using head pose information. The proposed head pose technique aims at estimating the yaw angle only. The method is shown to operate on monocular color camera sequences and is further refined with data from a thermal sensor. In typical monocular tracking sequences the resolution of the head is very low and parts of the head are occluded, with the face often invisible to the camera. We propose a method of combining a skin color detector with the direction of motion in a probabilistic way. We show how the head profile obtained from the thermal sequence can be used to further improve the result.
DOI: https://doi.org/10.1109/CVPRW.2008.4563059
Published: 2008-06-23
Citations: 14
ToF-sensors: New dimensions for realism and interactivity
A. Kolb, E. Barth, R. Koch
A growing number of applications depend on accurate and fast 3D scene analysis. Examples are object recognition, collision prevention, 3D modeling, mixed reality, and gesture recognition. The estimation of a range map by image analysis or laser scan techniques is still a time-consuming and expensive part of such systems. Time-of-flight (ToF) cameras are a lower-priced, fast and robust alternative for distance measurements. Recently, significant improvements have been made in order to achieve low-cost and compact ToF devices that have the potential to revolutionize many fields of research, including computer vision, computer graphics and human computer interaction (HCI). These technologies are starting to have an impact on research and commercial applications. The upcoming generation of ToF sensors, however, will be even more powerful and will have the potential to become "ubiquitous geometry devices" for gaming, web-conferencing, and numerous other applications. This paper will give an account of some recent developments in ToF technology and will discuss applications of this technology for vision, graphics, and HCI.
DOI: https://doi.org/10.1109/CVPRW.2008.4563159
Published: 2008-06-23
Citations: 67
3D model search and pose estimation from single images using VIP features
Changchang Wu, F. Fraundorfer, Jan-Michael Frahm, M. Pollefeys
This paper describes a method to efficiently search for 3D models in a city-scale database and to compute the camera poses from single query images. The proposed method matches SIFT features (from a single image) to viewpoint invariant patches (VIP) from a 3D model by warping the SIFT features approximately into the orthographic frame of the VIP features. This significantly increases the number of feature correspondences which results in a reliable and robust pose estimation. We also present a 3D model search tool that uses a visual word based search scheme to efficiently retrieve 3D models from large databases using individual query images. Together the 3D model search and the pose estimation represent a highly scalable and efficient city-scale localization system. The performance of the 3D model search and pose estimation is demonstrated on urban image data.
DOI: https://doi.org/10.1109/CVPRW.2008.4563037
Published: 2008-06-23
Citations: 32
3-D gesture-based scene navigation in medical imaging applications using Time-of-Flight cameras
S. Soutschek, J. Penne, J. Hornegger, J. Kornhuber
For a lot of applications, and particularly for medical intra-operative applications, the exploration of and navigation through 3-D image data provided by sensors like ToF (time-of-flight) cameras, MUSTOF (multisensor-time-of-flight) endoscopes or CT (computed tomography) [8], requires a user-interface which avoids physical interaction with an input device. Thus, we process a touchless user-interface based on gestures classified by the data provided by a ToF camera. Reasonable and necessary user interactions are described. For those interactions a suitable set of gestures is introduced. A user-interface is then proposed, which interprets the current gesture and performs the assigned functionality. For evaluating the quality of the developed user-interface we considered the aspects of classification rate, real-time applicability, usability, intuitiveness and training time. The results of our evaluation show that our system, which provides a classification rate of 94.3% at a framerate of 11 frames per second, satisfactorily addresses all these quality requirements.
DOI: https://doi.org/10.1109/CVPRW.2008.4563162
Published: 2008-06-23
Citations: 103
A novel quality measure for information hiding in images
KA Navas, M. Aravind, M. Sasikumar, Assitant
Objective quality assessment has been widely used in image processing for decades, and many researchers have studied objective quality assessment methods based on the human visual system (HVS). This paper presents a new measure that quantifies the perceptual degradation produced in an image using certain subjectively evaluated weighting functions. Experimental analysis carried out on different sets of images, for different levels of data hiding and under different attacks, shows that this new measure has a high degree of agreement with the subjective analysis measure.
DOI: https://doi.org/10.1109/CVPRW.2008.4562985
Published: 2008-06-23
Citations: 12
Can similar scenes help surface layout estimation?
S. Divvala, Alexei A. Efros, M. Hebert
We describe a preliminary investigation of utilising large amounts of unlabelled image data to help in the estimation of rough scene layout. We take the single-view geometry estimation system of Hoiem et al. (2007) as the baseline and see if it is possible to improve its performance by considering a set of similar scenes gathered from the Web. The two complementary approaches being considered are 1) improving surface classification by using average geometry estimated from the matches, and 2) improving surface segmentation by injecting segments generated from the average of the matched images. The system is evaluated using the labelled 300-image dataset of Hoiem et al. and shows promising results.
DOI: https://doi.org/10.1109/CVPRW.2008.4562951
Published: 2008-06-23
Citations: 18
A statistical framework for the registration of 3D knee implant components to single-plane X-ray images
Jeroen Hermans, J. Bellemans, F. Maes, D. Vandermeulen, P. Suetens
Registration of 3D knee implant components to single-plane X-ray image sequences provides insight into implanted knee kinematics. In this paper a maximum likelihood approach is proposed to align the pose-related occluding contour of an object with edge segments extracted from a single-plane X-ray image. This leads to an expectation maximization algorithm which simultaneously determines the object's pose, estimates point correspondences and rejects outlier points from the registration process. Considering (nearly) planar-symmetrical objects, the method is extended in order to simultaneously estimate two symmetrical object poses which both align the corresponding occluding contours with 2D edge information. The algorithm's capacity to generate accurate pose estimates and the necessity of determining both symmetrical poses when aligning (nearly) planar-symmetrical objects will be demonstrated in the context of automated registration of knee implant components to simulated and real single-plane X-ray images.
DOI: https://doi.org/10.1109/CVPRW.2008.4563004
Published: 2008-06-23
Citations: 3