International Machine Vision and Image Processing Conference (IMVIP 2007): Latest Papers (Page 4)

Eye-tracking for efficient database labelling: Applications to automatic analysis of colonoscopy video

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-09-05 DOI: 10.1109/IMVIP.2007.18

F. Vilariño, G. Lacey

In this paper we present our preliminary results in the automatic analysis of colonoscopy video using eye-tracking. We propose that eye-tracking can be successfully applied to different problems in computer-assisted colonoscopy, such as database labelling, expertise assessment and abnormality detection. We provide results in these three areas, including a machine learning-based system for colon cancer detection using data generated with eye-tracking.

Citations: 1

Video Semantic Content Analysis based on Ontology

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-09-05 DOI: 10.1109/IMVIP.2007.44

Liang Bai, Songyang Lao, Gareth J. F. Jones, A. Smeaton

The rapid increase in the amount of available video data is creating a growing demand for efficient methods for understanding and managing it at the semantic level. New multimedia standards, such as MPEG-4 and MPEG-7, provide the basic functionality needed to manipulate and transmit objects and metadata. However, most of the semantic-level content of video data is outside the scope of these standards. In this paper, a video semantic content analysis framework based on ontology is presented. A domain ontology is used to define high-level semantic concepts and their relations in the context of the examined domain. Low-level features (e.g. visual and aural) and video content analysis algorithms are integrated into the ontology to enrich video semantic analysis. OWL is used for the ontology description. Rules in Description Logic are defined to describe how features and algorithms for video analysis should be applied according to different perception content and low-level features. Temporal Description Logic is used to describe the semantic events, and a reasoning algorithm is proposed for event detection. The proposed framework is demonstrated in a soccer video domain and shows promising results.

Citations: 52

Adaptive Neural Regularization Assignment for Semi-Blind Biomedical Image Restoration

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-09-05 DOI: 10.1109/IMVIP.2007.8

E. Binaghi, I. Gallo, A. Guidali, M. Raspanti, G. Salvini

The aim of this work was to investigate experimentally the potential of an adaptive technique based on the Hopfield neural model for semi-blind restoration of Scanning Electron Microscopy (SEM) images.

Citations: 2

Speckle reduction using the discrete Fourier filtering technique

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-09-05 DOI: 10.1109/IMVIP.2007.38

J. Maycock, B. Hennelly, J. McDonald, Y. Frauel, A. Castro, B. Javidi, T. Naughton

We present a digital signal processing technique that reduces the speckle content in reconstructed digital holograms. The method is based on sequential sampling of the discrete Fourier transform of the reconstructed image field. The resulting images show a reduction in speckle.
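The abstract does not spell out the sampling scheme, so the following is only one plausible reading, sketched under stated assumptions: the DFT of the reconstructed complex field is split into contiguous frequency bands, each band is inverse-transformed on its own, and the resulting intensities are averaged. The function names `dft` and `band_averaged_intensity`, the one-dimensional signal, and the equal-width band layout are all hypothetical choices made for illustration, not the authors' implementation.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform of a 1D sequence."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n) for t in range(n))
        out.append(s / n if inverse else s)
    return out


def band_averaged_intensity(field, num_bands):
    """Average the intensities obtained from each frequency band of the field.

    Assumption: bands are contiguous, equal-width slices of the DFT; the last
    band absorbs any remainder.
    """
    n = len(field)
    if not 1 <= num_bands <= n:
        raise ValueError("num_bands must be between 1 and len(field)")
    spectrum = dft(field)
    width = n // num_bands
    total = [0.0] * n
    for b in range(num_bands):
        lo = b * width
        hi = (b + 1) * width if b < num_bands - 1 else n
        # Keep only the coefficients inside this band, zero the rest.
        band = [spectrum[k] if lo <= k < hi else 0j for k in range(n)]
        sub_field = dft(band, inverse=True)
        for i in range(n):
            total[i] += abs(sub_field[i]) ** 2
    return [v / num_bands for v in total]
```

With `num_bands=1` the function simply returns the intensity of the input field; larger values average more band-limited images, each of which has lower resolution than the original.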

Citations: 7

MPEG-2 to H.264 Transcoding for DVB-H Applications

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-09-05 DOI: 10.1109/IMVIP.2007.29

M. Jiang, D. Crookes

H.264-based video communication systems usually require adaptive transcoding from MPEG-2 to H.264 for video transmission over heterogeneous networks, such as DVB-H, WiMAX and UMTS channels. In this paper, an adaptive MPEG-2 to H.264 transcoder is implemented for different DVB-H capability classes.

Citations: 1

Segmentation of three-dimensional objects from background in digital holograms

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-09-05 DOI: 10.1109/IMVIP.2007.35

C. McElhinney, J. McDonald, A. Castro, Y. Frauel, B. Javidi, T. Naughton

We present a technique for segmenting three-dimensional objects, encoded using in-line digital holography, from the scene's background. We create a volume of reconstructions by numerically reconstructing a digital hologram at a range of depths. For each reconstruction, a variance map is created by calculating the variance over a neighbourhood of each of the reconstruction's pixels. We can then classify a pixel as object or background by thresholding the maximum variance of every pixel over all depths. We present segmentation results for objects of low and high contrast.
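The variance-and-threshold step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes the intensity reconstructions at each depth have already been computed (the numerical hologram reconstruction itself is omitted), and the function names, the square neighbourhood with border clipping, and the `radius` and `threshold` parameters are hypothetical.

```python
from statistics import pvariance


def variance_map(image, radius=1):
    """Local variance of each pixel over a square neighbourhood.

    Assumption: the neighbourhood is (2*radius+1) pixels wide and is clipped
    at the image borders.
    """
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [
                image[rr][cc]
                for rr in range(max(0, r - radius), min(rows, r + radius + 1))
                for cc in range(max(0, c - radius), min(cols, c + radius + 1))
            ]
            out[r][c] = pvariance(window)
    return out


def segment(reconstructions, threshold, radius=1):
    """Label a pixel as object (True) when its maximum local variance over
    all depths exceeds the threshold, otherwise as background (False).

    reconstructions: list of 2D intensity images, one per depth.
    """
    maps = [variance_map(img, radius) for img in reconstructions]
    rows, cols = len(maps[0]), len(maps[0][0])
    return [
        [max(m[r][c] for m in maps) > threshold for c in range(cols)]
        for r in range(rows)
    ]
```

A call such as `segment(stack, threshold=0.05)` returns a boolean mask with the same shape as each reconstruction. Suitable values for `threshold` and `radius` are not given in the abstract and would have to be chosen for the data at hand.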

Citations: 2

A New Manifold Representation for Visual Speech Recognition

International Machine Vision and Image Processing Conference (IMVIP 2007)

Pub Date : 2007-08-27 DOI: 10.1007/978-3-540-74272-2_47

Dahai Yu, O. Ghita, Alistair Sutherland, P. Whelan

Citations: 7