In this paper we present our preliminary results in the automatic analysis of colonoscopy video using eye-tracking. We propose that eye-tracking can be successfully applied to solve different problems in computer assisted colonoscopy, such as database labelling, expertise assessment and abnormality detection. We provide results in these three areas, including a machine learning-based system for colon cancer detection using data generated with eye-tracking.
{"title":"Eye-tracking for efficient database labelling: Applications to automatic analysis of colonoscopy video","authors":"F. Vilariño, G. Lacey","doi":"10.1109/IMVIP.2007.18","DOIUrl":"https://doi.org/10.1109/IMVIP.2007.18","url":null,"abstract":"In this paper we present our preliminary results in the automatic analysis of colonoscopy video using eye-tracking. We propose that eye-tracking can be successfully applied to solve different problems in computer assisted colonoscopy, such as database labelling, expertise assessment and abnormality detection. We provide results in these three areas, including a machine learning-based system for colon cancer detection using data generated with eye-tracking.","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133415909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Liang Bai, Songyang Lao, Gareth J. F. Jones, A. Smeaton
The rapid increase in the available amount of video data is creating a growing demand for efficient methods for understanding and managing it at the semantic level. New multimedia standards, such as MPEG-4 and MPEG-7, provide the basic functionalities in order to manipulate and transmit objects and metadata. But importantly, most of the content of video data at a semantic level is out of the scope of the standards. In this paper, a video semantic content analysis framework based on ontology is presented. Domain ontology is used to define high level semantic concepts and their relations in the context of the examined domain. And low-level features (e.g. visual and aural) and video content analysis algorithms are integrated into the ontology to enrich video semantic analysis. OWL is used for the ontology description. Rules in Description Logic are defined to describe how features and algorithms for video analysis should be applied according to different perception content and low-level features. Temporal Description Logic is used to describe the semantic events, and a reasoning algorithm is proposed for events detection. The proposed framework is demonstrated in a soccer video domain and shows promising results.
{"title":"Video Semantic Content Analysis based on Ontology","authors":"Liang Bai, Songyang Lao, Gareth J. F. Jones, A. Smeaton","doi":"10.1109/IMVIP.2007.44","DOIUrl":"https://doi.org/10.1109/IMVIP.2007.44","url":null,"abstract":"The rapid increase in the available amount of video data is creating a growing demand for efficient methods for understanding and managing it at the semantic level. New multimedia standards, such as MPEG-4 and MPEG-7, provide the basic functionalities in order to manipulate and transmit objects and metadata. But importantly, most of the content of video data at a semantic level is out of the scope of the standards. In this paper, a video semantic content analysis framework based on ontology is presented. Domain ontology is used to define high level semantic concepts and their relations in the context of the examined domain. And low-level features (e.g. visual and aural) and video content analysis algorithms are integrated into the ontology to enrich video semantic analysis. OWL is used for the ontology description. Rules in Description Logic are defined to describe how features and algorithms for video analysis should be applied according to different perception content and low-level features. Temporal Description Logic is used to describe the semantic events, and a reasoning algorithm is proposed for events detection. The proposed framework is demonstrated in a soccer video domain and shows promising results.","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"155 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126024217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Binaghi, I. Gallo, A. Guidali, M. Raspanti, G. Salvini
The aim of this work was to experimentally investigate the potentialities of an adaptive technique based on the Hopfield neural model for semi-blind restoration of Scanning Electron Microscopy (SEM) images.
{"title":"Adaptive Neural Regularization Assignment for Semi-Blind Biomedical Image Restoration","authors":"E. Binaghi, I. Gallo, A. Guidali, M. Raspanti, G. Salvini","doi":"10.1109/IMVIP.2007.8","DOIUrl":"https://doi.org/10.1109/IMVIP.2007.8","url":null,"abstract":"The aim of this work was to experimentally investigate the potentialities of an adaptive technique based on the Hopfield neural model for semi-blind restoration of Scanning Electron Microscopy (SEM) images.","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131123912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Maycock, B. Hennelly, J. McDonald, Y. Frauel, A. Castro, B. Javidi, T. Naughton
We present a digital signal processing technique that reduces the speckle content in reconstructed digital holograms. The method is based on sequential sampling of the discrete Fourier transform of the reconstructed image field. The resulting images show a reduction in speckle.
{"title":"Speckle reduction using the discrete Fourier filtering technique","authors":"J. Maycock, B. Hennelly, J. McDonald, Y. Frauel, A. Castro, B. Javidi, T. Naughton","doi":"10.1109/IMVIP.2007.38","DOIUrl":"https://doi.org/10.1109/IMVIP.2007.38","url":null,"abstract":"We present a digital signal processing technique that reduces the speckle content in reconstructed digital holograms. The method is based on sequential sampling of the discrete Fourier transform of the reconstructed image field. The resulting images show a reduction in speckle.","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"03 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115845113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The H.264-based video communication systems usually require an adaptive transcoding from MPEG-2 to H.264 for video transmission on the heterogeneous network, such as DlVB-H, WiMAX and UMTS channels. In this paper, an adaptive transcoder of MPEG-2 to H.264 was implemented for different DVB-H capability classes.
{"title":"MPEG-2 to H.264 Transcoding for DVB-H Applications","authors":"M. Jiang, D. Crookes","doi":"10.1109/IMVIP.2007.29","DOIUrl":"https://doi.org/10.1109/IMVIP.2007.29","url":null,"abstract":"The H.264-based video communication systems usually require an adaptive transcoding from MPEG-2 to H.264 for video transmission on the heterogeneous network, such as DlVB-H, WiMAX and UMTS channels. In this paper, an adaptive transcoder of MPEG-2 to H.264 was implemented for different DVB-H capability classes.","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116058680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
C. McElhinney, J. McDonald, A. Castro, Y. Frauel, B. Javidi, T. Naughton
We present a technique for performing segmentation of three-dimensional, objects encoded using in-line digital holography from the scenes background. We create a volume of reconstructions through numerically reconstructing a digital hologram at a range of depths. For each reconstruction a variance map is created through calculating variance about a neighbourhood for each of the reconstructions pixels. We can then classify a pixel as object or background by thresholding the maximum variance of every pixel over all depths. We present segmentation results for objects of low and high contrast.
{"title":"Segmentation of three-dimensional objects from background in digital holograms","authors":"C. McElhinney, J. McDonald, A. Castro, Y. Frauel, B. Javidi, T. Naughton","doi":"10.1109/IMVIP.2007.35","DOIUrl":"https://doi.org/10.1109/IMVIP.2007.35","url":null,"abstract":"We present a technique for performing segmentation of three-dimensional, objects encoded using in-line digital holography from the scenes background. We create a volume of reconstructions through numerically reconstructing a digital hologram at a range of depths. For each reconstruction a variance map is created through calculating variance about a neighbourhood for each of the reconstructions pixels. We can then classify a pixel as object or background by thresholding the maximum variance of every pixel over all depths. We present segmentation results for objects of low and high contrast.","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"2017 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128589489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-08-27DOI: 10.1007/978-3-540-74272-2_47
Dahai Yu, O. Ghita, Alistair Sutherland, P. Whelan
{"title":"A New Manifold Representation for Visual Speech Recognition","authors":"Dahai Yu, O. Ghita, Alistair Sutherland, P. Whelan","doi":"10.1007/978-3-540-74272-2_47","DOIUrl":"https://doi.org/10.1007/978-3-540-74272-2_47","url":null,"abstract":"","PeriodicalId":249544,"journal":{"name":"International Machine Vision and Image Processing Conference (IMVIP 2007)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130298026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}