
12th International Conference on Image Analysis and Processing, 2003. Proceedings. Latest articles

An empirical comparison of in-learning and post-learning optimization schemes for tuning the support vector machines in cost-sensitive applications
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234109
F. Tortorella
Support vector machines (SVM) are currently one of the classification systems most used in pattern recognition and data mining because of their accuracy and generalization capability. However, when dealing with very complex classification tasks where different errors bring different penalties, one should take into account the overall classification cost produced by the classifier more than its accuracy. It is thus necessary to provide some methods for tuning the SVM on the costs of the particular application. Depending on the characteristics of the cost matrix, this can be done during or after the learning phase of the classifier. In this paper we introduce two optimization schemes based on the two possible approaches and compare their performance on various data sets and kernels. The first experimental results show that both the proposed schemes are suitable for tuning SVM in cost-sensitive applications.
Citations: 2
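The post-learning route mentioned in the Tortorella abstract above can be illustrated with a small sketch. This is not the paper's scheme: it only assumes that a trained SVM already yields real-valued decision scores, and that a two-class cost matrix reduces to a false-positive cost and a false-negative cost. The names `total_cost`, `best_threshold`, `cost_fp`, and `cost_fn` are hypothetical, and the scores are made-up data.

```python
def total_cost(scores, labels, threshold, cost_fp, cost_fn):
    """Sum misclassification costs when predicting +1 for score >= threshold."""
    cost = 0.0
    for s, y in zip(scores, labels):
        predicted = 1 if s >= threshold else -1
        if predicted == 1 and y == -1:
            cost += cost_fp  # false positive
        elif predicted == -1 and y == 1:
            cost += cost_fn  # false negative
    return cost


def best_threshold(scores, labels, cost_fp, cost_fn):
    """Scan candidate thresholds and return the (threshold, cost) pair with the lowest cost."""
    # Every observed score is a candidate, plus one value above all scores
    # (which corresponds to predicting -1 everywhere).
    candidates = sorted(set(scores)) + [max(scores) + 1.0]
    return min(
        ((t, total_cost(scores, labels, t, cost_fp, cost_fn)) for t in candidates),
        key=lambda pair: pair[1],
    )


# Hypothetical decision values from an already-trained SVM, with true labels.
scores = [-1.2, -0.4, -0.1, 0.3, 0.8, 1.5]
labels = [-1, -1, 1, -1, 1, 1]

# Assumption: missing a positive costs five times as much as a false alarm.
threshold, cost = best_threshold(scores, labels, cost_fp=1.0, cost_fn=5.0)
default_cost = total_cost(scores, labels, 0.0, cost_fp=1.0, cost_fn=5.0)
print(threshold, cost, default_cost)
```

With these toy numbers the usual threshold of 0 misses one positive, so shifting the threshold below zero lowers the total cost. In practice the threshold would be chosen on a validation set rather than on the data used to report results.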
Coding techniques for CFA data images
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234086
S. Battiato, A. Bruna, A. Buemi, F. Naccari
In this paper we present a comparison between different approaches to CFA (colour filter array) image encoding. We compare the performance of three coders: a new algorithm based on a vector quantization technique, JPEG-LS (a low-complexity encoding standard), and classical JPEG. We also show the effects of CFA image encoding on the colour images reconstructed by a typical image generation pipeline. The computational complexity and memory requirements of the different encoding approaches are also discussed.
Citations: 14
Shape recognition by distributed recursive learning of multiscale trees
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234020
L. Lombardi, A. Petrosino
We present an efficient and fully parallel 2D object recognition method based on the use of a multiscale tree representation of the object boundary and recursive learning of trees. Specifically, the object is represented by means of a tree where each node, corresponding to a boundary segment at some level of resolution, is characterized by a real vector containing curvature, length, and symmetry of the boundary segment, while the nodes are connected by arcs when segments at successive levels are spatially related. The recognition procedure is formulated as a training procedure made by recursive neural networks followed by a testing procedure over unknown tree structured patterns.
Citations: 0
FNS and HEIV: relating two vision parameter estimation frameworks
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234042
W. Chojnacki, M. Brooks, A. Hengel, D. Gawley
Problems requiring accurate determination of parameters from image-based quantities arise often in computer vision. Two recent, independently developed frameworks for estimating such parameters are the FNS and HEIV schemes. Here it is shown that FNS (fundamental numerical scheme) and a core version of HEIV (heteroscedastic errors-in-variables) are essentially equivalent, solving a common underlying equation via different means. The analysis is driven by the search for a nondegenerate form of a certain generalised eigenvalue problem, and effectively leads to a new derivation of the relevant case of the HEIV algorithm. This work may be seen as an extension of previous efforts to rationalise and inter-relate a spectrum of estimators, including the renormalisation method of Kanatani and the normalised eight-point method of Hartley.
Citations: 5
Estimation of 3D gazed position using view lines
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234094
Ikuhisa Mitsugami, N. Ukita, M. Kidode
We propose a new wearable system that can estimate the 3D position of a gazed point by measuring multiple binocular view lines. In principle, 3D measurement is possible by the triangulation of binocular view lines. However, it is difficult to measure these lines accurately with a device for eye tracking, because of errors caused by (1) difficulty in calibrating the device and (2) the limitation that a human cannot gaze very accurately at a distant point. Concerning (1), the accuracy of calibration can be improved by considering the optical properties of a camera in the device. To solve (2), we propose a stochastic algorithm that determines a gazed 3D position by integrating information of view lines observed at multiple head positions. We validated the effectiveness of the proposed algorithm experimentally.
Citations: 25
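The gaze abstract above notes that, in principle, a 3D point can be measured by triangulating two binocular view lines. A minimal sketch of that basic step follows. It is not the authors' stochastic algorithm, which integrates view lines from multiple head positions. Because two measured lines rarely intersect exactly, the sketch returns the midpoint of the shortest segment between them. The function name and the eye positions are hypothetical.

```python
def closest_point_between_lines(p1, d1, p2, d2):
    """Midpoint of the shortest segment joining lines p1 + t*d1 and p2 + s*d2."""
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    w = [a - b for a, b in zip(p1, p2)]
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w), dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("view lines are (nearly) parallel")
    # Parameters of the mutually closest points on each line.
    t = (b * e - c * d) / denom
    s = (a * e - b * d) / denom
    q1 = [p + t * v for p, v in zip(p1, d1)]
    q2 = [p + s * v for p, v in zip(p2, d2)]
    return [(x + y) / 2.0 for x, y in zip(q1, q2)]


# Hypothetical setup: eyes 6 cm apart, both looking at a point 60 cm ahead.
left_eye, right_eye = [-3.0, 0.0, 0.0], [3.0, 0.0, 0.0]
left_dir, right_dir = [3.0, 0.0, 60.0], [-3.0, 0.0, 60.0]
gaze_point = closest_point_between_lines(left_eye, left_dir, right_eye, right_dir)
print(gaze_point)
```

The parallel-line guard matters for the problem the abstract raises: when the gazed point is far away the two view lines become nearly parallel, `denom` approaches zero, and small angular errors produce large depth errors.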
Camera calibration and 3D reconstruction using interval analysis
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234078
B. Telle, M. Aldon, N. Ramdani
The paper deals with the problem of error estimation in 3D reconstruction. It shows how interval analysis can be used for this purpose in 3D vision applications. Describing an image point by an interval assumes that its localization is unknown but bounded. We present a new method based on interval analysis tools to propagate this bounded uncertainty. This way of computing can produce guaranteed results, since a datum is not the most probable value but an interval which contains the true value. We validate our method by computing a guaranteed model for a projective camera, and we achieve a guaranteed 3D reconstruction.
Citations: 14
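The idea in the interval-analysis abstract above, treating each image measurement as an interval and propagating it through the computation, can be sketched with a tiny interval type. This is an illustration only, not the authors' method: the `Interval` class is hypothetical, the depth formula Z = f * b / disparity for a rectified stereo pair is an assumption chosen for brevity, and plain floating-point arithmetic is used without the outward rounding a rigorous interval library would apply.

```python
class Interval:
    """Closed interval [lo, hi] assumed to contain the true value."""

    def __init__(self, lo, hi):
        if lo > hi:
            raise ValueError("lower bound exceeds upper bound")
        self.lo, self.hi = lo, hi

    def __sub__(self, other):
        # Smallest result: our low minus their high; largest: our high minus their low.
        return Interval(self.lo - other.hi, self.hi - other.lo)

    def __mul__(self, other):
        products = [self.lo * other.lo, self.lo * other.hi,
                    self.hi * other.lo, self.hi * other.hi]
        return Interval(min(products), max(products))

    def __truediv__(self, other):
        if other.lo <= 0 <= other.hi:
            raise ZeroDivisionError("divisor interval contains zero")
        return self * Interval(1.0 / other.hi, 1.0 / other.lo)

    def __repr__(self):
        return f"Interval({self.lo}, {self.hi})"


# Hypothetical stereo measurement: each pixel column is known to within +/- 0.5 px.
x_left = Interval(119.5, 120.5)
x_right = Interval(99.5, 100.5)
focal_px = Interval(800.0, 800.0)    # focal length in pixels, taken as exact
baseline_m = Interval(0.1, 0.1)      # baseline in metres, taken as exact

disparity = x_left - x_right                  # [19, 21] px
depth = focal_px * baseline_m / disparity     # bounds on depth in metres
print(depth)
```

The nominal depth of 4 m lies inside the resulting bounds, and the interval width shows directly how a one-pixel localization uncertainty translates into depth uncertainty.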
Cumulative level-line matching for image registration
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234046
S. Bouchafa, B. Zavidovique
A new level-line registration technique is proposed for image transform estimation. This approach is robust towards contrast changes, does not require any estimate of the unknown transformation between images and tackles very challenging situations that usually lead to pairing ambiguities, such as repetitive patterns in the images. The registration itself is performed through an efficient level-line cumulative matching based on a multistage primitive election procedure. Each stage provides a coarse estimate of the transformation that the next stage gets to refine. Although we deal with similarity transforms (rotation, scale and translation), our approach can be easily adapted to more general transformations.
Citations: 3
Towards automatic transcription of Syriac handwriting
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234126
W. Clocksin, P. P. Fernando
We describe a method implemented for the recognition of Syriac handwriting from historical manuscripts. The Syriac language has been a neglected area for handwriting recognition research, yet is interesting because the preponderance of scribe-written manuscripts offers a challenging yet tractable medium for OCR research between the extremes of typewritten text and free handwriting. Like Arabic, Syriac is written in a cursive form from right-to-left, and letter shape depends on the position within the word. The method described does not need to find character strokes or contours. Both whole words and character shapes were used in recognition experiments. After segmentation using a novel probabilistic method, features of these shapes are found that tolerate variation in formation and image quality. Each shape is recognised individually using a discriminative support vector machine with 10-fold cross-validation. We describe experiments using a variety of segmentation methods and combinations of features on characters and words. Images from scribe-written historical manuscripts are used, and the recognition results are compared with those for images taken from clearer 19th century typeset documents. Recognition rates vary from 61-100%, depending on the algorithms used and the size and source of the data set.
Citations: 35
Old fashioned state-of-the-art image classification
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234110
A. Barla, F. Odone, A. Verri
In this paper we present a statistical learning scheme for image classification based on a mixture of old fashioned ideas and state of the art learning tools. We represent input images through large dimensional and usually sparse histograms which, depending on the task, are either color histograms or co-occurrence matrices. Support vector machines are trained on these sparse inputs directly, to solve problems like indoor/outdoor classification and cityscape retrieval from image databases. The experimental results indicate that the use of a kernel function derived from the computer vision literature leads to better recognition results than off the shelf kernels. According to our findings, it appears that image classification problems can be addressed with no need of explicit feature extraction or dimensionality reduction stages. We argue that this might be used as the starting point for developing image classification systems which can be easily tuned to a number of different tasks.
Citations: 22
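The abstract above does not name the kernel it takes from the computer vision literature. As an assumption for illustration, the sketch below uses histogram intersection (known from Swain and Ballard's colour indexing work), which compares two histograms directly without any feature extraction step. The function names and the toy histograms are hypothetical.

```python
def histogram_intersection(h1, h2):
    """Sum of bin-wise minima; equals 1.0 for identical normalized histograms."""
    if len(h1) != len(h2):
        raise ValueError("histograms must have the same number of bins")
    return sum(min(a, b) for a, b in zip(h1, h2))


def gram_matrix(histograms, kernel):
    """Pairwise kernel values, in the form a kernel machine would consume."""
    return [[kernel(a, b) for b in histograms] for a in histograms]


# Toy normalized 4-bin histograms standing in for sparse colour histograms.
h_a = [0.5, 0.5, 0.0, 0.0]
h_b = [0.25, 0.25, 0.5, 0.0]
h_c = [0.0, 0.0, 0.5, 0.5]

K = gram_matrix([h_a, h_b, h_c], histogram_intersection)
for row in K:
    print(row)
```

A matrix like `K` could then be handed to an SVM implementation that accepts precomputed kernels, which is one way to train on sparse histograms directly.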
Content-based video summarization and adaptation for ubiquitous media access
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234098
Shih-Fu Chang
Today's mobile and wireless users access multimedia content from different types of networks and terminals. Content analysis plays a critical role in developing effective solutions in meeting unique resource constraints and user preferences in such usage environments. Specifically, content analysis is central to automatic discovery of syntactic-level summaries and generation of concise semantic-level summaries. Content analysis also provides a promising direction for finding optimal adaptation methods under various resource-utility constraints. The paper presents brief overviews of such emerging, fruitful areas and promising research directions.
Citations: 16