A Shared Parts Model for Document Image Recognition

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Pub Date : 2007-09-23 DOI:10.1109/ICDAR.2007.34

Mithun Das Gupta, Prateek Sarkar

引用次数: 6

Abstract

We address document image classification by visual appearance. An image is represented by a variable-length list of visually salient features. A hierarchical Bayesian network is used to model the joint density of these features. This model promotes generalization from a few samples by sharing component probability distributions among different categories, and by factoring out a common displacement vector shared by all features within an image. The Bayesian network is implemented as a factor graph, and parameter estimation and inference are both done by loopy belief propagation. We explain and illustrate our model on a simple shape classification task. We obtain close to 90% accuracy on classifying journal articles from memos in the UWASH-II dataset, as well as on other classification tasks on a home-grown data set of technical articles.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

文档图像识别的共享部件模型

我们通过视觉外观来处理文档图像分类。图像由视觉显著特征的可变长度列表表示。使用层次贝叶斯网络对这些特征的联合密度进行建模。该模型通过共享不同类别之间的分量概率分布，并通过分解出图像中所有特征共享的公共位移向量来促进少数样本的泛化。贝叶斯网络以因子图的形式实现，参数估计和推理都是通过循环信念传播来完成的。我们在一个简单的形状分类任务上解释和说明我们的模型。我们从UWASH-II数据集中的备忘录中对期刊文章进行分类的准确率接近90%，以及在本地技术文章数据集上的其他分类任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

自引率

0.00%

发文量