Text Segmentation from Complex Background Using Sparse Representations

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007). Pub Date: 2007-09-23. DOI: 10.1109/ICDAR.2007.246

Wumo Pan, T. D. Bui, C. Suen

Citations: 15

Abstract

This paper presents a novel method for segmenting text from a complex background. The idea is inspired by recent developments in finding sparse signal representations over a family of over-complete atoms, called a dictionary. We assume that the image under investigation is composed of two components: the foreground text and the complex background. We further assume that the background can be modeled as a piecewise-smooth function. We then choose two dictionaries: the first gives a sparse representation of one component and a non-sparse representation of the other, while the second does the opposite. By seeking the sparse representation in each dictionary, we can decompose the image into its two components. Text segmentation is then achieved by applying simple thresholding to the text component. Preliminary experiments show promising results.
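The two-dictionary decomposition can be illustrated on a toy 1-D signal. The abstract does not name the dictionaries or the optimization procedure, so everything specific below is an assumption made for illustration: a DCT basis stands in for the dictionary that sparsely represents the smooth background, the pixel (identity) basis stands in for the dictionary that sparsely represents the text, and the sparse representations are sought by alternating hard thresholding with a decreasing threshold. The function names, the toy signal, and the final cutoff of 0.5 are hypothetical and are not taken from the paper.

```python
import math

def dct_basis(n):
    """Rows are the orthonormal DCT-II atoms of length n."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)])
    return rows

def analyze(basis, x):
    """Coefficients of x in the basis (inner product with every atom)."""
    return [sum(a * v for a, v in zip(atom, x)) for atom in basis]

def synthesize(basis, coeffs):
    """Rebuild a signal from its coefficients."""
    out = [0.0] * len(basis[0])
    for c, atom in zip(coeffs, basis):
        if c != 0.0:
            for i, a in enumerate(atom):
                out[i] += c * a
    return out

def hard_threshold(values, t):
    """Keep entries whose magnitude exceeds t; zero the rest."""
    return [v if abs(v) > t else 0.0 for v in values]

def decompose(signal, n_iter=100):
    """Split signal into a DCT-sparse part and a pixel-sparse part (assumed scheme)."""
    n = len(signal)
    basis = dct_basis(n)
    background = [0.0] * n
    text = [0.0] * n
    t_max = max(abs(c) for c in analyze(basis, signal))
    for it in range(n_iter):
        t = t_max * (1.0 - (it + 1) / n_iter)  # threshold decreases linearly to 0
        # Background update: sparse DCT fit to what the text part does not explain.
        residual = [s - x for s, x in zip(signal, text)]
        background = synthesize(basis, hard_threshold(analyze(basis, residual), t))
        # Text update: sparse pixel-domain fit to what the background does not explain.
        residual = [s - b for s, b in zip(signal, background)]
        text = hard_threshold(residual, t)
    return background, text

# Toy data (assumed): a smooth cosine background plus three isolated "text" pixels.
n = 64
true_background = [2.0 + 1.5 * math.cos(2.0 * math.pi * (i + 0.5) / n) for i in range(n)]
text_positions = [10, 30, 45]
signal = list(true_background)
for p in text_positions:
    signal[p] += 1.0

background, text = decompose(signal)

# Final step from the abstract: simple thresholding of the text component.
detected = [i for i, v in enumerate(text) if abs(v) > 0.5]
print(detected)
```

The toy background is built from DCT atoms, so it is exactly sparse in the first dictionary, which is a far easier case than a real image. A 2-D version would need dictionaries suited to piecewise-smooth images and to text strokes, and the abstract does not specify which ones the authors chose.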
