首页 > 最新文献

Proceedings of Sixth International Conference on Document Analysis and Recognition最新文献

英文 中文
Automatic identification of English, Chinese, Arabic, Devnagari and Bangla script line 自动识别英文,中文,阿拉伯语,德文加里语和孟加拉语的文字线
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953896
U. Pal, B. Chaudhuri
In a general situation, a document page may contain several scriptforms. For optical character recognition (OCR) of such a document page, it is necessary to separate the scripts before feeding them to their individual OCR systems. An automatic technique for the identification of printed Roman, Chinese, Arabic, Devnagari and Bangla text lines from a single document is proposed. Shape based features, statistical features and some features obtained from the concept of a water reservoir are used for script identification. The proposed scheme has an accuracy of about 97.33%.
一般情况下,一个文档页面可能包含几个脚本表单。对于这种文档页面的光学字符识别(OCR),有必要在将脚本输入各自的OCR系统之前分离脚本。提出了一种从单一文件中自动识别印刷的罗马文、中文、阿拉伯文、德文加里文和孟加拉文文本行的技术。利用基于形状的特征、统计特征和从水库概念中获得的一些特征进行文字识别。该方案的准确率约为97.33%。
{"title":"Automatic identification of English, Chinese, Arabic, Devnagari and Bangla script line","authors":"U. Pal, B. Chaudhuri","doi":"10.1109/ICDAR.2001.953896","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953896","url":null,"abstract":"In a general situation, a document page may contain several scriptforms. For optical character recognition (OCR) of such a document page, it is necessary to separate the scripts before feeding them to their individual OCR systems. An automatic technique for the identification of printed Roman, Chinese, Arabic, Devnagari and Bangla text lines from a single document is proposed. Shape based features, statistical features and some features obtained from the concept of a water reservoir are used for script identification. The proposed scheme has an accuracy of about 97.33%.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"7 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122443578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 78
Separation of overlapping text from graphics 从图形中分离重叠的文本
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953752
Ruini Cao, C. Tan
The separation of overlapping text from graphics is a challenging problem in document image analysis. This paper proposes a specific method for detecting and extracting characters that are touching graphics. It is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of graphics. It combines line continuation with the feature line width to decompose and reconstruct segments underlying the region of intersection. Experimental results showed that the proposed method improved the percentage of correctly detected text as well as the accuracy of character recognition significantly.
在文档图像分析中,重叠文本与图形的分离是一个具有挑战性的问题。本文提出了一种检测和提取触摸图形字符的具体方法。它的基础是观察到,与图形相比,汉字的组成笔画通常是短段的。该方法将线的延拓与特征线的宽度相结合,对相交区域下的线段进行分解和重构。实验结果表明,该方法显著提高了文本的正确率和字符识别的准确率。
{"title":"Separation of overlapping text from graphics","authors":"Ruini Cao, C. Tan","doi":"10.1109/ICDAR.2001.953752","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953752","url":null,"abstract":"The separation of overlapping text from graphics is a challenging problem in document image analysis. This paper proposes a specific method for detecting and extracting characters that are touching graphics. It is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of graphics. It combines line continuation with the feature line width to decompose and reconstruct segments underlying the region of intersection. Experimental results showed that the proposed method improved the percentage of correctly detected text as well as the accuracy of character recognition significantly.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129019035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
A new component based algorithm for newspaper layout analysis 一种新的基于分量的报纸版面分析算法
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953970
Fei Liu, Yupin Luo, D. Hu, Masataka Yoshikawa
The aim of the layout analysis is to extract the geometric structure from a document image. It is a progress of labeling homogenous regions of a document image. In order to present a complex newspaper layout analysis, this paper proposes a new component based bottom-up algorithm. With a novel homogeneity related definition of distance, it maintains a dynamic minimal distance mechanism to decide the components merging sequence. Under the restricting rules generated from the newspaper layout heuristically, we derive the preferred analysis result. Experimental results reveal the proposed approach is effective.
版面分析的目的是从文档图像中提取几何结构。它是对文档图像的同质区域进行标记的过程。为了对复杂的报纸版面进行分析,本文提出了一种新的基于分量的自底向上算法。采用新颖的同质性相关距离定义,保持动态最小距离机制来决定组件合并顺序。在报纸版面启发式生成的约束规则下,推导出优选的分析结果。实验结果表明,该方法是有效的。
{"title":"A new component based algorithm for newspaper layout analysis","authors":"Fei Liu, Yupin Luo, D. Hu, Masataka Yoshikawa","doi":"10.1109/ICDAR.2001.953970","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953970","url":null,"abstract":"The aim of the layout analysis is to extract the geometric structure from a document image. It is a progress of labeling homogenous regions of a document image. In order to present a complex newspaper layout analysis, this paper proposes a new component based bottom-up algorithm. With a novel homogeneity related definition of distance, it maintains a dynamic minimal distance mechanism to decide the components merging sequence. Under the restricting rules generated from the newspaper layout heuristically, we derive the preferred analysis result. Experimental results reveal the proposed approach is effective.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129349595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
DMOS: a generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems 一种通用的文件识别方法,应用于一个自动生成乐谱、数学公式和表格结构的识别系统
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953786
Bertrand Coüasnon
Genericity in structured document recognition is a difficult challenge. We therefore propose a new generic document recognition method, called DMOS (Description and MOdification of Segmentation), that is made up of a new grammatical formalism, called EPF (Enhanced Position Formalism) and an associated parser which is able to introduce context in segmentation. We implement this method to obtain a generator of document recognition systems. This generator can automatically produce new recognition systems. It is only necessary to describe the document with an EPF grammar, which is then simply compiled. In this way, we have developed various recognition systems: one on musical scores, one on mathematical formulae and one on recursive table structures. We have also defined a specific application to damaged military forms of the 19th Century. We have been able to test the generated system on 5,000 of these military forms. This has permitted us to validate the DMOS method on a real-world application.
结构化文档识别中的通用性是一个难题。因此,我们提出了一种新的通用文档识别方法,称为DMOS(分割的描述和修改),它由一种新的语法形式称为EPF(增强位置形式)和一个能够在分割中引入上下文的相关解析器组成。我们实现了这个方法来获得一个文档识别系统的生成器。该生成器可以自动生成新的识别系统。只需要用EPF语法描述文档,然后简单地编译它。通过这种方式,我们开发了各种识别系统:一个是乐谱识别系统,一个是数学公式识别系统,还有一个是递归表结构识别系统。我们还确定了19世纪受损军事形式的具体应用。我们已经能够在5000个这些军事表格上测试生成的系统。这使我们能够在实际应用程序中验证DMOS方法。
{"title":"DMOS: a generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems","authors":"Bertrand Coüasnon","doi":"10.1109/ICDAR.2001.953786","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953786","url":null,"abstract":"Genericity in structured document recognition is a difficult challenge. We therefore propose a new generic document recognition method, called DMOS (Description and MOdification of Segmentation), that is made up of a new grammatical formalism, called EPF (Enhanced Position Formalism) and an associated parser which is able to introduce context in segmentation. We implement this method to obtain a generator of document recognition systems. This generator can automatically produce new recognition systems. It is only necessary to describe the document with an EPF grammar, which is then simply compiled. In this way, we have developed various recognition systems: one on musical scores, one on mathematical formulae and one on recursive table structures. We have also defined a specific application to damaged military forms of the 19th Century. We have been able to test the generated system on 5,000 of these military forms. This has permitted us to validate the DMOS method on a real-world application.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116991005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 67
Individuality of handwriting: a validation study 笔迹的个性:一项验证性研究
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953764
S. Srihari, Sung-Hyuk Cha, Hina Arora, Sangjik Lee
Motivated by several rulings in United States courts concerning expert testimony in general and handwriting testimony in particular, we undertook a study to objectively validate the hypothesis that handwriting is individualistic. Handwriting samples of 1500 individuals, representative of the US population with respect to gender, age, ethnic groups, etc., were obtained. Analyzing differences in handwriting was done by using computer algorithms for extracting features from scanned images of handwriting. Attributes characteristic of the handwriting were obtained, e.g., line separation, slant, character shapes, etc. These attributes, which are a subset of attributes used by expert document examiners, were used to quantitatively establish individuality by using machine learning approaches. Using global attributes of handwriting and very few characters in the writing, the ability to determine the writer with a high degree of confidence was established. The work is a step towards providing scientific support for admitting handwriting evidence in court. The mathematical approach and the resulting software also have the promise of aiding the expert document examiner.
受美国法院关于专家证词和笔迹证词的几项裁决的启发,我们进行了一项研究,以客观地验证笔迹是个人主义的假设。获得了1500个人的笔迹样本,这些人在性别、年龄、种族等方面代表了美国人口。使用计算机算法从扫描的笔迹图像中提取特征来分析笔迹的差异。获得了笔迹的属性特征,如行距、斜度、字符形状等。这些属性是专家文档审查员使用的属性的子集,用于通过使用机器学习方法定量地建立个性。利用笔迹的全局属性和笔迹中的很少字符,建立了高度自信地确定作者的能力。这项工作是向在法庭上承认笔迹证据提供科学支持迈出的一步。数学方法和由此产生的软件也有希望帮助专家文件审查员。
{"title":"Individuality of handwriting: a validation study","authors":"S. Srihari, Sung-Hyuk Cha, Hina Arora, Sangjik Lee","doi":"10.1109/ICDAR.2001.953764","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953764","url":null,"abstract":"Motivated by several rulings in United States courts concerning expert testimony in general and handwriting testimony in particular, we undertook a study to objectively validate the hypothesis that handwriting is individualistic. Handwriting samples of 1500 individuals, representative of the US population with respect to gender, age, ethnic groups, etc., were obtained. Analyzing differences in handwriting was done by using computer algorithms for extracting features from scanned images of handwriting. Attributes characteristic of the handwriting were obtained, e.g., line separation, slant, character shapes, etc. These attributes, which are a subset of attributes used by expert document examiners, were used to quantitatively establish individuality by using machine learning approaches. Using global attributes of handwriting and very few characters in the writing, the ability to determine the writer with a high degree of confidence was established. The work is a step towards providing scientific support for admitting handwriting evidence in court. The mathematical approach and the resulting software also have the promise of aiding the expert document examiner.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117098710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 123
Genetic algorithms for feature selection and weighting, a review and study 遗传算法的特征选择与加权,综述与研究
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953980
Faten Hussein, R. Ward, N. Kharma
Our aim is: a) to present a comprehensive survey of previous attempts at using genetic algorithms (GA) for feature selection in pattern recognition applications, with a special focus on character recognition; and b) to report on work that uses GA to optimize the weights of the classification module of a character recognition system. The main purpose of feature selection is to reduce the number of features, by eliminating irrelevant and redundant features, while simultaneously maintaining or enhancing classification accuracy. Many search algorithms have been used for feature selection. Among those, GA have proven to be an effective computational method, especially in situations where the search space is uncharacterized (mathematically), not fully understood, or/and highly dimensional.
我们的目标是:a)对以前在模式识别应用中使用遗传算法(GA)进行特征选择的尝试进行全面的调查,特别关注字符识别;b)报告使用遗传算法优化字符识别系统分类模块权重的工作。特征选择的主要目的是通过消除不相关和冗余的特征来减少特征的数量,同时保持或提高分类精度。许多搜索算法被用于特征选择。其中,遗传算法已被证明是一种有效的计算方法,特别是在搜索空间未被表征(数学上)、未被完全理解或/和高维的情况下。
{"title":"Genetic algorithms for feature selection and weighting, a review and study","authors":"Faten Hussein, R. Ward, N. Kharma","doi":"10.1109/ICDAR.2001.953980","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953980","url":null,"abstract":"Our aim is: a) to present a comprehensive survey of previous attempts at using genetic algorithms (GA) for feature selection in pattern recognition applications, with a special focus on character recognition; and b) to report on work that uses GA to optimize the weights of the classification module of a character recognition system. The main purpose of feature selection is to reduce the number of features, by eliminating irrelevant and redundant features, while simultaneously maintaining or enhancing classification accuracy. Many search algorithms have been used for feature selection. Among those, GA have proven to be an effective computational method, especially in situations where the search space is uncharacterized (mathematically), not fully understood, or/and highly dimensional.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"144 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114311143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
Flexible Web document analysis for delivery to narrow-bandwidth devices 灵活的Web文档分析,以便交付到窄带宽设备
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953951
Gerald Penn, Jianying Hu, Hengbin Luo, Ryan T. McDonald
We propose a set of baseline heuristics for identifying genuinely tabular information and news links in HTML documents. A prototype implementation of these heuristics is described for delivering content from news providers' home pages to a narrow-bandwidth device such as a portable digital assistant or cellular phone display. Its evaluation on 75 Web sites is provided, along with a discussion of topics for future research.
我们提出了一套基线启发式方法,用于识别HTML文档中真正的表格信息和新闻链接。描述了这些启发式的原型实现,用于将内容从新闻提供者的主页传送到诸如便携式数字助理或蜂窝电话显示器之类的窄带宽设备。提供了对75个网站的评估,以及对未来研究主题的讨论。
{"title":"Flexible Web document analysis for delivery to narrow-bandwidth devices","authors":"Gerald Penn, Jianying Hu, Hengbin Luo, Ryan T. McDonald","doi":"10.1109/ICDAR.2001.953951","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953951","url":null,"abstract":"We propose a set of baseline heuristics for identifying genuinely tabular information and news links in HTML documents. A prototype implementation of these heuristics is described for delivering content from news providers' home pages to a narrow-bandwidth device such as a portable digital assistant or cellular phone display. Its evaluation on 75 Web sites is provided, along with a discussion of topics for future research.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114343037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
Form document identification using line structure based features 使用基于线条结构的特征进行表单文档识别
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953881
Kuo-Chin Fan, Mei-Lin Chang, Yuan-Kai Wang
Form recognition is one of the special applications of document analysis (DA). We present a novel form recognition method by analyzing the line structure embedded in an input form document. First, all vertical and horizontal lines embedded in the form image are extracted. By analyzing the crossing relationships among horizontal lines and vertical lines, a line crossing relationship matrix can be built with each row corresponding to one horizontal line and each column corresponding to one vertical line. Moreover two line distance relationship matrices, horizontal and vertical line distance relationship matrices, are built by analyzing the distance relationships among horizontal lines and vertical lines, respectively. Last, the recognition task is performed by matching these three matrices. Experimental results reveal the feasibility and efficiency of our proposed method in recognizing form documents.
表单识别是文档分析(DA)的一个特殊应用。通过分析嵌入在输入表单文档中的线条结构,提出了一种新的表单识别方法。首先,提取嵌入到表单图像中的所有垂直线和水平线。通过分析水平线和垂直线之间的交叉关系,可以建立一个线条交叉关系矩阵,每一行对应一条水平线,每一列对应一条垂直线。通过分析水平线和垂直线之间的距离关系,分别建立了水平线和垂直线之间的距离关系矩阵。最后,通过匹配这三个矩阵来完成识别任务。实验结果表明了该方法在表单文档识别中的可行性和有效性。
{"title":"Form document identification using line structure based features","authors":"Kuo-Chin Fan, Mei-Lin Chang, Yuan-Kai Wang","doi":"10.1109/ICDAR.2001.953881","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953881","url":null,"abstract":"Form recognition is one of the special applications of document analysis (DA). We present a novel form recognition method by analyzing the line structure embedded in an input form document. First, all vertical and horizontal lines embedded in the form image are extracted. By analyzing the crossing relationships among horizontal lines and vertical lines, a line crossing relationship matrix can be built with each row corresponding to one horizontal line and each column corresponding to one vertical line. Moreover two line distance relationship matrices, horizontal and vertical line distance relationship matrices, are built by analyzing the distance relationships among horizontal lines and vertical lines, respectively. Last, the recognition task is performed by matching these three matrices. Experimental results reveal the feasibility and efficiency of our proposed method in recognizing form documents.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114084407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Arabic hand-written text-line extraction 阿拉伯语手写文本行提取
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953799
Abderrazak Zahour, B. Taconet, P. Mercy, Said Ramdane
This paper describes a text-line extraction based method. The typical segmentation for a printed binary document is based on the horizontal projection analysis and then the regrouping of the connected components. These techniques can't be used for handwritten unconstrained text because data frequently contain undulations and shifts in the baseline, baseline-skew variability and inter-line distance variability. So, we think that the border line for a handwritten unconstrained documents should be a collection of horizontal line segments. From this point of view, we use a partial contour following based method to detect the separating lines. In the current version of our algorithm, we proceed to text slant detection, text line number evaluation by using partial projection. Then we carry out a partial contour following of every line; first in the direction of the writing, then in the opposite direction. After the treatment, the adjacent lines are separated. In the experimental session, we describe the application of the algorithm used for the extraction of text line. Database images contains about one hundred handwritten Arabic texts written by different writers. Results about diacritical points affectation are also reported.
本文描述了一种基于文本行提取的方法。典型的打印二进制文档分割方法是基于水平投影分析,然后将连接的组件重新分组。这些技术不能用于手写的无约束文本,因为数据经常包含基线的波动和移动、基线倾斜可变性和线间距离可变性。因此,我们认为手写的无约束文档的边界线应该是水平线段的集合。从这个角度来看,我们使用基于部分轮廓跟踪的方法来检测分离线。在当前版本的算法中,我们继续使用部分投影进行文本倾斜检测,文本行数评估。然后对每条线进行局部轮廓跟踪;首先顺着书写的方向,然后顺着相反的方向。处理后,相邻的线被分开。在实验部分,我们描述了该算法在文本行提取中的应用。数据库图像包含大约100个由不同作者手写的阿拉伯文本。本文还报道了变音符点影响的结果。
{"title":"Arabic hand-written text-line extraction","authors":"Abderrazak Zahour, B. Taconet, P. Mercy, Said Ramdane","doi":"10.1109/ICDAR.2001.953799","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953799","url":null,"abstract":"This paper describes a text-line extraction based method. The typical segmentation for a printed binary document is based on the horizontal projection analysis and then the regrouping of the connected components. These techniques can't be used for handwritten unconstrained text because data frequently contain undulations and shifts in the baseline, baseline-skew variability and inter-line distance variability. So, we think that the border line for a handwritten unconstrained documents should be a collection of horizontal line segments. From this point of view, we use a partial contour following based method to detect the separating lines. In the current version of our algorithm, we proceed to text slant detection, text line number evaluation by using partial projection. Then we carry out a partial contour following of every line; first in the direction of the writing, then in the opposite direction. After the treatment, the adjacent lines are separated. In the experimental session, we describe the application of the algorithm used for the extraction of text line. Database images contains about one hundred handwritten Arabic texts written by different writers. Results about diacritical points affectation are also reported.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125347342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 127
Planar Markov modeling for Arabic writing recognition: advancement state 阿拉伯文字识别的平面马尔可夫模型:进展状态
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953757
H. Miled, N. Amara
In this paper, we show how planar hidden Markov models (PHMM) can offer great potential to solve difficult Arabic character recognition problems, especially its cursivness. A convenient architecture is defined for printed Arabic sub-words. It yields an easy solution to implement the modeling of the different morphological variations of the Arabic writing, i.e., vertical and variable horizontal linkages. A more flexible architecture, developed for Arabic handwritten words, is under test. The structure proposed presents the aptitude to absorb the variability of the manuscript. Indeed, the experiments have shown promising results and directions for further improvements. In the present paper, we describe both retained architectures, showing the applicability of the PHMM to the Arabic complexities. This is owed precisely to the definition of the PHMM, which permits to follow efficiently the natural variations in bands of the Arabic script.
在本文中,我们展示了平面隐马尔可夫模型(PHMM)如何在解决困难的阿拉伯字符识别问题,特别是其草书问题上提供巨大的潜力。为打印的阿拉伯子词定义了一种方便的体系结构。它产生了一个简单的解决方案来实现阿拉伯文字的不同形态变化的建模,即垂直和可变的水平联系。为阿拉伯手写文字开发的更灵活的架构正在测试中。所提出的结构表现出吸收手稿变异性的能力。事实上,实验已经显示出有希望的结果和进一步改进的方向。在本文中,我们描述了这两种保留的体系结构,展示了PHMM对阿拉伯复杂性的适用性。这完全归功于PHMM的定义,它允许有效地遵循阿拉伯文字带的自然变化。
{"title":"Planar Markov modeling for Arabic writing recognition: advancement state","authors":"H. Miled, N. Amara","doi":"10.1109/ICDAR.2001.953757","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953757","url":null,"abstract":"In this paper, we show how planar hidden Markov models (PHMM) can offer great potential to solve difficult Arabic character recognition problems, especially its cursivness. A convenient architecture is defined for printed Arabic sub-words. It yields an easy solution to implement the modeling of the different morphological variations of the Arabic writing, i.e., vertical and variable horizontal linkages. A more flexible architecture, developed for Arabic handwritten words, is under test. The structure proposed presents the aptitude to absorb the variability of the manuscript. Indeed, the experiments have shown promising results and directions for further improvements. In the present paper, we describe both retained architectures, showing the applicability of the PHMM to the Arabic complexities. This is owed precisely to the definition of the PHMM, which permits to follow efficiently the natural variations in bands of the Arabic script.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128060524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
期刊
Proceedings of Sixth International Conference on Document Analysis and Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1