首页 > 最新文献

Proceedings of 3rd International Conference on Document Analysis and Recognition最新文献

英文 中文
Segmentation of complex documents multilevel images: a robust and fast text bodies-headers detection and extraction scheme 复杂文档多层次图像分割:一种鲁棒快速的文本正文-标题检测与提取方案
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.602016
D. Olivier, B. Dominique
We present a method for segmenting multilevels images of documents. The documents are considered difficult ones in the sense they may contain text paragraphs with different orientations and shapes, mixed with graphics and photographs. The proposed method extracts and separates blocks of text lines (printed or handwritten characters) and headers as well as stroke structures. The generic approach is first based on a multiscale analysis with the use of a pyramid representation of the image. At each level, text location is performed by a line borders detection scheme. Then, an efficient bottom-up procedure generates bodies (text paragraphs) as the output of algebric transformations upon a set of four directed graphs associated with the topological relationships of physical components.
提出了一种多层次图像分割方法。这些文件被认为是困难的文件,因为它们可能包含不同方向和形状的文本段落,并夹杂着图形和照片。提出的方法提取和分离文本行块(印刷或手写字符)和标题以及笔画结构。通用方法首先基于多尺度分析,使用图像的金字塔表示。在每个级别上,文本位置由行边界检测方案执行。然后,一个有效的自下而上过程生成主体(文本段落),作为与物理组件的拓扑关系相关的一组四个有向图的代数转换的输出。
{"title":"Segmentation of complex documents multilevel images: a robust and fast text bodies-headers detection and extraction scheme","authors":"D. Olivier, B. Dominique","doi":"10.1109/ICDAR.1995.602016","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.602016","url":null,"abstract":"We present a method for segmenting multilevels images of documents. The documents are considered difficult ones in the sense they may contain text paragraphs with different orientations and shapes, mixed with graphics and photographs. The proposed method extracts and separates blocks of text lines (printed or handwritten characters) and headers as well as stroke structures. The generic approach is first based on a multiscale analysis with the use of a pyramid representation of the image. At each level, text location is performed by a line borders detection scheme. Then, an efficient bottom-up procedure generates bodies (text paragraphs) as the output of algebric transformations upon a set of four directed graphs associated with the topological relationships of physical components.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"28 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133238085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A new approach for Latin/Arabic character segmentation 拉丁/阿拉伯字符分割的一种新方法
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.602040
K. Romeo-Pakker, H. Miled, Y. Lecourtier
In this paper, we propose two methods of character segmentation for Arabic handwritten characters and cursive Latin characters. Classical horizontal and vertical projections detect the lowercase writing area in lines. The problem of overlapping lower or upper strokes is resolved with a contour-following algorithm which starts in the lowercase writing area and labels the detected contours. In the first method, the junction segments connecting the characters to each other are detected by taking into account the writing line thickness. The second method detects the upper contour of each word. The strokes are detected in order to find primary segmentation points (PSP). These points are analysed with an automaton that considers the shape of the word for the determination of definitive segmentation points (DSP). The two methods are compared and the results are discussed.
本文提出了阿拉伯文手写字符和草书拉丁字符的两种字符分割方法。传统的水平和垂直投影检测行中的小写书写区域。采用轮廓跟踪算法解决上下笔画重叠问题,该算法从小写书写区域开始,并标记检测到的轮廓。在第一种方法中,通过考虑书写线粗细来检测连接字符彼此的连接段。第二种方法检测每个单词的上轮廓。检测笔画以找到主要分割点(PSP)。用自动机分析这些点,自动机考虑单词的形状以确定最终分割点(DSP)。对两种方法进行了比较,并对结果进行了讨论。
{"title":"A new approach for Latin/Arabic character segmentation","authors":"K. Romeo-Pakker, H. Miled, Y. Lecourtier","doi":"10.1109/ICDAR.1995.602040","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.602040","url":null,"abstract":"In this paper, we propose two methods of character segmentation for Arabic handwritten characters and cursive Latin characters. Classical horizontal and vertical projections detect the lowercase writing area in lines. The problem of overlapping lower or upper strokes is resolved with a contour-following algorithm which starts in the lowercase writing area and labels the detected contours. In the first method, the junction segments connecting the characters to each other are detected by taking into account the writing line thickness. The second method detects the upper contour of each word. The strokes are detected in order to find primary segmentation points (PSP). These points are analysed with an automaton that considers the shape of the word for the determination of definitive segmentation points (DSP). The two methods are compared and the results are discussed.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133748559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Experiments on extracting structural information from paper documents using syntactic pattern analysis 基于句法模式分析的纸质文档结构信息提取实验
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.599039
T. Bayer, H. Walischewski
Extracting structural information from paper documents supports the daily document processing by, for example, automatically finding index terms, document topics, etc. Knowledge about such components are modeled in a semantic net, which describes geometric properties, spatial relationships, lexical entities as well as lexical relationships. The document model is used to extract the sender, date, recipient, opening and closing formula from a business letter. 181 business letters have been processed, divided into a training set of 20 and the remaining ones for testing. The error rates for the test set range from 0.022 to 0.049 by an average rejection rate of 0.4. Results show that the computational effort can be limited to O(n/sup 2/) given n primitive objects for matching.
从纸质文档中提取结构信息支持日常文档处理,例如,自动查找索引术语、文档主题等。关于这些组件的知识在语义网络中建模,语义网络描述了几何属性、空间关系、词汇实体以及词汇关系。文档模型用于从商业信函中提取发件人、日期、收件人、开始和结束公式。已处理181封商务信函,分为训练集20封,其余为测试集。测试集的错误率范围为0.022至0.049,平均拒绝率为0.4。结果表明,在给定n个基本匹配对象的情况下,计算量可以限制在O(n/sup 2/)。
{"title":"Experiments on extracting structural information from paper documents using syntactic pattern analysis","authors":"T. Bayer, H. Walischewski","doi":"10.1109/ICDAR.1995.599039","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.599039","url":null,"abstract":"Extracting structural information from paper documents supports the daily document processing by, for example, automatically finding index terms, document topics, etc. Knowledge about such components are modeled in a semantic net, which describes geometric properties, spatial relationships, lexical entities as well as lexical relationships. The document model is used to extract the sender, date, recipient, opening and closing formula from a business letter. 181 business letters have been processed, divided into a training set of 20 and the remaining ones for testing. The error rates for the test set range from 0.022 to 0.049 by an average rejection rate of 0.4. Results show that the computational effort can be limited to O(n/sup 2/) given n primitive objects for matching.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134561407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Computer processing on the identification of a Chinese seal image 中国印章图像识别的计算机处理
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.599027
Yung-Sheng Chen
Seal identification is usually performed by hard-matching of human visual inspection. A computer processing of the hard-matching is presented to identify a Chinese seal image. Seals of the author's own are used for experiments. Results show that the proposed approach is feasible.
海豹的识别通常是通过人眼视觉的硬匹配来完成的。提出了一种中文印章图像硬匹配识别的计算机处理方法。作者自己的印章用于实验。结果表明,该方法是可行的。
{"title":"Computer processing on the identification of a Chinese seal image","authors":"Yung-Sheng Chen","doi":"10.1109/ICDAR.1995.599027","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.599027","url":null,"abstract":"Seal identification is usually performed by hard-matching of human visual inspection. A computer processing of the hard-matching is presented to identify a Chinese seal image. Seals of the author's own are used for experiments. Results show that the proposed approach is feasible.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133121233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An intelligent Chinese official document processing system 智能中文公文处理系统
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.602064
Tun-Wen Pai, Tieh-Ming Wu, Gan-How Chang, Pei-Yih Ting
Automated Chinese official document processing techniques and the public-key based cryptographic methodology are used to improve the efficiency of the existing Chinese executive information systems. They provide executive officials more room to achieve the best decision-making. One of the best solutions in document automation and message transmission is to combine low-cost computing power with stable communication capability. In this paper a computer-based architecture for intelligent Chinese official document processing is proposed. The automation of form analysis, document processing, data filing, security control, and information retrieval are discussed in detail. A digital multisignature technology is employed in this system to meet the basic requirements of data security and trust handling. By using such a technology, users will have confidence in utilizing information transmission systems without security suspicion.
采用中文公文自动处理技术和基于公钥的密码学方法,提高了现有中文行政信息系统的效率。它们为行政官员提供了更大的空间来实现最佳决策。文档自动化和消息传输的最佳解决方案之一是将低成本的计算能力与稳定的通信能力相结合。本文提出了一种基于计算机的公文智能处理体系结构。详细讨论了表单分析、文档处理、数据归档、安全控制和信息检索的自动化。该系统采用了数字多重签名技术,满足了数据安全和信任处理的基本要求。通过使用这种技术,用户将有信心使用没有安全疑虑的信息传输系统。
{"title":"An intelligent Chinese official document processing system","authors":"Tun-Wen Pai, Tieh-Ming Wu, Gan-How Chang, Pei-Yih Ting","doi":"10.1109/ICDAR.1995.602064","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.602064","url":null,"abstract":"Automated Chinese official document processing techniques and the public-key based cryptographic methodology are used to improve the efficiency of the existing Chinese executive information systems. They provide executive officials more room to achieve the best decision-making. One of the best solutions in document automation and message transmission is to combine low-cost computing power with stable communication capability. In this paper a computer-based architecture for intelligent Chinese official document processing is proposed. The automation of form analysis, document processing, data filing, security control, and information retrieval are discussed in detail. A digital multisignature technology is employed in this system to meet the basic requirements of data security and trust handling. By using such a technology, users will have confidence in utilizing information transmission systems without security suspicion.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133823570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Character detection based on multi-scale measurement 基于多尺度测量的特征检测
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.601978
H. Hontani, S. Shimotsuji
The paper presents a new method for character string extraction that works efficiently even for a complicated geographical map. The concept of multi-scale measurement is introduced to achieve development of an efficient string detection technique. In this paper, scale means the size of the area where character candidates may exist. The proposed method first merges small black regions within a certain area into a mass. When the size of the area changes, the mass will change. The proposed method observes the change of a mass corresponding to the change of the size of the area, and searches for a stable mass as a character string. Multi-scale measurement enables the detection process to find the adequate size of an area to detect a string. Because a stable mass may include small figures, a test of the shape of a detected mass and a character recognition process follow to judge whether a mass forms a character string. If a mass is rejected, it is split into smaller masses according to the results of multi-scale measurement. These judgment and split processes are repeated to detect character strings from a pattern where several strings are written closely.
本文提出了一种新的字符串提取方法,即使对复杂的地理地图也能有效地进行提取。为了开发高效的管柱检测技术,引入了多尺度测量的概念。在本文中,尺度是指角色候选可能存在的区域的大小。该方法首先将特定区域内的小黑色区域合并成一个团块。当面积的大小改变时,质量也会改变。该方法通过观察质量随区域大小的变化而变化,并以字符串形式搜索稳定质量。多尺度测量使检测过程能够找到合适尺寸的区域来检测管柱。由于稳定的质量可能包含小的数字,因此需要对检测到的质量进行形状测试并进行字符识别处理,以判断质量是否形成字符串。如果一个质量被拒绝,则根据多尺度测量的结果将其分成更小的质量。重复这些判断和分割过程,以从几个字符串写得很近的模式中检测字符串。
{"title":"Character detection based on multi-scale measurement","authors":"H. Hontani, S. Shimotsuji","doi":"10.1109/ICDAR.1995.601978","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.601978","url":null,"abstract":"The paper presents a new method for character string extraction that works efficiently even for a complicated geographical map. The concept of multi-scale measurement is introduced to achieve development of an efficient string detection technique. In this paper, scale means the size of the area where character candidates may exist. The proposed method first merges small black regions within a certain area into a mass. When the size of the area changes, the mass will change. The proposed method observes the change of a mass corresponding to the change of the size of the area, and searches for a stable mass as a character string. Multi-scale measurement enables the detection process to find the adequate size of an area to detect a string. Because a stable mass may include small figures, a test of the shape of a detected mass and a character recognition process follow to judge whether a mass forms a character string. If a mass is rejected, it is split into smaller masses according to the results of multi-scale measurement. These judgment and split processes are repeated to detect character strings from a pattern where several strings are written closely.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124251932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A rule learning method for academic document image processing 学术文献图像处理的规则学习方法
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.598985
A. Takasu, S. Satoh, E. Katsura
A syntactic rule learning method is presented for analyzing document images and constructing a database from them. This method is used in a digital library system named CyberMagazine, where document images are sequentially converted into database tuples by block segmentation, rough classification, and syntactic analysis. The syntactic rule has an ability to analyze symbols located in two dimensional plane, and has a syntax similar to an ordinal context free grammar except for the concatenation of symbols. In the presented learning method, the syntactic rules are generated from a set of parse trees by decomposing the trees according to non terminal symbols, generalizing the decomposed trees to a syntactic rule, and merging them.
提出了一种语法规则学习方法,用于分析文档图像并从中构建数据库。该方法在名为CyberMagazine的数字图书馆系统中使用,其中文档图像通过块分割、粗略分类和语法分析依次转换为数据库元组。该语法规则具有分析位于二维平面上的符号的能力,除了符号的连接之外,其语法与有序上下文无关语法相似。在该学习方法中,通过对一组解析树进行非终结符分解,将分解树泛化为一个语法规则,并将其合并,从而生成语法规则。
{"title":"A rule learning method for academic document image processing","authors":"A. Takasu, S. Satoh, E. Katsura","doi":"10.1109/ICDAR.1995.598985","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.598985","url":null,"abstract":"A syntactic rule learning method is presented for analyzing document images and constructing a database from them. This method is used in a digital library system named CyberMagazine, where document images are sequentially converted into database tuples by block segmentation, rough classification, and syntactic analysis. The syntactic rule has an ability to analyze symbols located in two dimensional plane, and has a syntax similar to an ordinal context free grammar except for the concatenation of symbols. In the presented learning method, the syntactic rules are generated from a set of parse trees by decomposing the trees according to non terminal symbols, generalizing the decomposed trees to a syntactic rule, and merging them.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134294853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Recovering decorative patterns of ceramic objects from a monocular image using a genetic algorithm 利用遗传算法从单眼图像中恢复陶瓷物体的装饰图案
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.599008
H. Tanahashi, K. Sakaue, Kazuhiko Yamamoto
In order to develop a shape and decorative pattern database of old pottery and ceramics objects, it is necessary to obtain the surface pattern on the 3-dimensional shape of the object. This paper describes the recovery of the revolution surface from a monocular image of unknown camera parameters and the retrieval of the 2-dimensional pattern. The camera parameters are obtained by using a genetic algorithm (GA). After the revolution surfaces are reconstructed, these surfaces are developed into a 2-dimensional plane. We show that a scanner-digitized image of an old ceramic object can be analyzed by GA to reconstruct the revolution surface and to develop the 2-dimensional image on the 3-dimensional object.
为了开发古陶瓷器物的形状和装饰图案数据库,需要获得器物三维形状上的表面图案。本文描述了从未知相机参数的单眼图像中恢复旋转表面和二维模式的检索。采用遗传算法获得摄像机参数。对旋转曲面进行重构后,将其展开成二维平面。通过遗传算法分析旧陶瓷物体的扫描数字化图像,可以重建旋转表面,并在三维物体上显示二维图像。
{"title":"Recovering decorative patterns of ceramic objects from a monocular image using a genetic algorithm","authors":"H. Tanahashi, K. Sakaue, Kazuhiko Yamamoto","doi":"10.1109/ICDAR.1995.599008","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.599008","url":null,"abstract":"In order to develop a shape and decorative pattern database of old pottery and ceramics objects, it is necessary to obtain the surface pattern on the 3-dimensional shape of the object. This paper describes the recovery of the revolution surface from a monocular image of unknown camera parameters and the retrieval of the 2-dimensional pattern. The camera parameters are obtained by using a genetic algorithm (GA). After the revolution surfaces are reconstructed, these surfaces are developed into a 2-dimensional plane. We show that a scanner-digitized image of an old ceramic object can be analyzed by GA to reconstruct the revolution surface and to develop the 2-dimensional image on the 3-dimensional object.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131565229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Stroke-based time warping for signature verification 签名验证的基于笔划的时间扭曲
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.598971
B. Wirtz
This paper presents a new technique for dynamic signature verification. A dynamic programming (DP) approach is used for function-based signature verification. Dynamic data such as pressure is treated as a function of positional data and therefore evaluated locally. Verification is based on strokes as the structural units of the signature. This global knowledge is fed into the verification procedure. The application of a 3D non-linear correlation of the signature signals uses the stroke index as the third DP index. In conjunction with the definition of a finite state automaton on the set of reference strokes the system can handle different stroke numbers, missing or additional strokes correctly. The correct alignment of matching strokes is determined simultaneously to the signature verification process; an additional alignment stage before the actual nonlinear correlation is obsolete.
提出了一种新的动态签名验证技术。基于函数的签名验证采用动态规划(DP)方法。动态数据,如压力,被视为位置数据的函数,因此在局部进行评估。验证是基于笔画作为签名的结构单位。这种全局知识被输入到验证程序中。三维非线性相关特征信号的应用使用笔划指数作为第三DP指数。结合有限状态自动机的定义,系统可以正确处理不同的笔画数,缺失或额外的笔画。在签名验证过程中同时确定匹配笔画的正确对齐;在实际的非线性相关之前,一个额外的对准阶段已经过时了。
{"title":"Stroke-based time warping for signature verification","authors":"B. Wirtz","doi":"10.1109/ICDAR.1995.598971","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.598971","url":null,"abstract":"This paper presents a new technique for dynamic signature verification. A dynamic programming (DP) approach is used for function-based signature verification. Dynamic data such as pressure is treated as a function of positional data and therefore evaluated locally. Verification is based on strokes as the structural units of the signature. This global knowledge is fed into the verification procedure. The application of a 3D non-linear correlation of the signature signals uses the stroke index as the third DP index. In conjunction with the definition of a finite state automaton on the set of reference strokes the system can handle different stroke numbers, missing or additional strokes correctly. The correct alignment of matching strokes is determined simultaneously to the signature verification process; an additional alignment stage before the actual nonlinear correlation is obsolete.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131610022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
(Chem)DeT/sub E/X automatic generation of a markup language description of (chemical) documents from bitmap images (化学)DeT/sub E/X从位图图像自动生成(化学)文档的标记语言描述
Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.599035
A. Simon, Jean-Christophe Pret, A.P. Johnson
This paper presents a novel view of document processing, as being the reverse process to T/sub E/X. This concept simplifies the analysis of the physical structure of documents, and also suggests the use of a style file for layout recognition. An algorithm is given for both phases, layout analysis and layout recognition. The bottom-up layout analysis method employed is based on the Kruskal's algorithm and uses the distances between the components to construct the physical page structure. The algorithm is linear with respect to the number of the connected components. For layout recognition, a document style description language (DSDL) is introduced. This helps a fault-tolerant, recursive parsing algorithm to label the blocks of the document. The presented methods were designed to be used for scientific publications (papers, reports, books), but could be applied to a broader range of documents.
本文提出了一种新的文档处理观点,认为它是与T/sub / E/X相反的过程。这个概念简化了对文档物理结构的分析,还建议使用样式文件进行布局识别。给出了布局分析和布局识别两个阶段的算法。采用的自底向上布局分析方法基于Kruskal算法,利用组件之间的距离来构造物理页面结构。该算法与连接组件的数量呈线性关系。在版面识别方面,引入了文档样式描述语言(DSDL)。这有助于容错的递归解析算法标记文档的块。所提出的方法旨在用于科学出版物(论文、报告、书籍),但可以应用于更广泛的文件。
{"title":"(Chem)DeT/sub E/X automatic generation of a markup language description of (chemical) documents from bitmap images","authors":"A. Simon, Jean-Christophe Pret, A.P. Johnson","doi":"10.1109/ICDAR.1995.599035","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.599035","url":null,"abstract":"This paper presents a novel view of document processing, as being the reverse process to T/sub E/X. This concept simplifies the analysis of the physical structure of documents, and also suggests the use of a style file for layout recognition. An algorithm is given for both phases, layout analysis and layout recognition. The bottom-up layout analysis method employed is based on the Kruskal's algorithm and uses the distances between the components to construct the physical page structure. The algorithm is linear with respect to the number of the connected components. For layout recognition, a document style description language (DSDL) is introduced. This helps a fault-tolerant, recursive parsing algorithm to label the blocks of the document. The presented methods were designed to be used for scientific publications (papers, reports, books), but could be applied to a broader range of documents.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133892353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of 3rd International Conference on Document Analysis and Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1