Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620614
G. Hutton, M. Cripps, D. Elliman, C. Higgins
This paper describes a strategy for online interpretation of sketched engineering drawings. It represents the design for The Designer's Apprentice-a pen-based system for producing detailed mechanical engineering drawings on a realistic electronic drawing board. The paper examines the problems of interpreting scanned drawings and assesses how such problems affect online systems. It is therefore relevant to online and offline systems. The strategy is enhanced by making an early distinction between annotation and the object outline. This discrimination shapes the subsequent processing: the object outline is subjected to node connecting, face-finding and beautification whereas annotation is classified according to the BS308 engineering standard.
{"title":"A strategy for on-line interpretation of sketched engineering drawings","authors":"G. Hutton, M. Cripps, D. Elliman, C. Higgins","doi":"10.1109/ICDAR.1997.620614","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620614","url":null,"abstract":"This paper describes a strategy for online interpretation of sketched engineering drawings. It represents the design for The Designer's Apprentice-a pen-based system for producing detailed mechanical engineering drawings on a realistic electronic drawing board. The paper examines the problems of interpreting scanned drawings and assesses how such problems affect online systems. It is therefore relevant to online and offline systems. The strategy is enhanced by making an early distinction between annotation and the object outline. This discrimination shapes the subsequent processing: the object outline is subjected to node connecting, face-finding and beautification whereas annotation is classified according to the BS308 engineering standard.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126705000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620596
Y. Tang, Jiming Liu, Lihua Yang
The paper presents a wavelet based approach to edge detection in document processing. According to local analysis of the document images using wavelet theory, a novel method is developed to detect the edges in document processing, including extraction of the contours of characters and extraction of the reference lines in the form document images with gray levels. In this method, the quadratic spline wavelet is utilized. Experiments have been contacted. The positive results show the effectiveness of the application of the quadratic spline wavelet to edge detection, especially to extract the reference lines and image boundaries in document processing.
{"title":"Quadratic spline wavelet approach to automatic extraction of baselines from document images","authors":"Y. Tang, Jiming Liu, Lihua Yang","doi":"10.1109/ICDAR.1997.620596","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620596","url":null,"abstract":"The paper presents a wavelet based approach to edge detection in document processing. According to local analysis of the document images using wavelet theory, a novel method is developed to detect the edges in document processing, including extraction of the contours of characters and extraction of the reference lines in the form document images with gray levels. In this method, the quadratic spline wavelet is utilized. Experiments have been contacted. The positive results show the effectiveness of the application of the quadratic spline wavelet to edge detection, especially to extract the reference lines and image boundaries in document processing.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127348120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620595
V. Eglin, H. Emptoz
The paper presents a page segmentation method which is based on perception phenomena and displays the unequal importance of information in the visual field. The access of information is directly linked to the search of attractive areas. This search is based on the idea of freeing oneself from an unbending physical structure and from a uniform vertical and horizontal scanning of the document, so as to classify the data in order of importance and interest. Using a space variant geometry for block selection, the page image, instead of being represented by a bitmap format, can be abstractly represented by the block format. This space variant geometry lays a sound basis for elaborating the kinetics of the ocular shifting on a document, which provides not only a meaningless document representation in blocks, but shows a unified view corresponding to the integration of time variant representations of the same visual field.
{"title":"Logarithmic spiral grid and gaze control for the development of strategies of visual segmentation on a document","authors":"V. Eglin, H. Emptoz","doi":"10.1109/ICDAR.1997.620595","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620595","url":null,"abstract":"The paper presents a page segmentation method which is based on perception phenomena and displays the unequal importance of information in the visual field. The access of information is directly linked to the search of attractive areas. This search is based on the idea of freeing oneself from an unbending physical structure and from a uniform vertical and horizontal scanning of the document, so as to classify the data in order of importance and interest. Using a space variant geometry for block selection, the page image, instead of being represented by a bitmap format, can be abstractly represented by the block format. This space variant geometry lays a sound basis for elaborating the kinetics of the ocular shifting on a document, which provides not only a meaningless document representation in blocks, but shows a unified view corresponding to the integration of time variant representations of the same visual field.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126553709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620573
K. Keeni, H. Shimodaira, K. Nakayama
The paper presents an automatic coding scheme for representing the output layer of a neural network. Compared to local representation where the number of output unit is p, the number of output unit required for the proposed representation is close to logp. The output of seven different printers were used for evaluating the performance of the system. The proposed automatic representation gave the average recognition rate of 98.7% for 71 categories.
{"title":"On distributed representation of output layer for recognizing Japanese Kana characters using neural networks","authors":"K. Keeni, H. Shimodaira, K. Nakayama","doi":"10.1109/ICDAR.1997.620573","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620573","url":null,"abstract":"The paper presents an automatic coding scheme for representing the output layer of a neural network. Compared to local representation where the number of output unit is p, the number of output unit required for the proposed representation is close to logp. The output of seven different printers were used for evaluating the performance of the system. The proposed automatic representation gave the average recognition rate of 98.7% for 71 categories.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121811422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620645
S. Masaki, M. Kobayashi, O. Miyamoto, Y. Nakagawa, T. Matsumoto
A new algorithm RAV (Reparameterized Angle Variations) is proposed which makes explicit use of trajectory information where the time evolution of the pen coordinates plays a crucial role. The algorithm is extremely robust against stroke connections ("Tsuzukeji") as well as shape distortions ("Kuzushi-ji"). Preliminary experiments are reported on tests against the Kuchibue.d-96-02 data base from Tokyo University of Agriculture and Technology.
{"title":"An on-line handwriting character recognition algorithm. RAV (reparameterized angle variations)","authors":"S. Masaki, M. Kobayashi, O. Miyamoto, Y. Nakagawa, T. Matsumoto","doi":"10.1109/ICDAR.1997.620645","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620645","url":null,"abstract":"A new algorithm RAV (Reparameterized Angle Variations) is proposed which makes explicit use of trajectory information where the time evolution of the pen coordinates plays a crucial role. The algorithm is extremely robust against stroke connections (\"Tsuzukeji\") as well as shape distortions (\"Kuzushi-ji\"). Preliminary experiments are reported on tests against the Kuchibue.d-96-02 data base from Tokyo University of Agriculture and Technology.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126448107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.619859
Jie Zhou, Q. Gan, C. Suen
The paper describes a high performance offline system for recognizing hand printed numerals. An innovative verification module is applied which drastically improves the recognition rate. The approaches used in the modules are described. The importance of the verification module is analysed in detail. A practical automatic form reading system TOCR V1.0 was developed based on the algorithms. The system was put into practical use in several provinces of China for statistical analysis of Revenue China. Test results are given based on: 1) data collected when the system was used in China, as well as 2) the CENPARMI database.
{"title":"A high performance hand-printed numeral recognition system with verification module","authors":"Jie Zhou, Q. Gan, C. Suen","doi":"10.1109/ICDAR.1997.619859","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619859","url":null,"abstract":"The paper describes a high performance offline system for recognizing hand printed numerals. An innovative verification module is applied which drastically improves the recognition rate. The approaches used in the modules are described. The importance of the verification module is analysed in detail. A practical automatic form reading system TOCR V1.0 was developed based on the algorithms. The system was put into practical use in several provinces of China for statistical analysis of Revenue China. Test results are given based on: 1) data collected when the system was used in China, as well as 2) the CENPARMI database.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125558553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.619876
L. Vuurpijl, Lambert Schomaker
The paper introduces a variant of agglomerative hierarchical clustering techniques. The new technique is used for categorizing character shapes (allographs) in large data sets of handwriting into a hierarchical structure. Such a technique may be used as the basis for a systematic naming scheme of character shapes. Problems with existing methods are described and the proposed method is explained. After application of the method to a very large set of characters, separately for all the letters of the alphabet, relevant clusters are identified and given a unique name. Each cluster represents an allograph prototype.
{"title":"Finding structure in diversity: a hierarchical clustering method for the categorization of allographs in handwriting","authors":"L. Vuurpijl, Lambert Schomaker","doi":"10.1109/ICDAR.1997.619876","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.619876","url":null,"abstract":"The paper introduces a variant of agglomerative hierarchical clustering techniques. The new technique is used for categorizing character shapes (allographs) in large data sets of handwriting into a hierarchical structure. Such a technique may be used as the basis for a systematic naming scheme of character shapes. Problems with existing methods are described and the proposed method is explained. After application of the method to a very large set of characters, separately for all the letters of the alphabet, relevant clusters are identified and given a unique name. Each cluster represents an allograph prototype.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129549067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620673
François Parmentier, A. Belaïd
Presents an approach for the logical structure recognition of bibliographic references. The objective is to produce, for each reference (given in a display format such as Postscript), structured data containing the hierarchy of fields recognized. As a result of variation among bibliographic references (in the order and typographic format of fields, or writing style of the author, for example), we need a robust and tolerant system architecture. Thus, recognition is performed by a concept-oriented system that uses a model which is automatically built from a reference database. This model represents the reference fields and includes statistics on the occurrence of their terms. Recognition is achieved by a step-by-step activation of the more pertinent concepts. Each activated concept causes the execution of an appropriate searching agent. This architecture is robust and non-deterministic, allowing a solution even in difficult cases.
{"title":"Logical structure recognition of scientific bibliographic references","authors":"François Parmentier, A. Belaïd","doi":"10.1109/ICDAR.1997.620673","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620673","url":null,"abstract":"Presents an approach for the logical structure recognition of bibliographic references. The objective is to produce, for each reference (given in a display format such as Postscript), structured data containing the hierarchy of fields recognized. As a result of variation among bibliographic references (in the order and typographic format of fields, or writing style of the author, for example), we need a robust and tolerant system architecture. Thus, recognition is performed by a concept-oriented system that uses a model which is automatically built from a reference database. This model represents the reference fields and includes statistics on the occurrence of their terms. Recognition is achieved by a step-by-step activation of the more pertinent concepts. Each activated concept causes the execution of an appropriate searching agent. This architecture is robust and non-deterministic, allowing a solution even in difficult cases.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129758513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620556
J. Ogier, R. Mullot, J. Labiche, Y. Lecourtier
The methodology we have used for the interpretation of French cadastral documents focuses on a number of studied results of the visual perception and a hierarchical description of the document. The strategy used has been based on the "model" document, employing a mixed approach including various "points of view" about the image to be processed. The results of this mixed analysis reveal the appearance of noninterpretable objects on the cadaster, due to the presence of semantic uncoherence. Thanks to the return cycles between the high and low level processing, an analytical strategy is proposed to independently cure the incoherence, thus to attain the most reliable interpretation of the cadastral map.
{"title":"An image interpretation device can not be reliable without any semantic coherency analysis of the interpreted objects-application to French cadastral maps","authors":"J. Ogier, R. Mullot, J. Labiche, Y. Lecourtier","doi":"10.1109/ICDAR.1997.620556","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620556","url":null,"abstract":"The methodology we have used for the interpretation of French cadastral documents focuses on a number of studied results of the visual perception and a hierarchical description of the document. The strategy used has been based on the \"model\" document, employing a mixed approach including various \"points of view\" about the image to be processed. The results of this mixed analysis reveal the appearance of noninterpretable objects on the cadaster, due to the presence of semantic uncoherence. Thanks to the return cycles between the high and low level processing, an analytical strategy is proposed to independently cure the incoherence, thus to attain the most reliable interpretation of the cadastral map.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131651462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1997-08-18DOI: 10.1109/ICDAR.1997.620549
L. Prevost, M. Milgram
The authors introduce a new method for on-line character recognition based on the co-operation of two classifiers, a static one and a dynamic one. In fact, on-line and off-line recognition present very different qualities and small redundancy. Its complementary treatment can bring very interesting results. In their approach, each classifier which operates respectively on static and dynamic character properties, uses the k-nearest-neighbour algorithm. References have been selected previously, using a clustering technic based on dynamic programming, which takes into account the intra-class variability of dynamics characters. This allows data compilation and increases recognition speed. Test data are presented to both classifiers and results are integrated by a static supervisor which provides the final decision. They present the results on their omniscriptor database which count 36 different classes of character and more than 36000 different characters.
{"title":"Static and dynamic classifier fusion for character recognition","authors":"L. Prevost, M. Milgram","doi":"10.1109/ICDAR.1997.620549","DOIUrl":"https://doi.org/10.1109/ICDAR.1997.620549","url":null,"abstract":"The authors introduce a new method for on-line character recognition based on the co-operation of two classifiers, a static one and a dynamic one. In fact, on-line and off-line recognition present very different qualities and small redundancy. Its complementary treatment can bring very interesting results. In their approach, each classifier which operates respectively on static and dynamic character properties, uses the k-nearest-neighbour algorithm. References have been selected previously, using a clustering technic based on dynamic programming, which takes into account the intra-class variability of dynamics characters. This allows data compilation and increases recognition speed. Test data are presented to both classifiers and results are integrated by a static supervisor which provides the final decision. They present the results on their omniscriptor database which count 36 different classes of character and more than 36000 different characters.","PeriodicalId":435320,"journal":{"name":"Proceedings of the Fourth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133959146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}