{"title":"基于知识的中文档案文献理解方法","authors":"Shih-Shien You, Gan-How Chang, Pao-Chung Chang, Bing-Shan Chien","doi":"10.1109/ICDAR.1995.601957","DOIUrl":null,"url":null,"abstract":"The Chinese archive document possesses special geometrical and logical properties due to its construction based upon rectangular field which contain either title strings or data strings related to some other titles. In this paper, we propose a knowledge-based approach to analyze the logical relationship among the fields. After extracting the lines and fields of an archive document image, this procedure can identify fields as the title fields, the sub-title fields (if there exist such tree-structure logical relationship), and the corresponding data fields. This proposed approach enables us to achieve a better performance in information manipulation of archive documents.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A knowledge-based approach to Chinese archive document understanding\",\"authors\":\"Shih-Shien You, Gan-How Chang, Pao-Chung Chang, Bing-Shan Chien\",\"doi\":\"10.1109/ICDAR.1995.601957\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Chinese archive document possesses special geometrical and logical properties due to its construction based upon rectangular field which contain either title strings or data strings related to some other titles. In this paper, we propose a knowledge-based approach to analyze the logical relationship among the fields. After extracting the lines and fields of an archive document image, this procedure can identify fields as the title fields, the sub-title fields (if there exist such tree-structure logical relationship), and the corresponding data fields. This proposed approach enables us to achieve a better performance in information manipulation of archive documents.\",\"PeriodicalId\":273519,\"journal\":{\"name\":\"Proceedings of 3rd International Conference on Document Analysis and Recognition\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of 3rd International Conference on Document Analysis and Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.1995.601957\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 3rd International Conference on Document Analysis and Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.1995.601957","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A knowledge-based approach to Chinese archive document understanding
The Chinese archive document possesses special geometrical and logical properties due to its construction based upon rectangular field which contain either title strings or data strings related to some other titles. In this paper, we propose a knowledge-based approach to analyze the logical relationship among the fields. After extracting the lines and fields of an archive document image, this procedure can identify fields as the title fields, the sub-title fields (if there exist such tree-structure logical relationship), and the corresponding data fields. This proposed approach enables us to achieve a better performance in information manipulation of archive documents.