B. Allier, J. Duong, Antoine Gagneux, Pierre Mallet, H. Emptoz
Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
Published: 2003-08-03
DOI: 10.1109/ICDAR.2003.1227728 (https://doi.org/10.1109/ICDAR.2003.1227728)
Citations: 11
Texture feature characterization for logical pre-labeling
In this article we present a study of texture features for logical pre-labeling. Our aim is to compute a large number of texture features over three sets of machine-printed document images and to study their joint discriminant power using SVM classifiers. The three corpora we use are: the Archives of Savoie (AoS), composed of strongly structured documents; a subset of the UW3 database; and a third corpus that is not structured at all, since it is composed of Web site images. The originality of our contribution is to bring together various methods that have been used for many years in our domain and to test them on documents with very different characteristics.
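As an illustration of what computing texture features over a document image block can look like, here is a minimal sketch in Python. The three features (ink density, mean horizontal run length, transitions per row) and all function names are assumptions chosen for illustration. The abstract does not list the features actually used, so this should not be read as the authors' feature set. In a pipeline like the one described, vectors of this kind would be the input to an SVM classifier.

```python
from typing import List

# A binary image block: 1 = ink pixel, 0 = background (illustrative convention).
Block = List[List[int]]


def ink_density(block: Block) -> float:
    """Fraction of pixels in the block that are ink."""
    total = sum(len(row) for row in block)
    return sum(sum(row) for row in block) / total


def mean_horizontal_run_length(block: Block) -> float:
    """Average length of consecutive ink-pixel runs along rows."""
    runs = []
    for row in block:
        length = 0
        for px in row:
            if px:
                length += 1
            elif length:
                runs.append(length)
                length = 0
        if length:  # close a run that reaches the end of the row
            runs.append(length)
    return sum(runs) / len(runs) if runs else 0.0


def transitions_per_row(block: Block) -> float:
    """Average number of ink/background changes per row."""
    changes = sum(
        sum(1 for a, b in zip(row, row[1:]) if a != b) for row in block
    )
    return changes / len(block)


def texture_features(block: Block) -> List[float]:
    """Hypothetical three-dimensional feature vector for one block."""
    return [
        ink_density(block),
        mean_horizontal_run_length(block),
        transitions_per_row(block),
    ]


# Two toy blocks with the same ink density but different texture.
text_like = [
    [1, 0, 1, 0, 1, 0],
    [0, 1, 0, 1, 0, 1],
]
rule_like = [
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
]

print(texture_features(text_like))  # [0.5, 1.0, 5.0]
print(texture_features(rule_like))  # [0.5, 6.0, 0.0]
```

The two toy blocks share the same ink density yet differ sharply on the other two features, which is the sense in which several texture features can be jointly more discriminant than any one of them alone.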