Land Map Image Dataset: Ground-Truth And Classification Using Visual And Textural Features

S. Mandal, Samit Biswas, A. Das, B. Chanda
{"title":"Land Map Image Dataset: Ground-Truth And Classification Using Visual And Textural Features","authors":"S. Mandal, Samit Biswas, A. Das, B. Chanda","doi":"10.1515/ipc-2015-0024","DOIUrl":null,"url":null,"abstract":"Abstract Research on document image analysis is actively pursued in the last few decades and services like OCR, vectorization of drawings/graphics and various types of form processing are very common. Handwritten documents, old historical documents and documents captured through camera are now being the subjects of active research. However, another very important type of paper document, namely the map document image processing research suffers due to the inherent complexities of the map document and also for nonavailability of benchmark public data-sets. This paper presents a new data-set, namely, the Land Map Image Database (LMIDb) that consists of a variety of land maps images (446 images at present and growing; scanned at 200/300 dpi in TIF format) and the corresponding ground-truth. Using semiautomatic tools non-text part of the images are deleted and the text-only ground-truth is also kept in the database. This paper also presents a classification strategy for map images using which the maps in the database are automatically classified into Political (Po), Physical (Ph), Resource (R) and Topographic (T) maps. The automatic classification of maps help indexing of the images in LMIDb for archival and easy retrieval of the right maps to get the appropriate geographical information. Classification accuracy is also tested on the proposed data-set and the result is encouraging.","PeriodicalId":271906,"journal":{"name":"Image Processing & Communications","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image Processing & Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/ipc-2015-0024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Abstract Research on document image analysis is actively pursued in the last few decades and services like OCR, vectorization of drawings/graphics and various types of form processing are very common. Handwritten documents, old historical documents and documents captured through camera are now being the subjects of active research. However, another very important type of paper document, namely the map document image processing research suffers due to the inherent complexities of the map document and also for nonavailability of benchmark public data-sets. This paper presents a new data-set, namely, the Land Map Image Database (LMIDb) that consists of a variety of land maps images (446 images at present and growing; scanned at 200/300 dpi in TIF format) and the corresponding ground-truth. Using semiautomatic tools non-text part of the images are deleted and the text-only ground-truth is also kept in the database. This paper also presents a classification strategy for map images using which the maps in the database are automatically classified into Political (Po), Physical (Ph), Resource (R) and Topographic (T) maps. The automatic classification of maps help indexing of the images in LMIDb for archival and easy retrieval of the right maps to get the appropriate geographical information. Classification accuracy is also tested on the proposed data-set and the result is encouraging.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
土地地图图像数据集:基于视觉和纹理特征的地面真实度和分类
摘要近几十年来,文档图像分析的研究得到了积极的发展,诸如OCR、绘图/图形矢量化和各种类型的表单处理等服务非常普遍。手写文件、古老的历史文件和通过相机拍摄的文件正在成为活跃的研究对象。然而,另一种非常重要的纸质文档,即地图文档图像处理研究,由于地图文档固有的复杂性以及基准公共数据集的不可用性而受到影响。本文提出了一种新的数据集,即土地地图图像数据库(LMIDb),该数据库由各种土地地图图像组成,目前有446幅图像,并在不断增长;以TIF格式扫描200/300 dpi)和相应的地基真相。使用半自动工具删除图像的非文本部分,并将仅文本的基本事实保留在数据库中。本文还提出了一种地图图像的分类策略,使用该策略,数据库中的地图将自动分类为政治(Po),物理(Ph),资源(R)和地形(T)地图。地图的自动分类有助于在LMIDb中对图像进行索引,以便存档,并方便地检索正确的地图,以获得适当的地理信息。在提出的数据集上对分类精度进行了测试,结果令人鼓舞。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Analysis of the Influence of Transmission Resources Control in Tree Structure Networks Effectiveness of BPSK Modulation With Peak Noise Avoidance Algorithm in Smart Street Lighting Communications Based on PLC Design of a Telemedical Vest for Sleep Disorder Diagnosis - A Preliminary Analysis Railway Sign Power Line As Transmission Medium for Narrowband PLC Identification and Assessment of Selected Handwritten Function Graphs Using Least Square Approximation Combined with General Hough Transform
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1