HaFT: A handwritten Farsi text database

2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP) Pub Date : 2013-09-01 DOI:10.1109/IRANIANMVIP.2013.6779956

Reza Safabaksh, A. Ghanbarian, Golnaz Ghiasi

{"title":"HaFT: A handwritten Farsi text database","authors":"Reza Safabaksh, A. Ghanbarian, Golnaz Ghiasi","doi":"10.1109/IRANIANMVIP.2013.6779956","DOIUrl":null,"url":null,"abstract":"Standard databases provide for evaluation and comparison of various pattern recognition techniques by different researchers; thus they are essential for the advance of research. There are different handwritten databases in various languages, but there is not a large standard database of handwritten text for the evaluation of different algorithms for writer identification and verification in Farsi. This paper introduces a large handwritten Farsi text database called HaFT. The database contains 1800 gray scale images of unconstrained text written by 600 writers. Each participant gave three separate eight-line samples of his handwriting, each of which was written at a different time on a separate sheet. HaFT is presented in several versions each including different lengths of text and using identical or different writing instruments. A new measure, called CVM, is defined which effectively reflects the size of handwriting and thus the content volume of a given text image. This database is designed for training and testing Farsi writer identification and verification using handwritten text. In addition, the database can also be used in training and testing handwritten Farsi text segmentation and recognition algorithms. HaFT is available for research use.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRANIANMVIP.2013.6779956","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Standard databases provide for evaluation and comparison of various pattern recognition techniques by different researchers; thus they are essential for the advance of research. There are different handwritten databases in various languages, but there is not a large standard database of handwritten text for the evaluation of different algorithms for writer identification and verification in Farsi. This paper introduces a large handwritten Farsi text database called HaFT. The database contains 1800 gray scale images of unconstrained text written by 600 writers. Each participant gave three separate eight-line samples of his handwriting, each of which was written at a different time on a separate sheet. HaFT is presented in several versions each including different lengths of text and using identical or different writing instruments. A new measure, called CVM, is defined which effectively reflects the size of handwriting and thus the content volume of a given text image. This database is designed for training and testing Farsi writer identification and verification using handwritten text. In addition, the database can also be used in training and testing handwritten Farsi text segmentation and recognition algorithms. HaFT is available for research use.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一个手写的波斯语文本数据库

标准数据库提供了不同研究人员对各种模式识别技术的评价和比较;因此，它们对研究的进展至关重要。各种语言都有不同的手写数据库，但没有一个大型的标准手写文本数据库，用于评估波斯语写作者识别和验证的不同算法。本文介绍了一个名为HaFT的大型手写波斯语文本数据库。该数据库包含600位作者所写的1800张无约束文本的灰度图像。每个参与者提供了三个单独的八行笔迹样本，每一行都是在不同的时间写在一张单独的纸上。HaFT以几个版本呈现，每个版本包括不同长度的文本，并使用相同或不同的书写工具。定义了一种新的测量方法，称为CVM，它可以有效地反映笔迹的大小，从而反映给定文本图像的内容体积。这个数据库的目的是训练和测试波斯语作家识别和核查使用手写文本。此外，该数据库还可用于训练和测试手写波斯语文本分割和识别算法。HaFT可用于研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)

自引率

0.00%

发文量

期刊最新文献

Automated lung CT image segmentation using kernel mean shift analysis A simple and efficient approach for 3D model decomposition MRI image reconstruction via new K-space sampling scheme based on separable transform Fusion of SPECT and MRI images using back and fore ground information Real time occlusion handling using Kalman Filter and mean-shift