Reading the ransom: Methodological advancements in extracting the Swedish Wealth Tax of 1571

IF 2.6 1区 历史学 Q1 ECONOMICS Explorations in Economic History Pub Date : 2023-01-01 DOI:10.1016/j.eeh.2022.101470
Christopher Blomqvist , Kerstin Enflo , Andreas Jakobsson , Kalle Åström
{"title":"Reading the ransom: Methodological advancements in extracting the Swedish Wealth Tax of 1571","authors":"Christopher Blomqvist ,&nbsp;Kerstin Enflo ,&nbsp;Andreas Jakobsson ,&nbsp;Kalle Åström","doi":"10.1016/j.eeh.2022.101470","DOIUrl":null,"url":null,"abstract":"<div><p>We describe a deep learning method to read hand-written records from the 16th century. The method consists of a combination of a segmentation module and a Handwritten Text Recognition (HTR) module. The transformer-based HTR module exploits both language and image features in reading, classifying and extracting the position of each word on the page. The method is demonstrated on a unique historical document: The Swedish Wealth Tax of 1571. Results suggest that the segmentation module performs significantly better than the lay-out analysis implemented in state-of-the art programs, enabling us to trace many more text blocks correctly on each page. The HTR module has a low character error rate (CER), in addition to being able to classify words and help organize them into tabular formats. By demonstrating an automated process to transform loosely structured handwritten information from the 16th century into organized tables, our method should interest economic historians seeking to digitize and organize quantitative material from pre-industrial periods.</p></div>","PeriodicalId":47413,"journal":{"name":"Explorations in Economic History","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Explorations in Economic History","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0014498322000481","RegionNum":1,"RegionCategory":"历史学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

Abstract

We describe a deep learning method to read hand-written records from the 16th century. The method consists of a combination of a segmentation module and a Handwritten Text Recognition (HTR) module. The transformer-based HTR module exploits both language and image features in reading, classifying and extracting the position of each word on the page. The method is demonstrated on a unique historical document: The Swedish Wealth Tax of 1571. Results suggest that the segmentation module performs significantly better than the lay-out analysis implemented in state-of-the art programs, enabling us to trace many more text blocks correctly on each page. The HTR module has a low character error rate (CER), in addition to being able to classify words and help organize them into tabular formats. By demonstrating an automated process to transform loosely structured handwritten information from the 16th century into organized tables, our method should interest economic historians seeking to digitize and organize quantitative material from pre-industrial periods.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
解读赎金:1571年瑞典财产税征收方法的进步
我们描述了一种深度学习方法来读取16世纪的手写记录。该方法由分割模块和手写文本识别(HTR)模块组成。基于转换器的HTR模块在阅读中利用语言和图像的特征,分类和提取每个单词在页面上的位置。该方法在一份独特的历史文件上得到了证明:1571年的瑞典财富税。结果表明,分割模块的性能明显优于在最先进的程序中实现的布局分析,使我们能够在每个页面上正确地跟踪更多的文本块。HTR模块除了能够对单词进行分类并帮助将它们组织成表格格式外,还具有较低的字符错误率(CER)。通过演示将16世纪松散结构的手写信息转换为有组织表格的自动化过程,我们的方法应该会引起寻求数字化和组织前工业时期定量材料的经济历史学家的兴趣。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.50
自引率
8.70%
发文量
27
期刊介绍: Explorations in Economic History provides broad coverage of the application of economic analysis to historical episodes. The journal has a tradition of innovative applications of theory and quantitative techniques, and it explores all aspects of economic change, all historical periods, all geographical locations, and all political and social systems. The journal includes papers by economists, economic historians, demographers, geographers, and sociologists. Explorations in Economic History is the only journal where you will find "Essays in Exploration." This unique department alerts economic historians to the potential in a new area of research, surveying the recent literature and then identifying the most promising issues to pursue.
期刊最新文献
Access to kin, economic stress, and late-life mortality in North Orkney, Scotland, 1851–1911 Corporations and partnerships: Factory productivity in late Imperial Russia Ethnic wealth inequality in England and Wales, 1858–2018 Fertility responses to short-term economic stress: Price volatility and wealth shocks in a pre-transitional settler colony Transportation, decentralization, and path dependence: How did the old tramway shape Shanghai, China?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1