理解手写文本识别技术在遗产环境中的应用:Transkribus已发表研究的系统综述

IF 1.4 Q2 INFORMATION SCIENCE & LIBRARY SCIENCE ARCHIVAL SCIENCE Pub Date : 2022-06-17 DOI:10.1007/s10502-022-09397-0
Joe Nockels, Paul Gooding, Sarah Ames, Melissa Terras
{"title":"理解手写文本识别技术在遗产环境中的应用:Transkribus已发表研究的系统综述","authors":"Joe Nockels,&nbsp;Paul Gooding,&nbsp;Sarah Ames,&nbsp;Melissa Terras","doi":"10.1007/s10502-022-09397-0","DOIUrl":null,"url":null,"abstract":"<div><p>Handwritten Text Recognition (HTR) technology is now a mature machine learning tool, becoming integrated in the digitisation processes of libraries and archives, speeding up the transcription of primary sources and facilitating full text searching and analysis of historic texts at scale. However, research into how HTR is changing our information environment is scant. This paper presents a systematic literature review regarding how researchers are using one particular HTR platform, Transkribus, to indicate the domains where HTR is applied, the approach taken, and how the technology is understood. 381 papers from 2015 to 2020 were gathered from Google Scholar, Scopus, and Web of Science, then grouped and coded into categories using quantitative and qualitative approaches. Published research that mentions Transkribus is international and rapidly growing. Transkribus features primarily in archival and library science publications, while a long tail of broad and eclectic disciplines, including history, computer science, citizen science, law and education, demonstrate the wider applicability of the tool. The most common paper categories were <i>humanities applications</i> (67%), <i>technological (25%), users</i> (5%) and <i>tutorials (3%)</i>. This paper presents the first overarching review of HTR as featured in published research, while also elucidating how HTR is affecting the information environment.</p></div>","PeriodicalId":46131,"journal":{"name":"ARCHIVAL SCIENCE","volume":"22 3","pages":"367 - 392"},"PeriodicalIF":1.4000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10502-022-09397-0.pdf","citationCount":"8","resultStr":"{\"title\":\"Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research\",\"authors\":\"Joe Nockels,&nbsp;Paul Gooding,&nbsp;Sarah Ames,&nbsp;Melissa Terras\",\"doi\":\"10.1007/s10502-022-09397-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Handwritten Text Recognition (HTR) technology is now a mature machine learning tool, becoming integrated in the digitisation processes of libraries and archives, speeding up the transcription of primary sources and facilitating full text searching and analysis of historic texts at scale. However, research into how HTR is changing our information environment is scant. This paper presents a systematic literature review regarding how researchers are using one particular HTR platform, Transkribus, to indicate the domains where HTR is applied, the approach taken, and how the technology is understood. 381 papers from 2015 to 2020 were gathered from Google Scholar, Scopus, and Web of Science, then grouped and coded into categories using quantitative and qualitative approaches. Published research that mentions Transkribus is international and rapidly growing. Transkribus features primarily in archival and library science publications, while a long tail of broad and eclectic disciplines, including history, computer science, citizen science, law and education, demonstrate the wider applicability of the tool. The most common paper categories were <i>humanities applications</i> (67%), <i>technological (25%), users</i> (5%) and <i>tutorials (3%)</i>. This paper presents the first overarching review of HTR as featured in published research, while also elucidating how HTR is affecting the information environment.</p></div>\",\"PeriodicalId\":46131,\"journal\":{\"name\":\"ARCHIVAL SCIENCE\",\"volume\":\"22 3\",\"pages\":\"367 - 392\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10502-022-09397-0.pdf\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ARCHIVAL SCIENCE\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10502-022-09397-0\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ARCHIVAL SCIENCE","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s10502-022-09397-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 8

摘要

手写文本识别(HTR)技术现在是一种成熟的机器学习工具,正在集成到图书馆和档案馆的数字化过程中,加快了主要来源的转录,并促进了大规模历史文本的全文搜索和分析。然而,关于HTR如何改变我们的信息环境的研究却很少。本文对研究人员如何使用一个特定的HTR平台Transkribus进行了系统的文献综述,以指出HTR的应用领域、所采取的方法以及如何理解该技术。2015年至2020年的381篇论文来自谷歌学者、Scopus和科学网,然后使用定量和定性方法进行分组和编码。已发表的关于Transkribus的研究是国际性的,并且正在迅速发展。Transkribus主要出现在档案学和图书馆学出版物中,而历史、计算机科学、公民科学、法律和教育等广泛而兼收并蓄的学科的长尾表明了该工具的更广泛适用性。最常见的论文类别是人文应用(67%)、技术(25%)、用户(5%)和教程(3%)。本文首次对已发表的研究中的HTR进行了全面综述,同时也阐明了HTR如何影响信息环境。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research

Handwritten Text Recognition (HTR) technology is now a mature machine learning tool, becoming integrated in the digitisation processes of libraries and archives, speeding up the transcription of primary sources and facilitating full text searching and analysis of historic texts at scale. However, research into how HTR is changing our information environment is scant. This paper presents a systematic literature review regarding how researchers are using one particular HTR platform, Transkribus, to indicate the domains where HTR is applied, the approach taken, and how the technology is understood. 381 papers from 2015 to 2020 were gathered from Google Scholar, Scopus, and Web of Science, then grouped and coded into categories using quantitative and qualitative approaches. Published research that mentions Transkribus is international and rapidly growing. Transkribus features primarily in archival and library science publications, while a long tail of broad and eclectic disciplines, including history, computer science, citizen science, law and education, demonstrate the wider applicability of the tool. The most common paper categories were humanities applications (67%), technological (25%), users (5%) and tutorials (3%). This paper presents the first overarching review of HTR as featured in published research, while also elucidating how HTR is affecting the information environment.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ARCHIVAL SCIENCE
ARCHIVAL SCIENCE INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
2.70
自引率
18.20%
发文量
26
期刊介绍: Archival Science promotes the development of archival science as an autonomous scientific discipline. The journal covers all aspects of archival science theory, methodology, and practice. Moreover, it investigates different cultural approaches to creation, management and provision of access to archives, records, and data. It also seeks to promote the exchange and comparison of concepts, views and attitudes related to recordkeeping issues around the world.Archival Science''s approach is integrated, interdisciplinary, and intercultural. Its scope encompasses the entire field of recorded process-related information, analyzed in terms of form, structure, and context. To meet its objectives, the journal draws from scientific disciplines that deal with the function of records and the way they are created, preserved, and retrieved; the context in which information is generated, managed, and used; and the social and cultural environment of records creation at different times and places.Covers all aspects of archival science theory, methodology, and practiceInvestigates different cultural approaches to creation, management and provision of access to archives, records, and dataPromotes the exchange and comparison of concepts, views, and attitudes related to recordkeeping issues around the worldAddresses the entire field of recorded process-related information, analyzed in terms of form, structure, and context
期刊最新文献
An archival world turns: Armenian women’s archives in Southeast Michigan Seventy years of strenuous efforts: tracing the development of archival higher education in China (1952–2022) Dedication and introduction to the provenance special issue Kindred contexts: archives, archaeology, and the concept of provenance The power of provenance in the records continuum
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1