A Hybrid Deep Learning Approach to Keyword Spotting in Vietnamese Stele Images

IF 2.8 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Informatica Pub Date : 2023-08-29 DOI:10.31449/inf.v47i3.4785

Anna Scius-Bertrand, Marc Bui, Andreas Fischer

引用次数: 0

Abstract

In order to access the rich cultural heritage conveyed in Vietnamese steles, automatic reading of stone engravings would be a great support for historians, who are analyzing tens of thousands of stele images. Approaching the challenging problem with deep learning alone is difficult because the data-driven models require large representative datasets with expert human annotations, which are not available for the steles and costly to obtain. In this article, we present a hybrid approach to spot keywords in stele images that combines data-driven deep learning with knowledge-based structural modeling and matching of Chu Nom characters. The main advantage of the proposed method is that it is annotation-free, i.e. no human data annotation is required. In an experimental evaluation, we demonstrate that keywords can be successfully spotted with a mean average precision of more than 70% when a single engraving style is considered.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

越南石碑图像关键字识别的混合深度学习方法

为了了解越南石碑所传达的丰富文化遗产，自动读取石刻将成为分析数万个石碑图像的历史学家的巨大支持。仅用深度学习来解决具有挑战性的问题是困难的，因为数据驱动的模型需要具有专家注释的大型代表性数据集，而这些数据集对于石碑来说是不可用的，并且获取成本很高。在本文中，我们提出了一种结合数据驱动的深度学习和基于知识的结构建模和Chu Nom字符匹配的混合方法来识别石碑图像中的关键词。该方法的主要优点是无需注释，即不需要人工数据注释。在实验评估中，我们证明了当考虑单一雕刻风格时，关键词可以成功地识别，平均精度超过70%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Informatica 工程技术-计算机：信息系统

CiteScore

5.90

自引率

6.90%

发文量

审稿时长

12 months

期刊介绍： The quarterly journal Informatica provides an international forum for high-quality original research and publishes papers on mathematical simulation and optimization, recognition and control, programming theory and systems, automation systems and elements. Informatica provides a multidisciplinary forum for scientists and engineers involved in research and design including experts who implement and manage information systems applications.