Yue Wang, Ming Wen, Xiao Zhou, Feng Gao, Shuai Tian, Dan Jue, Hongmei Lu, Zhimin Zhang
{"title":"Automatic damage identification of Sanskrit palm leaf manuscripts with SegFormer","authors":"Yue Wang, Ming Wen, Xiao Zhou, Feng Gao, Shuai Tian, Dan Jue, Hongmei Lu, Zhimin Zhang","doi":"10.1186/s40494-023-01125-w","DOIUrl":null,"url":null,"abstract":"<p>Palm leaf manuscripts (PLMs) are of great importance in recording Buddhist Scriptures, medicine, history, philosophy, etc. Some damages occur during the use, spread, and preservation procedure. The comprehensive investigation of Sanskrit PLMs is a prerequisite for further conservation and restoration. However, current damage identification and investigation are carried out manually. They require strong professional skills and are extraordinarily time-consuming. In this study, PLM-SegFormer is developed to provide an automated damage segmentation for Sanskrit PLMs based on the SegFormer architecture. Firstly, a digital image dataset of Sanskrit PLMs (the PLM dataset) was obtained from the Potala Palace in Tibet. Then, the hyperparameters for pre-processing, model training, prediction, and post-processing phases were fully optimized to make the SegFormer model more suitable for the PLM damage segmentation task. The optimized segmentation model reaches 70.1% mHit and 51.2% mIoU. The proposed framework automates the damage segmentation of 10,064 folios of PLMs within 12 h. The PLM-SegFormer framework will facilitate the preservation state survey and record of the Palm-leaf manuscript and be of great value to the subsequent preservation and restoration. The source code is available at https://github.com/Ryan21wy/PLM_SegFormer.</p>","PeriodicalId":13109,"journal":{"name":"Heritage Science","volume":"72 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Heritage Science","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s40494-023-01125-w","RegionNum":1,"RegionCategory":"艺术学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Palm leaf manuscripts (PLMs) are of great importance in recording Buddhist Scriptures, medicine, history, philosophy, etc. Some damages occur during the use, spread, and preservation procedure. The comprehensive investigation of Sanskrit PLMs is a prerequisite for further conservation and restoration. However, current damage identification and investigation are carried out manually. They require strong professional skills and are extraordinarily time-consuming. In this study, PLM-SegFormer is developed to provide an automated damage segmentation for Sanskrit PLMs based on the SegFormer architecture. Firstly, a digital image dataset of Sanskrit PLMs (the PLM dataset) was obtained from the Potala Palace in Tibet. Then, the hyperparameters for pre-processing, model training, prediction, and post-processing phases were fully optimized to make the SegFormer model more suitable for the PLM damage segmentation task. The optimized segmentation model reaches 70.1% mHit and 51.2% mIoU. The proposed framework automates the damage segmentation of 10,064 folios of PLMs within 12 h. The PLM-SegFormer framework will facilitate the preservation state survey and record of the Palm-leaf manuscript and be of great value to the subsequent preservation and restoration. The source code is available at https://github.com/Ryan21wy/PLM_SegFormer.
期刊介绍:
Heritage Science is an open access journal publishing original peer-reviewed research covering:
Understanding of the manufacturing processes, provenances, and environmental contexts of material types, objects, and buildings, of cultural significance including their historical significance.
Understanding and prediction of physico-chemical and biological degradation processes of cultural artefacts, including climate change, and predictive heritage studies.
Development and application of analytical and imaging methods or equipments for non-invasive, non-destructive or portable analysis of artwork and objects of cultural significance to identify component materials, degradation products and deterioration markers.
Development and application of invasive and destructive methods for understanding the provenance of objects of cultural significance.
Development and critical assessment of treatment materials and methods for artwork and objects of cultural significance.
Development and application of statistical methods and algorithms for data analysis to further understanding of culturally significant objects.
Publication of reference and corpus datasets as supplementary information to the statistical and analytical studies above.
Description of novel technologies that can assist in the understanding of cultural heritage.