Giuseppe De Gregorio, Simon Perrin, Rodrigo C. G. Pena, Isabelle Marthot-Santaniello, Harold Mouchère
{"title":"神经纸莎草纸:用于手写纸莎草纸检索的深度注意力嵌入网络","authors":"Giuseppe De Gregorio, Simon Perrin, Rodrigo C. G. Pena, Isabelle Marthot-Santaniello, Harold Mouchère","doi":"arxiv-2408.07785","DOIUrl":null,"url":null,"abstract":"The intersection of computer vision and machine learning has emerged as a\npromising avenue for advancing historical research, facilitating a more\nprofound exploration of our past. However, the application of machine learning\napproaches in historical palaeography is often met with criticism due to their\nperceived ``black box'' nature. In response to this challenge, we introduce\nNeuroPapyri, an innovative deep learning-based model specifically designed for\nthe analysis of images containing ancient Greek papyri. To address concerns\nrelated to transparency and interpretability, the model incorporates an\nattention mechanism. This attention mechanism not only enhances the model's\nperformance but also provides a visual representation of the image regions that\nsignificantly contribute to the decision-making process. Specifically\ncalibrated for processing images of papyrus documents with lines of handwritten\ntext, the model utilizes individual attention maps to inform the presence or\nabsence of specific characters in the input image. This paper presents the\nNeuroPapyri model, including its architecture and training methodology. Results\nfrom the evaluation demonstrate NeuroPapyri's efficacy in document retrieval,\nshowcasing its potential to advance the analysis of historical manuscripts.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"NeuroPapyri: A Deep Attention Embedding Network for Handwritten Papyri Retrieval\",\"authors\":\"Giuseppe De Gregorio, Simon Perrin, Rodrigo C. G. Pena, Isabelle Marthot-Santaniello, Harold Mouchère\",\"doi\":\"arxiv-2408.07785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The intersection of computer vision and machine learning has emerged as a\\npromising avenue for advancing historical research, facilitating a more\\nprofound exploration of our past. However, the application of machine learning\\napproaches in historical palaeography is often met with criticism due to their\\nperceived ``black box'' nature. In response to this challenge, we introduce\\nNeuroPapyri, an innovative deep learning-based model specifically designed for\\nthe analysis of images containing ancient Greek papyri. To address concerns\\nrelated to transparency and interpretability, the model incorporates an\\nattention mechanism. This attention mechanism not only enhances the model's\\nperformance but also provides a visual representation of the image regions that\\nsignificantly contribute to the decision-making process. Specifically\\ncalibrated for processing images of papyrus documents with lines of handwritten\\ntext, the model utilizes individual attention maps to inform the presence or\\nabsence of specific characters in the input image. This paper presents the\\nNeuroPapyri model, including its architecture and training methodology. Results\\nfrom the evaluation demonstrate NeuroPapyri's efficacy in document retrieval,\\nshowcasing its potential to advance the analysis of historical manuscripts.\",\"PeriodicalId\":501285,\"journal\":{\"name\":\"arXiv - CS - Digital Libraries\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Digital Libraries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.07785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.07785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
NeuroPapyri: A Deep Attention Embedding Network for Handwritten Papyri Retrieval
The intersection of computer vision and machine learning has emerged as a
promising avenue for advancing historical research, facilitating a more
profound exploration of our past. However, the application of machine learning
approaches in historical palaeography is often met with criticism due to their
perceived ``black box'' nature. In response to this challenge, we introduce
NeuroPapyri, an innovative deep learning-based model specifically designed for
the analysis of images containing ancient Greek papyri. To address concerns
related to transparency and interpretability, the model incorporates an
attention mechanism. This attention mechanism not only enhances the model's
performance but also provides a visual representation of the image regions that
significantly contribute to the decision-making process. Specifically
calibrated for processing images of papyrus documents with lines of handwritten
text, the model utilizes individual attention maps to inform the presence or
absence of specific characters in the input image. This paper presents the
NeuroPapyri model, including its architecture and training methodology. Results
from the evaluation demonstrate NeuroPapyri's efficacy in document retrieval,
showcasing its potential to advance the analysis of historical manuscripts.