{"title":"Text Mining for User Query Analysis - A 5-Step Method for Cultural Heritage Institutions","authors":"Anne Chardonnens, Simon Hengchen","doi":"10.18452/1443","DOIUrl":null,"url":null,"abstract":"The recent development of Web Analytics offers new perspectives to libraries, archives and museums to improve their knowledge of user needs and behaviours. In order to dive into the mind of their end users, institutions can explore queries from a digital catalogue. However, a manual exploration demands a major time commitment and only leads to limited results. This paper explores how text mining techniques can help automate the analysis of large volumes of log files. A 5-step methodology including clustering is illustrated by a case study from the State Archives of Belgium.","PeriodicalId":90875,"journal":{"name":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","volume":"36 1","pages":"177-190"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISI ... : ... IEEE Intelligence and Security Informatics. IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18452/1443","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The recent development of Web Analytics offers new perspectives to libraries, archives and museums to improve their knowledge of user needs and behaviours. In order to dive into the mind of their end users, institutions can explore queries from a digital catalogue. However, a manual exploration demands a major time commitment and only leads to limited results. This paper explores how text mining techniques can help automate the analysis of large volumes of log files. A 5-step methodology including clustering is illustrated by a case study from the State Archives of Belgium.
Web Analytics的最新发展为图书馆、档案馆和博物馆提供了新的视角,以提高他们对用户需求和行为的了解。为了深入了解最终用户的想法,机构可以探索来自数字目录的查询。然而,手工探索需要花费大量的时间,并且只能产生有限的结果。本文探讨了文本挖掘技术如何帮助对大量日志文件进行自动化分析。比利时国家档案馆的一个案例研究说明了包括聚类在内的五步方法。