Measuring XML document similarity: a case study for evaluating information extraction systems
G. Canfora, L. Cerulo, R. Scognamiglio
10th International Symposium on Software Metrics, 2004. Proceedings.
Published: 2004-09-11
DOI: 10.1109/METRICS.2004.29 (https://doi.org/10.1109/METRICS.2004.29)
Citations: 6
Abstract
Measuring the similarity between trees, such as XML-structured information, plays an important role in many applications, particularly in evaluating the effectiveness of information extraction systems (IES). This paper reports our experience in evaluating IES in terms of both extraction effectiveness and adaptation effectiveness. The first part of the paper introduces a similarity measure between XML trees based on a common subtree detection algorithm; the second part presents a case study that applies the measure to evaluate a group of IES.
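
The abstract does not specify the measure itself, so the following is only an illustrative sketch of what a similarity score built on common subtrees could look like, not the authors' algorithm. It assumes a Dice-style score over the multiset of subtree signatures, treats child order as significant, and ignores attributes; the names `signatures` and `xml_similarity` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def signatures(elem, bag):
    """Record a canonical signature for every subtree rooted at or below elem.

    The signature is (tag, stripped text, ordered child signatures).
    Attributes and tail text are ignored (an assumption of this sketch).
    """
    text = (elem.text or "").strip()
    children = tuple(signatures(child, bag) for child in elem)
    sig = (elem.tag, text, children)
    bag[sig] += 1
    return sig


def xml_similarity(xml_a, xml_b):
    """Dice coefficient over the two multisets of subtree signatures.

    Returns 1.0 for identical trees and 0.0 when no subtree is shared.
    """
    bag_a, bag_b = Counter(), Counter()
    signatures(ET.fromstring(xml_a), bag_a)
    signatures(ET.fromstring(xml_b), bag_b)
    shared = sum((bag_a & bag_b).values())  # multiset intersection
    total = sum(bag_a.values()) + sum(bag_b.values())
    return 2 * shared / total


# Hypothetical reference record and the output of an extraction system.
expected = "<person><name>Ann</name><city>Rome</city></person>"
extracted = "<person><name>Ann</name><city>Milan</city></person>"

print(xml_similarity(expected, expected))   # identical documents
print(xml_similarity(expected, extracted))  # one of three subtrees shared
```

In an evaluation setting of the kind the abstract describes, a score like this could be computed between each system's XML output and a hand-built reference document; an exact-match signature is deliberately strict, so a single wrong leaf also invalidates every enclosing subtree.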