Keramat Hassani, R. Roustaei, H. Zafari, E. Zohrevandi, M. Shiri, O. M. Talab
{"title":"An Approach to Tracking Data Lineage in Mediator Based Information Integration Systems","authors":"Keramat Hassani, R. Roustaei, H. Zafari, E. Zohrevandi, M. Shiri, O. M. Talab","doi":"10.1109/ICIME.2009.75","DOIUrl":null,"url":null,"abstract":"The problem of providing explanation for a query answer is referred to as lineage tracing. This problem has been studied extensively in data warehouse systems, but for mediator-based systems, this is identified as a research problem. In such a system, the mediator does not store data. This means for query processing as well as for tracing, the mediator has to communicate with the data sources. which this communication could be expensive or impossible. so To resolve this, we clearly define forward lineage tracing and show its properties. We propose a tracing method computes data lineage without storing any data and effectively supports aggregation and variable granularity lineage. And we illustrate that our method is more efficient than methods that compute the lineage by executing the reverse query.","PeriodicalId":445284,"journal":{"name":"2009 International Conference on Information Management and Engineering","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Information Management and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIME.2009.75","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The problem of providing explanation for a query answer is referred to as lineage tracing. This problem has been studied extensively in data warehouse systems, but for mediator-based systems, this is identified as a research problem. In such a system, the mediator does not store data. This means for query processing as well as for tracing, the mediator has to communicate with the data sources. which this communication could be expensive or impossible. so To resolve this, we clearly define forward lineage tracing and show its properties. We propose a tracing method computes data lineage without storing any data and effectively supports aggregation and variable granularity lineage. And we illustrate that our method is more efficient than methods that compute the lineage by executing the reverse query.