{"title":"Global schema as local data integrator using active learning to identify candidates attributes","authors":"Clóvis Santos, Carina Dorneles","doi":"10.1504/ijams.2023.134427","DOIUrl":null,"url":null,"abstract":"Data integration represents a challenge in application development. Although there are several alternatives to data integration, such as federated and distributed databases, there are still problems with the standardisation of distinct data sources, and this happens because different companies develop distinct systems with different paradigms and concepts. In this paper, we present a case study, in the agriculture and environment domain, of an essential point in the data integration domain which is to show resources to identify nearby attributes concerning the characteristics of the content foreseen in the requirements presented in the proposed schema. Information technology experts in agribusiness help map the most relevant attributes for the investigated scenario. In our experimental tests, we used a quantitative method data analysis approach to validate the results with quantitative comparisons regarding the percentages of proximity between the attribute contents in the databases. Our proposal presents an alternative to simplify data integration without intermediate application or middleware layers. The results were measured on a scale between 0% and 100% to identify candidate attributes. The results were good in identifying attributes in the databases in almost 67% of the cases.","PeriodicalId":38716,"journal":{"name":"International Journal of Applied Management Science","volume":null,"pages":null},"PeriodicalIF":0.3000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Management Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/ijams.2023.134427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MANAGEMENT","Score":null,"Total":0}
引用次数: 0
Abstract
Data integration represents a challenge in application development. Although there are several alternatives to data integration, such as federated and distributed databases, there are still problems with the standardisation of distinct data sources, and this happens because different companies develop distinct systems with different paradigms and concepts. In this paper, we present a case study, in the agriculture and environment domain, of an essential point in the data integration domain which is to show resources to identify nearby attributes concerning the characteristics of the content foreseen in the requirements presented in the proposed schema. Information technology experts in agribusiness help map the most relevant attributes for the investigated scenario. In our experimental tests, we used a quantitative method data analysis approach to validate the results with quantitative comparisons regarding the percentages of proximity between the attribute contents in the databases. Our proposal presents an alternative to simplify data integration without intermediate application or middleware layers. The results were measured on a scale between 0% and 100% to identify candidate attributes. The results were good in identifying attributes in the databases in almost 67% of the cases.