Huili Su, Yukun Li, Xiaoye Wang, Gang Hao, Yongxuan Lai, Weiwei Wang
{"title":"Transforming a Nonstandard Table into Formalized Tables","authors":"Huili Su, Yukun Li, Xiaoye Wang, Gang Hao, Yongxuan Lai, Weiwei Wang","doi":"10.1109/WISA.2017.38","DOIUrl":null,"url":null,"abstract":"Tables and spreadsheets on the Internet often contain valuable information, but are created by people who have different individuation. As a result, the similar data are often issued with different structures. This limits the integration of such tables. This paper aims to overcome this problem by automatically analyzing the structure area and propose the method transforming the tables into formal relational tables. We propose the methods on identifying structure area, modeling the table structure based on tree and methods to generate the 1NF schema of the original table. We proved the correctness of the method in semantic and the experiment results with tables from different areas demonstrate the effectiveness of our method.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th Web Information Systems and Applications Conference (WISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2017.38","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Tables and spreadsheets on the Internet often contain valuable information, but are created by people who have different individuation. As a result, the similar data are often issued with different structures. This limits the integration of such tables. This paper aims to overcome this problem by automatically analyzing the structure area and propose the method transforming the tables into formal relational tables. We propose the methods on identifying structure area, modeling the table structure based on tree and methods to generate the 1NF schema of the original table. We proved the correctness of the method in semantic and the experiment results with tables from different areas demonstrate the effectiveness of our method.