{"title":"深网:编织来捕捉中间地带","authors":"Wensheng Wu","doi":"10.1145/2512405.2512408","DOIUrl":null,"url":null,"abstract":"The massive and diverse data sources on the Deep Web presents a serious data integration challenge. Existing virtual integration approaches suffer from slow query response, while surfacing approaches demand hefty storage space and incur huge costs in maintaining data freshness. We propose a novel hybrid integration approach that strikes a balance between the virtual and surfacing approaches. The key idea is to capture user needs in query templates and focus the integration efforts on the templates. However, realizing this approach requires innovations in template-driven query planning, query parsing, and template discovery. We elaborate on these challenges and propose our solutions.","PeriodicalId":266349,"journal":{"name":"Web-KR '13","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"The deep web: woven to catch the middle ground\",\"authors\":\"Wensheng Wu\",\"doi\":\"10.1145/2512405.2512408\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The massive and diverse data sources on the Deep Web presents a serious data integration challenge. Existing virtual integration approaches suffer from slow query response, while surfacing approaches demand hefty storage space and incur huge costs in maintaining data freshness. We propose a novel hybrid integration approach that strikes a balance between the virtual and surfacing approaches. The key idea is to capture user needs in query templates and focus the integration efforts on the templates. However, realizing this approach requires innovations in template-driven query planning, query parsing, and template discovery. We elaborate on these challenges and propose our solutions.\",\"PeriodicalId\":266349,\"journal\":{\"name\":\"Web-KR '13\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Web-KR '13\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2512405.2512408\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Web-KR '13","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2512405.2512408","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The massive and diverse data sources on the Deep Web presents a serious data integration challenge. Existing virtual integration approaches suffer from slow query response, while surfacing approaches demand hefty storage space and incur huge costs in maintaining data freshness. We propose a novel hybrid integration approach that strikes a balance between the virtual and surfacing approaches. The key idea is to capture user needs in query templates and focus the integration efforts on the templates. However, realizing this approach requires innovations in template-driven query planning, query parsing, and template discovery. We elaborate on these challenges and propose our solutions.