首页 > 最新文献

Vldb Journal最新文献

英文 中文
SWOOP: top-k similarity joins over set streams. SWOOP:集合流上的top-k相似连接。
IF 2.8 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-01-01 Epub Date: 2024-12-23 DOI: 10.1007/s00778-024-00880-x
Willi Mann, Nikolaus Augsten, Christian S Jensen, Mateusz Pawlik

We provide efficient support for applications that aim to continuously find pairs of similar sets in rapid streams, such as Twitter streams that emit tweets as sets of words. Using a sliding window model, the top-k result changes as new sets enter the window or existing ones leave the window. Specifically, when a set arrives, it may form a new top-k result pair with any set already in the window. When a set leaves the window, all its pairings in the top-k result must be replaced with other pairs. It is therefore not sufficient to maintain the k most similar pairs since less similar pairs may become top-k pairs later. We propose SWOOP, a highly scalable stream join algorithm. Novel indexing techniques and sophisticated filters efficiently prune obsolete pairs as new sets enter the window. SWOOP incrementally maintains a provably minimal stock of similar pairs to update the top-k result at any time. Empirical studies confirm that SWOOP is able to support stream rates that are orders of magnitude faster than the rates supported by existing approaches.

我们为应用程序提供了高效的支持,这些应用程序的目标是在快速流中不断地找到相似的集合对,比如Twitter流,它以单词集的形式发出tweet。使用滑动窗口模型,当新集进入窗口或现有集离开窗口时,top-k结果会发生变化。具体来说,当一个集合到达时,它可能会与窗口中已经存在的任何集合形成一个新的top-k结果对。当一个集合离开窗口时,它在前k个结果中的所有配对都必须用其他配对替换。因此,维持k对最相似的配对是不够的,因为不太相似的配对可能会在以后成为前k对配对。我们提出了SWOOP,一个高度可扩展的流连接算法。新颖的索引技术和复杂的过滤器有效地修剪过时的对,因为新的集进入窗口。SWOOP增量地维护一个可证明的最小相似对库存,以便随时更新top-k结果。实证研究证实,SWOOP能够支持比现有方法支持的速率快几个数量级的流速率。
{"title":"SWOOP: top-k similarity joins over set streams.","authors":"Willi Mann, Nikolaus Augsten, Christian S Jensen, Mateusz Pawlik","doi":"10.1007/s00778-024-00880-x","DOIUrl":"10.1007/s00778-024-00880-x","url":null,"abstract":"<p><p>We provide efficient support for applications that aim to continuously find pairs of similar sets in rapid streams, such as Twitter streams that emit tweets as sets of words. Using a sliding window model, the top-<i>k</i> result changes as new sets enter the window or existing ones leave the window. Specifically, when a set arrives, it may form a new top-<i>k</i> result pair with any set already in the window. When a set leaves the window, all its pairings in the top-<i>k</i> result must be replaced with other pairs. It is therefore not sufficient to maintain the <i>k</i> most similar pairs since less similar pairs may become top-<i>k</i> pairs later. We propose SWOOP, a highly scalable stream join algorithm. Novel indexing techniques and sophisticated filters efficiently prune obsolete pairs as new sets enter the window. SWOOP incrementally maintains a provably minimal stock of similar pairs to update the top-<i>k</i> result at any time. Empirical studies confirm that SWOOP is able to support stream rates that are orders of magnitude faster than the rates supported by existing approaches.</p>","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":"34 1","pages":"13"},"PeriodicalIF":2.8,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11666680/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142899857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing RPQs over a compact graph representation 优化rpq在一个紧凑的图表示
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-09-07 DOI: 10.1007/s00778-023-00811-2
Diego Arroyuelo, Adrián Gómez-Brandón, A. Hogan, G. Navarro, J. Rojas-Ledesma
{"title":"Optimizing RPQs over a compact graph representation","authors":"Diego Arroyuelo, Adrián Gómez-Brandón, A. Hogan, G. Navarro, J. Rojas-Ledesma","doi":"10.1007/s00778-023-00811-2","DOIUrl":"https://doi.org/10.1007/s00778-023-00811-2","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44150829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cardinality estimation using normalizing flow 使用归一化流的基数估计
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-08-29 DOI: 10.1007/s00778-023-00808-x
Jiayi Wang, Chengliang Chai, Jiabin Liu, Guoliang Li
{"title":"Cardinality estimation using normalizing flow","authors":"Jiayi Wang, Chengliang Chai, Jiabin Liu, Guoliang Li","doi":"10.1007/s00778-023-00808-x","DOIUrl":"https://doi.org/10.1007/s00778-023-00808-x","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45641911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MinJoin++: a fast algorithm for string similarity joins under edit distance MinJoin++:一种编辑距离下字符串相似性联接的快速算法
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-08-21 DOI: 10.1007/s00778-023-00806-z
Nikolai Karpov, Haoyu Zhang, Qin Zhang
{"title":"MinJoin++: a fast algorithm for string similarity joins under edit distance","authors":"Nikolai Karpov, Haoyu Zhang, Qin Zhang","doi":"10.1007/s00778-023-00806-z","DOIUrl":"https://doi.org/10.1007/s00778-023-00806-z","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45719826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tabular data synthesis with generative adversarial networks: design space and optimizations 具有生成对抗性网络的表格数据合成:设计空间和优化
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-08-15 DOI: 10.1007/s00778-023-00807-y
Tongyu Liu, Ju Fan, Guoliang Li, Nan Tang, Xiaoyong Du
{"title":"Tabular data synthesis with generative adversarial networks: design space and optimizations","authors":"Tongyu Liu, Ju Fan, Guoliang Li, Nan Tang, Xiaoyong Du","doi":"10.1007/s00778-023-00807-y","DOIUrl":"https://doi.org/10.1007/s00778-023-00807-y","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43778051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Quantifying the competitiveness of a dataset in relation to general preferences 量化数据集相对于一般偏好的竞争力
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-08-08 DOI: 10.1007/s00778-023-00804-1
K. Mouratidis, Keming Li, Bo Tang
{"title":"Quantifying the competitiveness of a dataset in relation to general preferences","authors":"K. Mouratidis, Keming Li, Bo Tang","doi":"10.1007/s00778-023-00804-1","DOIUrl":"https://doi.org/10.1007/s00778-023-00804-1","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46573430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Accelerating directed densest subgraph queries with software and hardware approaches 用软件和硬件方法加速定向最密集子图查询
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-07-31 DOI: 10.1007/s00778-023-00805-0
Chenhao Ma, Yixiang Fang, Reynold Cheng, L. Lakshmanan, Xiaolin Han, Xiaodong Li
{"title":"Accelerating directed densest subgraph queries with software and hardware approaches","authors":"Chenhao Ma, Yixiang Fang, Reynold Cheng, L. Lakshmanan, Xiaolin Han, Xiaodong Li","doi":"10.1007/s00778-023-00805-0","DOIUrl":"https://doi.org/10.1007/s00778-023-00805-0","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":"38 9","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41306900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coalition-based task assignment with priority-aware fairness in spatial crowdsourcing 空间众包中基于联盟的优先级公平任务分配
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-07-07 DOI: 10.1007/s00778-023-00802-3
Yan Zhao, Kai Zheng, Ziwei Wang, Liwei Deng, B. Yang, T. Pedersen, Christian S. Jensen, Xiaofang Zhou
{"title":"Coalition-based task assignment with priority-aware fairness in spatial crowdsourcing","authors":"Yan Zhao, Kai Zheng, Ziwei Wang, Liwei Deng, B. Yang, T. Pedersen, Christian S. Jensen, Xiaofang Zhou","doi":"10.1007/s00778-023-00802-3","DOIUrl":"https://doi.org/10.1007/s00778-023-00802-3","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42035332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anytime bottom-up rule learning for large-scale knowledge graph completion 用于大规模知识图完成的任意时间自下而上的规则学习
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-06-16 DOI: 10.1007/s00778-023-00800-5
Christian Meilicke, M. Chekol, Patrick Betz, Manuel Fink, Heiner Stuckeschmidt
{"title":"Anytime bottom-up rule learning for large-scale knowledge graph completion","authors":"Christian Meilicke, M. Chekol, Patrick Betz, Manuel Fink, Heiner Stuckeschmidt","doi":"10.1007/s00778-023-00800-5","DOIUrl":"https://doi.org/10.1007/s00778-023-00800-5","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47152879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
BatchHL$$^{+}$$: batch dynamic labelling for distance queries on large-scale networks BatchHL $$^{+}$$:大规模网络上批量动态标注距离查询
IF 4.2 2区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2023-06-07 DOI: 10.1007/s00778-023-00799-9
Muhammad Farhan, Henning Koehler, Qing Wang
{"title":"BatchHL$$^{+}$$: batch dynamic labelling for distance queries on large-scale networks","authors":"Muhammad Farhan, Henning Koehler, Qing Wang","doi":"10.1007/s00778-023-00799-9","DOIUrl":"https://doi.org/10.1007/s00778-023-00799-9","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41900702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Vldb Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1