Towards Building a High Performance Spatial Query System for Large Scale Medical Imaging Data.

Ablimit Aji, Fusheng Wang, Joel H Saltz
{"title":"Towards Building a High Performance Spatial Query System for Large Scale Medical Imaging Data.","authors":"Ablimit Aji, Fusheng Wang, Joel H Saltz","doi":"10.1145/2424321.2424361","DOIUrl":null,"url":null,"abstract":"<p><p>Support of high performance queries on large volumes of scientific spatial data is becoming increasingly important in many applications. This growth is driven by not only geospatial problems in numerous fields, but also emerging scientific applications that are increasingly data- and compute-intensive. For example, digital pathology imaging has become an emerging field during the past decade, where examination of high resolution images of human tissue specimens enables more effective diagnosis, prediction and treatment of diseases. Systematic analysis of large-scale pathology images generates tremendous amounts of spatially derived quantifications of micro-anatomic objects, such as nuclei, blood vessels, and tissue regions. Analytical pathology imaging provides high potential to support image based computer aided diagnosis. One major requirement for this is effective <i>querying</i> of such enormous amount of data with fast response, which is faced with two major challenges: the \"big data\" challenge and the high computation complexity. In this paper, we present our work towards building a high performance spatial query system for querying massive spatial data on MapReduce. Our framework takes an on demand index building approach for processing spatial queries and a partition-merge approach for building parallel spatial query pipelines, which fits nicely with the computing model of MapReduce. We demonstrate our framework on supporting multi-way spatial joins for algorithm evaluation and nearest neighbor queries for microanatomic objects. To reduce query response time, we propose cost based query optimization to mitigate the effect of data skew. Our experiments show that the framework can efficiently support complex analytical spatial queries on MapReduce.</p>","PeriodicalId":90295,"journal":{"name":"Proceedings of the ... ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems : ACM GIS. ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3909999/pdf/nihms480782.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems : ACM GIS. ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2424321.2424361","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Support of high performance queries on large volumes of scientific spatial data is becoming increasingly important in many applications. This growth is driven by not only geospatial problems in numerous fields, but also emerging scientific applications that are increasingly data- and compute-intensive. For example, digital pathology imaging has become an emerging field during the past decade, where examination of high resolution images of human tissue specimens enables more effective diagnosis, prediction and treatment of diseases. Systematic analysis of large-scale pathology images generates tremendous amounts of spatially derived quantifications of micro-anatomic objects, such as nuclei, blood vessels, and tissue regions. Analytical pathology imaging provides high potential to support image based computer aided diagnosis. One major requirement for this is effective querying of such enormous amount of data with fast response, which is faced with two major challenges: the "big data" challenge and the high computation complexity. In this paper, we present our work towards building a high performance spatial query system for querying massive spatial data on MapReduce. Our framework takes an on demand index building approach for processing spatial queries and a partition-merge approach for building parallel spatial query pipelines, which fits nicely with the computing model of MapReduce. We demonstrate our framework on supporting multi-way spatial joins for algorithm evaluation and nearest neighbor queries for microanatomic objects. To reduce query response time, we propose cost based query optimization to mitigate the effect of data skew. Our experiments show that the framework can efficiently support complex analytical spatial queries on MapReduce.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
为大规模医学影像数据建立高性能空间查询系统。
在许多应用中,支持对大量科学空间数据进行高性能查询正变得越来越重要。推动这一增长的不仅有众多领域的地理空间问题,还有数据和计算日益密集的新兴科学应用。例如,数字病理成像在过去十年中已成为一个新兴领域,通过对人体组织标本的高分辨率图像进行检查,可以更有效地诊断、预测和治疗疾病。对大规模病理图像进行系统分析,可生成大量微观原子对象(如细胞核、血管和组织区域)的空间衍生量化数据。病理成像分析为支持基于图像的计算机辅助诊断提供了巨大潜力。其主要要求之一是对如此巨大的数据量进行有效查询并快速响应,这面临着两大挑战:"大数据 "挑战和高计算复杂性。本文介绍了我们为在 MapReduce 上查询海量空间数据而构建高性能空间查询系统所做的工作。我们的框架采用按需构建索引的方法来处理空间查询,并采用分区-合并的方法来构建并行空间查询管道,这与 MapReduce 的计算模型非常契合。我们在支持算法评估的多向空间连接和微观原子对象的近邻查询上演示了我们的框架。为了缩短查询响应时间,我们提出了基于成本的查询优化,以减轻数据偏斜的影响。我们的实验表明,该框架可以在 MapReduce 上高效地支持复杂的分析性空间查询。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Hide Your Distance: Privacy Risks and Protection in Spatial Accessibility Analysis. GPU-based Real-time Contact Tracing at Scale. Mining Public Datasets for Modeling Intra-City PM2.5 Concentrations at a Fine Spatial Resolution. SparkGIS: Resource Aware Efficient In-Memory Spatial Query Processing. iSPEED: an Efficient In-Memory Based Spatial Query System for Large-Scale 3D Data with Complex Structures.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1