ACM International Workshop on Multimedia Databases最新文献_第2页

Indexing of variable length multi-attribute motion data 可变长度多属性运动数据的索引

ACM International Workshop on Multimedia Databases

Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032617

Chuanjun Li, G. Pradhan, Si-Qing Zheng, B. Prabhakaran

Haptic data such as 3D motion capture data and sign language animation data are new forms of multimedia data. The motion data is multi-attribute, and indexing of multi-attribute data is important for quickly pruning the majority of irrelevant motions in order to have real-time animation applications. Indexing of multi-attribute data has been attempted for data of a few attributes by using R-tree or its variants after dimensionality reduction. In this paper, we exploit the singular value decomposition (SVD) properties of multi-attribute motion data matrices to obtain one representative vector for each of the motion data matrices of dozens or hundreds of attributes. Based on this representative vector, we propose a simple and efficient interval-tree based index structure for indexing motion data with large amount of attributes. At each tree level, only one component of the query vector needs to be checked during searching, comparing to all the components of the query vector that should get involved if an R-tree or its variants are used for indexing. Searching time is independent of the number of pattern motions indexed by the tree, making the index structure well scalable to large data repositories. Experiments show that up to 91∼93% irrelevant motions can be pruned for a query with no false dismissals, and the query searching time is less than 30 μ s with the existence of motion variations.

三维动作捕捉数据、手语动画数据等触觉数据是多媒体数据的新形式。运动数据是多属性的，多属性数据的索引对于快速修剪大部分不相关的运动以实现实时动画应用非常重要。利用r树或r树的变体在降维后，尝试对少数属性的数据进行多属性数据索引。本文利用多属性运动数据矩阵的奇异值分解(SVD)特性，对包含数十个或数百个属性的运动数据矩阵分别获得一个代表向量。在此代表性向量的基础上，提出了一种简单高效的基于区间树的索引结构，用于索引具有大量属性的运动数据。在每个树级别，在搜索期间只需要检查查询向量的一个组件，而如果使用r树或其变体进行索引，则应该检查查询向量的所有组件。搜索时间与树索引的模式运动的数量无关，这使得索引结构可以很好地扩展到大型数据存储库。实验表明，在不存在假解散的情况下，查询可以修剪高达91 ~ 93%的不相关运动，并且在存在运动变化的情况下，查询搜索时间小于30 μ s。

{"title":"Indexing of variable length multi-attribute motion data","authors":"Chuanjun Li, G. Pradhan, Si-Qing Zheng, B. Prabhakaran","doi":"10.1145/1032604.1032617","DOIUrl":"https://doi.org/10.1145/1032604.1032617","url":null,"abstract":"Haptic data such as 3D motion capture data and sign language animation data are new forms of multimedia data. The motion data is multi-attribute, and indexing of multi-attribute data is important for quickly pruning the majority of irrelevant motions in order to have real-time animation applications. Indexing of multi-attribute data has been attempted for data of a few attributes by using R-tree or its variants after dimensionality reduction. In this paper, we exploit the singular value decomposition (SVD) properties of multi-attribute motion data matrices to obtain one representative vector for each of the motion data matrices of dozens or hundreds of attributes. Based on this representative vector, we propose a simple and efficient interval-tree based index structure for indexing motion data with large amount of attributes. At each tree level, only one component of the query vector needs to be checked during searching, comparing to all the components of the query vector that should get involved if an R-tree or its variants are used for indexing. Searching time is independent of the number of pattern motions indexed by the tree, making the index structure well scalable to large data repositories. Experiments show that up to 91∼93% irrelevant motions can be pruned for a query with no false dismissals, and the query searching time is less than 30 μ <i>s</i> with the existence of motion variations.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116416536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

Scheduling methods for broadcasting multiple continuous media data 广播多个连续媒体数据的调度方法

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951685

T. Yoshihisa, M. Tsukamoto, S. Nishio

Recently, various schemes for broadcasting continuous media data such as music or movies have been studied. These schemes reduce the waiting time for playing the data under the continuity condition, i.e., playing continuous media data without any intermittence until the end of the data. These schemes usually employ multiple channels to broadcast the data. However, most clients are not able to receive data from multiple channels concurrently. In this paper, we propose methods for reducing client waiting time for multiple data via a single channel. In our proposed methods, we divide each data into several segments and produce a schedule that includes the first segment of each data more frequent than the rest. By changing the number of segments according to the playback ratio, client waiting time is effectively reduced.

最近，人们研究了各种播放连续媒体数据(如音乐或电影)的方案。这些方案减少了连续条件下播放数据的等待时间，即连续播放媒体数据，没有任何间断，直到数据结束。这些方案通常采用多个通道来广播数据。然而，大多数客户端不能同时从多个通道接收数据。在本文中，我们提出了通过单一通道减少客户端对多个数据等待时间的方法。在我们提出的方法中，我们将每个数据分成几个部分，并生成一个时间表，其中每个数据的第一个部分比其他部分更频繁。通过根据播放比例改变段数，可以有效减少客户端等待时间。

引用次数: 3

Robust content-based image searches for copyright protection 鲁棒的基于内容的图像版权保护搜索

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951690

Sid-Ahmed Berrani, L. Amsaleg, P. Gros

This paper proposes a novel content-based image retrieval scheme for image copy identification. Its goal is to detect matches between a set of doubtful images and the ones stored in the database of the legal holders of the photographies. If an image was stolen and used to create a pirated copy, it tries to identify from which original image that copy was created. The image recognition scheme is based on local differential descriptors. Therefore, the matching process takes into account a large set of variations that might have been applied to stolen images in order to create pirated copies. The high cost and the complexity of this image recognition scheme requires a very efficient retrieval process since many individual queries must be executed before being able to construct the final result. This paper therefore proposes to use a novel search method that trades the precision of each individual search for reduced query execution time. This imprecision has only little impact on the overall recognition performance since the final result is a consolidation of many partial results. However, it dramatically accelerates queries. This result has then been corroborated by a theoretically study. Experiments show the efficiency and the robustness of the proposed scheme.

提出了一种新的基于内容的图像检索方案，用于图像副本识别。它的目标是检测一组可疑图像与存储在照片合法持有者数据库中的图像之间的匹配。如果图像被盗并用于创建盗版副本，它会尝试识别该副本是从哪个原始图像创建的。该图像识别方案基于局部微分描述符。因此，匹配过程要考虑到大量的变化，这些变化可能已经应用于被盗图像，以创建盗版副本。这种图像识别方案的高成本和复杂性要求非常高效的检索过程，因为在能够构造最终结果之前必须执行许多单独的查询。因此，本文提出了一种新的搜索方法，该方法以每个单独搜索的精度为代价来减少查询执行时间。这种不精确对整体识别性能的影响很小，因为最终结果是许多部分结果的整合。然而，它极大地加快了查询速度。这一结果随后被一项理论研究所证实。实验证明了该方法的有效性和鲁棒性。

{"title":"Robust content-based image searches for copyright protection","authors":"Sid-Ahmed Berrani, L. Amsaleg, P. Gros","doi":"10.1145/951676.951690","DOIUrl":"https://doi.org/10.1145/951676.951690","url":null,"abstract":"This paper proposes a novel content-based image retrieval scheme for image copy identification. Its goal is to detect matches between a set of doubtful images and the ones stored in the database of the legal holders of the photographies. If an image was stolen and used to create a pirated copy, it tries to identify from which original image that copy was created. The image recognition scheme is based on local differential descriptors. Therefore, the matching process takes into account a large set of variations that might have been applied to stolen images in order to create pirated copies. The high cost and the complexity of this image recognition scheme requires a very efficient retrieval process since many individual queries must be executed before being able to construct the final result. This paper therefore proposes to use a novel search method that trades the precision of each individual search for reduced query execution time. This imprecision has only little impact on the overall recognition performance since the final result is a consolidation of many partial results. However, it dramatically accelerates queries. This result has then been corroborated by a theoretically study. Experiments show the efficiency and the robustness of the proposed scheme.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131807760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 95

Mining spatio-temporal patterns and knowledge structures in multimedia collection 多媒体馆藏的时空模式与知识结构挖掘

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951677

Shih-Fu Chang

Detection and recognition of semantic events has been a major research challenge for multimedia indexing. An emerging direction in this field has been unsupervised discovery (mining) of patterns in spatial-temporal multimedia data. Patterns are recurrent, predictable occurrences of one or more entities that satisfy associative, statistical, or relational conditions. Patterns at the feature level may signify the occurrence of events (e.g., passing pedestrians). At the event level, patterns may represent multi-event transitions, e.g., play-break alternations in sports. Patterns in an annotated image collection may indicate collocations of related semantic concepts and perceptual clusters.Mining of patterns of different types at different levels offers rich benefits, including automatic discovery of salient events in a new domain, automatic alert generation from massive real-time data (such as surveillance data in a new environment), and discovery of novel event relationships.Many challenging issues emerge. What are the adequate representations and statistical models for patterns that may exist at different levels and different time scales? How to handle patterns that may have relatively sparse occurring frequencies? How do we evaluate the accuracy and quality of mining results given its unsupervised nature?In this talk, we will present results of our preliminary attempts in mining patterns in structured video sequences (such as sports and surveillance video) and large annotated image collections. Specifically, we will discuss the potential of statistical models like Hierarchical HMM for video mining, and the integrative exploration of electronic knowledge (such as WordNet) and statistical clustering for image knowledge mining.

语义事件的检测和识别一直是多媒体索引研究面临的主要挑战。该领域的一个新兴方向是对时空多媒体数据中的模式进行无监督发现(挖掘)。模式是满足关联、统计或关系条件的一个或多个实体反复出现、可预测的情况。特征级别的模式可能表示事件的发生(例如，经过的行人)。在事件层面上，模式可能代表多事件转换，例如，运动中的游戏-休息交替。注释图像集合中的模式可以指示相关语义概念和感知聚类的搭配。在不同层次上挖掘不同类型的模式提供了丰富的好处，包括自动发现新领域中的重要事件，从大量实时数据(例如新环境中的监视数据)自动生成警报，以及发现新的事件关系。许多具有挑战性的问题出现了。对于可能存在于不同层次和不同时间尺度的模式，什么是适当的表示和统计模型?如何处理出现频率相对稀疏的模式?鉴于其无监督性质，我们如何评估挖掘结果的准确性和质量?在这次演讲中，我们将展示我们在结构化视频序列(如体育和监控视频)和大型带注释的图像集合中挖掘模式的初步尝试的结果。具体来说，我们将讨论统计模型的潜力，如用于视频挖掘的分层HMM，以及用于图像知识挖掘的电子知识(如WordNet)和统计聚类的综合探索。

{"title":"Mining spatio-temporal patterns and knowledge structures in multimedia collection","authors":"Shih-Fu Chang","doi":"10.1145/951676.951677","DOIUrl":"https://doi.org/10.1145/951676.951677","url":null,"abstract":"Detection and recognition of semantic events has been a major research challenge for multimedia indexing. An emerging direction in this field has been <u>unsupervised discovery (mining) of patterns</u> in spatial-temporal multimedia data. Patterns are recurrent, predictable occurrences of one or more entities that satisfy associative, statistical, or relational conditions. Patterns at the feature level may signify the occurrence of events (e.g., passing pedestrians). At the event level, patterns may represent multi-event transitions, e.g., play-break alternations in sports. Patterns in an annotated image collection may indicate collocations of related semantic concepts and perceptual clusters.Mining of patterns of different types at different levels offers rich benefits, including automatic discovery of salient events in a new domain, automatic alert generation from massive real-time data (such as surveillance data in a new environment), and discovery of novel event relationships.Many challenging issues emerge. What are the adequate representations and statistical models for patterns that may exist at different levels and different time scales? How to handle patterns that may have relatively sparse occurring frequencies? How do we evaluate the accuracy and quality of mining results given its unsupervised nature?In this talk, we will present results of our preliminary attempts in mining patterns in structured video sequences (such as sports and surveillance video) and large annotated image collections. Specifically, we will discuss the potential of statistical models like Hierarchical HMM for video mining, and the integrative exploration of electronic knowledge (such as WordNet) and statistical clustering for image knowledge mining.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128893297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Facilitate knowledge communications 促进知识交流

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951684

Weihong Huang, T. Tao, Mohand-Said Hacid, A. Mille

With current multimedia information management techniques, the knowledge communications among users in multimedia e-Learning environments are still limited at a relative low single type media servicing level. New developments in multimedia knowledge discovery, representation and integration are needed to improve the intelligence of the knowledge management and communications at the semantic level. This paper proposes a novel contextual knowledge management framework to improve the current isolated learning information retrieval and communication status, by enabling flexible knowledge representation beyond heterogeneous multimedia learning resources and facilitating multilevel knowledge communications between instructors and learners. Based on knowledge communication model analysis in university e-Learning environments, a contextual knowledge representation model is presented. Corresponding knowledge retrieval techniques are discussed afterwards. To demonstrate the proposed concepts and techniques, a case study in a virtual scenario-based learning environment shows how the presented framework works with existing e-Learning content description standards and multimedia information retrieval techniques, and consequently enables a semantic-based interactive learning environment.

在现有的多媒体信息管理技术下，多媒体电子学习环境中用户之间的知识交流仍然局限于较低的单一类型媒体服务水平。在语义层面上提高知识管理和通信的智能化，需要在多媒体知识发现、表示和集成方面取得新的发展。本文提出了一种新的情境知识管理框架，通过在异构多媒体学习资源之外实现知识的灵活表达，促进教师和学习者之间的多层次知识交流，改善当前孤立的学习信息检索和交流状况。在分析大学电子学习环境中知识传播模型的基础上，提出了一种情境知识表示模型。随后讨论了相应的知识检索技术。为了演示所提出的概念和技术，一个基于虚拟场景的学习环境中的案例研究展示了所提出的框架如何与现有的电子学习内容描述标准和多媒体信息检索技术一起工作，从而实现基于语义的交互式学习环境。

{"title":"Facilitate knowledge communications","authors":"Weihong Huang, T. Tao, Mohand-Said Hacid, A. Mille","doi":"10.1145/951676.951684","DOIUrl":"https://doi.org/10.1145/951676.951684","url":null,"abstract":"With current multimedia information management techniques, the knowledge communications among users in multimedia e-Learning environments are still limited at a relative low single type media servicing level. New developments in multimedia knowledge discovery, representation and integration are needed to improve the intelligence of the knowledge management and communications at the semantic level. This paper proposes a novel contextual knowledge management framework to improve the current isolated learning information retrieval and communication status, by enabling flexible knowledge representation beyond heterogeneous multimedia learning resources and facilitating multilevel knowledge communications between instructors and learners. Based on knowledge communication model analysis in university e-Learning environments, a contextual knowledge representation model is presented. Corresponding knowledge retrieval techniques are discussed afterwards. To demonstrate the proposed concepts and techniques, a case study in a virtual scenario-based learning environment shows how the presented framework works with existing e-Learning content description standards and multimedia information retrieval techniques, and consequently enables a semantic-based interactive learning environment.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123043132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Bitmap indexing method for complex similarity queries with relevance feedback 具有关联反馈的复杂相似查询的位图索引方法

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951687

Guang-Ho Cha

The similarity indexing and searching is well known to be a difficult one for high-dimensional applications such as multimedia databases. Especially, it becomes more difficult when multiple features have to be indexed together. Moreover, few indexing methods are currently available to effectively support disjunctive queries for relevance feedback.In this paper, we propose a novel indexing method that is designed to efficiently handle complex similarity queries as well as relevance feedback in high-dimensional image and video databases. In order to provide the indexing method with the flexibility in control multiple features and multiple query objects, our method treats every dimension independently. The efficiency of our method is realized by a specialized bitmap indexing that represents all objects in a database as a set of bitmaps. The percentage of data accessed in our indexing method is inversely proportional to the overall dimensionality, and thus the performance deterioration with the increasing dimensionality does not occur.Our main contributions are three-fold: (1) We provide a novel way to index high-dimensional data; (2) Our method efficiently handles complex similarity queries; and (3) Disjunctive queries driven by relevance feedback are efficiently treated. Our empirical results demonstrate that our indexing method achieves speedups of 10 to 15 over the linear scan.

对于多媒体数据库等高维应用来说，相似度索引和搜索是一个非常困难的问题。特别是，当多个特征必须一起索引时，这变得更加困难。此外，目前很少有索引方法可以有效地支持相关反馈的析取查询。在本文中，我们提出了一种新的索引方法，旨在有效地处理高维图像和视频数据库中复杂的相似查询和相关反馈。为了使索引方法能够灵活地控制多个特征和多个查询对象，我们的方法对每个维度进行独立处理。我们的方法的效率是通过一个专门的位图索引来实现的，该索引将数据库中的所有对象表示为一组位图。在我们的索引方法中，访问的数据百分比与总体维数成反比，因此不会出现随着维数的增加而导致性能下降的情况。我们的主要贡献有三个方面:(1)我们提供了一种新的高维数据索引方法;(2)该方法能有效处理复杂的相似度查询;(3)有效处理由关联反馈驱动的析取查询。我们的经验结果表明，我们的索引方法实现了10到15比线性扫描的速度。

{"title":"Bitmap indexing method for complex similarity queries with relevance feedback","authors":"Guang-Ho Cha","doi":"10.1145/951676.951687","DOIUrl":"https://doi.org/10.1145/951676.951687","url":null,"abstract":"The similarity indexing and searching is well known to be a difficult one for high-dimensional applications such as multimedia databases. Especially, it becomes more difficult when multiple features have to be indexed together. Moreover, few indexing methods are currently available to effectively support disjunctive queries for relevance feedback.In this paper, we propose a novel indexing method that is designed to efficiently handle complex similarity queries as well as relevance feedback in high-dimensional image and video databases. In order to provide the indexing method with the flexibility in control multiple features and multiple query objects, our method treats every dimension independently. The efficiency of our method is realized by a specialized bitmap indexing that represents all objects in a database as a set of bitmaps. The percentage of data accessed in our indexing method is inversely proportional to the overall dimensionality, and thus the performance deterioration with the increasing dimensionality does not occur.Our main contributions are three-fold: (1) We provide a novel way to index high-dimensional data; (2) Our method efficiently handles complex similarity queries; and (3) Disjunctive queries driven by relevance feedback are efficiently treated. Our empirical results demonstrate that our indexing method achieves speedups of 10 to 15 over the linear scan.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"361 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132745628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Video query processing in the VDBMS testbed for video database research 视频查询处理在VDBMS试验台进行视频数据库的研究

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951682

Walid G. Aref, M. Hammad, A. Catlin, I. Ilyas, T. Ghanem, A. Elmagarmid, M. Marzouk

The increased use of video data sets for multimedia-based applications has created a demand for strong video database support, including efficient methods for handling the content-based query and retrieval of video data. Video query processing presents significant research challenges, mainly associated with the size, complexity and unstructured nature of video data. A video query processor must support video operations for search by content and streaming, new query types, and the incorporation of video methods and operators in generating, optimizing and executing query plans. In this paper, we address these query processing issues in two contexts, first as applied to the video data type and then as applied to the stream data type. We first present the query processing functionality of the VDBMS video database management system as a framework designed to support the full range of functionality for video as an abstract data type. We describe two query operators for the video data type which implement the rank-join and stop-after algorithms. As videos may be considered streams of consecutive image frames, video query processing can be expressed as continuous queries over video data streams. The stream data type was therefore introduced into the VDBMS system, and system functionality was extended to support general data streams. From this viewpoint, we present an approach for defining and processing streams, including video, through the query execution engine. We describe the implementation of several algorithms for video query processing expressed as continuous queries over video streams, such as fast forward, region-based blurring and left outer join. We include a description of the window-join algorithm as a core operator for continuous query systems, and discuss shared execution as an optimization approach for stream query processing.

基于多媒体的应用程序越来越多地使用视频数据集，这就产生了对强大视频数据库支持的需求，包括处理基于内容的视频数据查询和检索的有效方法。视频查询处理的研究面临着巨大的挑战，主要与视频数据的大小、复杂性和非结构化有关。视频查询处理器必须支持基于内容和流的视频操作，支持新的查询类型，支持在生成、优化和执行查询计划时结合视频方法和操作符。在本文中，我们在两种上下文中解决了这些查询处理问题，首先应用于视频数据类型，然后应用于流数据类型。我们首先将VDBMS视频数据库管理系统的查询处理功能作为一个框架来展示，该框架旨在支持作为抽象数据类型的视频的全部功能。我们描述了视频数据类型的两个查询运算符，它们实现了排名连接和停止后算法。由于视频可以看作是连续图像帧的流，因此视频查询处理可以表示为对视频数据流的连续查询。因此，流数据类型被引入到VDBMS系统中，系统功能被扩展为支持一般数据流。从这个角度出发，我们提出了一种通过查询执行引擎定义和处理流(包括视频)的方法。我们描述了几种视频查询处理算法的实现，这些算法表示为对视频流的连续查询，如快进、基于区域的模糊和左外连接。我们描述了窗口连接算法作为连续查询系统的核心操作符，并讨论了共享执行作为流查询处理的优化方法。

{"title":"Video query processing in the VDBMS testbed for video database research","authors":"Walid G. Aref, M. Hammad, A. Catlin, I. Ilyas, T. Ghanem, A. Elmagarmid, M. Marzouk","doi":"10.1145/951676.951682","DOIUrl":"https://doi.org/10.1145/951676.951682","url":null,"abstract":"The increased use of video data sets for multimedia-based applications has created a demand for strong video database support, including efficient methods for handling the content-based query and retrieval of video data. Video query processing presents significant research challenges, mainly associated with the size, complexity and unstructured nature of video data. A video query processor must support video operations for search by content and streaming, new query types, and the incorporation of video methods and operators in generating, optimizing and executing query plans. In this paper, we address these query processing issues in two contexts, first as applied to the video data type and then as applied to the stream data type. We first present the query processing functionality of the VDBMS video database management system as a framework designed to support the full range of functionality for video as an abstract data type. We describe two query operators for the video data type which implement the rank-join and stop-after algorithms. As videos may be considered streams of consecutive image frames, video query processing can be expressed as continuous queries over video data streams. The stream data type was therefore introduced into the VDBMS system, and system functionality was extended to support general data streams. From this viewpoint, we present an approach for defining and processing streams, including video, through the query execution engine. We describe the implementation of several algorithms for video query processing expressed as continuous queries over video streams, such as fast forward, region-based blurring and left outer join. We include a description of the window-join algorithm as a core operator for continuous query systems, and discuss shared execution as an optimization approach for stream query processing.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127510153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 27

Content based sub-image retrieval via hierarchical tree matching 基于内容的分层树匹配子图像检索

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951689

Jie Luo, M. Nascimento

This paper deals with the problem of finding images that contain a given query image, the so-called content-based sub-image retrieval. We propose an approach based on a hierarchical tree that encodes the color feature of image tiles which are in turn stored as an index sequence. The index sequences of both candidate images and the query sub-image are then compared in order to rank the database images suitability with respect to the query. In our experiments, using 10,000 images and disk-resident metadata, for 60Σ (80Σ) of the queries the relevant image, i.e., the one where the query sub-image was extracted from, was found among the first 10 (50) retrieved images in about 0.15 sec.

本文研究的问题是寻找包含给定查询图像的图像，即所谓的基于内容的子图像检索。我们提出了一种基于层次树的方法，该方法对图像贴片的颜色特征进行编码，然后将其存储为索引序列。然后比较候选图像和查询子图像的索引序列，以便对数据库图像相对于查询的适用性进行排序。在我们的实验中，使用10,000张图像和磁盘驻留元数据，对于60Σ (80Σ)的查询，在大约0.15秒内从前10(50)张检索到的图像中找到相关图像，即提取查询子图像的图像。

引用次数: 32

Kernel VA-files for relevance feedback retrieva 用于相关反馈检索的内核va文件

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951686

Douglas R. Heisterkamp, Jing Peng

Many data partitioning index methods perform poorly in high dimensional space and do not support relevance feedback retrieval. The vector approximation file (VA-File) approach overcomes some of the difficulties of high dimensional vector spaces, but cannot be applied to relevance feedback retrieval using kernel distances in the data measurement space. This paper introduces a novel KVA-File (kernel VA-File) that extends VA-File to kernel-based retrieval methods. A key observation is that kernel distances may be non-linear in the data measurement space but is still linear in an induced feature space. It is this linear invariance in the induced feature space that enables KVA-File to work with kernel distances. An efficient approach to approximating vectors in an induced feature space is presented with the corresponding upper and lower distance bounds. Thus an effective indexing method is provided for kernel-based relevance feedback image retrieval methods. Experimental results using large image data sets (approximately 100,000 images with 463 dimensions of measurement) validate the efficacy of our method.

许多数据分区索引方法在高维空间中表现不佳，不支持相关反馈检索。向量逼近文件(VA-File)方法克服了高维向量空间的一些困难，但不能应用于利用数据测量空间中的核距离进行相关反馈检索。本文介绍了一种新的KVA-File(内核VA-File)，它将VA-File扩展到基于内核的检索方法。一个关键的观察是，核距离在数据测量空间中可能是非线性的，但在诱导特征空间中仍然是线性的。正是这种诱导特征空间中的线性不变性使KVA-File能够处理内核距离。提出了一种在诱导特征空间中近似向量的有效方法，并给出了相应的上下距离边界。从而为基于核的相关反馈图像检索方法提供了一种有效的索引方法。使用大型图像数据集(约100,000张图像，463个测量维度)的实验结果验证了我们的方法的有效性。

引用次数: 15

Image database retrieval utilizing affinity relationships 利用亲和关系的图像数据库检索

ACM International Workshop on Multimedia Databases

Pub Date : 2003-11-07 DOI: 10.1145/951676.951691

M. Shyu, Shu‐Ching Chen, Min Chen, Chengcui Zhang, Kanoksri Sarinnapakorn

Recent research effort in Content-Based Image Retrieval (CBIR) focuses on bridging the gap between low-level features and high-level semantic contents of images as this gap has become the bottleneck of CBIR. In this paper, an effective image database retrieval framework using a new mechanism called the Markov Model Mediator (MMM) is presented to meet this demand by taking into consideration not only the low-level image features, but also the high-level concepts learned from the history of user's access pattern and access frequencies on the images in the database. Also, the proposed framework is efficient in two aspects: 1) Overhead for real-time training is avoided in the image retrieval process because the high-level concepts of images are captured in the off-line training process. 2) Before the exact similarity matching process, Principal Component Analysis (PCA) is applied to reduce the image search space. A training subsystem for this framework is implemented and integrated into our system. The experimental results demonstrate that the MMM mechanism can effectively assist in retrieving more accurate results from image databases.

目前基于内容的图像检索(CBIR)的研究主要集中在弥合图像的低级特征和高级语义内容之间的差距，这一差距已经成为CBIR的瓶颈。为了满足这一需求，本文提出了一种有效的图像数据库检索框架，该框架采用一种名为马尔可夫模型中介(MMM)的新机制，该机制不仅考虑了图像的底层特征，而且考虑了从数据库中图像的用户访问模式和访问频率的历史中学习到的高层概念。此外，该框架还具有以下两方面的效率:1)在离线训练过程中捕获图像的高级概念，避免了图像检索过程中实时训练的开销;2)在精确相似度匹配之前，采用主成分分析(PCA)减小图像搜索空间。实现了该框架的训练子系统，并将其集成到系统中。实验结果表明，MMM机制可以有效地辅助从图像数据库中检索更准确的结果。

{"title":"Image database retrieval utilizing affinity relationships","authors":"M. Shyu, Shu‐Ching Chen, Min Chen, Chengcui Zhang, Kanoksri Sarinnapakorn","doi":"10.1145/951676.951691","DOIUrl":"https://doi.org/10.1145/951676.951691","url":null,"abstract":"Recent research effort in Content-Based Image Retrieval (CBIR) focuses on bridging the gap between low-level features and high-level semantic contents of images as this gap has become the bottleneck of CBIR. In this paper, an effective image database retrieval framework using a new mechanism called the Markov Model Mediator (MMM) is presented to meet this demand by taking into consideration not only the low-level image features, but also the high-level concepts learned from the history of user's access pattern and access frequencies on the images in the database. Also, the proposed framework is efficient in two aspects: 1) Overhead for real-time training is avoided in the image retrieval process because the high-level concepts of images are captured in the off-line training process. 2) Before the exact similarity matching process, Principal Component Analysis (PCA) is applied to reduce the image search space. A training subsystem for this framework is implemented and integrated into our system. The experimental results demonstrate that the MMM mechanism can effectively assist in retrieving more accurate results from image databases.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130786837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 48