首页 > 最新文献

ACM International Workshop on Multimedia Databases最新文献

英文 中文
A unified framework for image database clustering and content-based retrieval 一个统一的图像数据库聚类和基于内容的检索框架
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032609
M. Shyu, Shu‐Ching Chen, Min Chen, Chengcui Zhang
With the proliferation of image data, the need to search and retrieve images efficiently and accurately from a large image database or a collection of image databases has drastically increased. To address such a demand, a unified framework called Markov Model Mediators (MMMs) is proposed in this paper to facilitate conceptual database clustering and to improve the query processing performance by analyzing the summarized knowledge. The unique characteristics of MMMs are that it provides the capabilities of exploring the affinity relations among the images at the database level and among the databases at the cluster level respectively, using an effective data mining process. At the database level, each database is modeled by an intra-database MMM which enables accurate image retrieval within the database. Then the conceptual database clustering is performed and cluster-level knowledge summarization is conducted to reduce the cost of retrieving images across the databases. This framework has been tested using a set of image databases, which contain various numbers of images with different dimensions and concept categories. The experimental results demonstrate that our framework achieves better retrieval accuracy via inter-cluster retrieval than that of intra-cluster retrieval with minimal extra effort.
随着图像数据的激增,从大型图像数据库或图像数据库集合中高效、准确地搜索和检索图像的需求急剧增加。针对这一需求,本文提出了一种统一的马尔可夫模型中介(Markov Model mediator, MMMs)框架,以促进概念数据库聚类,并通过分析汇总的知识来提高查询处理性能。mm的独特之处在于,它提供了使用有效的数据挖掘过程分别在数据库级别和集群级别探索图像之间和数据库之间的亲和关系的能力。在数据库级别上,每个数据库都由数据库内的MMM建模,从而在数据库内实现准确的图像检索。然后进行概念数据库聚类,并进行聚类级知识汇总,以降低跨数据库检索图像的成本。这个框架已经使用一组图像数据库进行了测试,这些数据库包含不同尺寸和概念类别的不同数量的图像。实验结果表明,该框架通过簇间检索比簇内检索获得了更好的检索精度。
{"title":"A unified framework for image database clustering and content-based retrieval","authors":"M. Shyu, Shu‐Ching Chen, Min Chen, Chengcui Zhang","doi":"10.1145/1032604.1032609","DOIUrl":"https://doi.org/10.1145/1032604.1032609","url":null,"abstract":"With the proliferation of image data, the need to search and retrieve images efficiently and accurately from a large image database or a collection of image databases has drastically increased. To address such a demand, a unified framework called <i>Markov Model Mediators</i> (MMMs) is proposed in this paper to facilitate conceptual database clustering and to improve the query processing performance by analyzing the summarized knowledge. The unique characteristics of MMMs are that it provides the capabilities of exploring the affinity relations among the images at the database level and among the databases at the cluster level respectively, using an effective data mining process. At the database level, each database is modeled by an intra-database MMM which enables accurate image retrieval within the database. Then the conceptual database clustering is performed and cluster-level knowledge summarization is conducted to reduce the cost of retrieving images across the databases. This framework has been tested using a set of image databases, which contain various numbers of images with different dimensions and concept categories. The experimental results demonstrate that our framework achieves better retrieval accuracy via inter-cluster retrieval than that of intra-cluster retrieval with minimal extra effort.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125843194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
Content-based sub-image retrieval using relevance feedback 基于内容的关联反馈子图像检索
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032607
Jie Luo, M. Nascimento
This paper presents the use of relevance feedback to the problem of content-based sub-image retrieval (CBsIR). Relevance feedback is used to improve the accuracy of successive retrievals via a tile re-weighting scheme that assigns penalties to each tile of database images and updates the tile penalties for all relevant images retrieved at each iteration using both the relevant (positive) and irrelevant (negative) images identified by the user. Performance evaluation on a dataset of over 10,000 images shows the effectiveness and efficiency of the proposed framework. Using 64 quantized colors in the RGB color space, the system can achieve a stable average recall value of 70% within the top 20 retrieved (and presented) images after only 5 iterations, with each such iteration taking about 2 seconds.
本文提出了将相关反馈应用于基于内容的子图像检索(CBsIR)问题。相关反馈用于提高连续检索的准确性,方法是通过一种tile重新加权方案,该方案为数据库图像的每个tile分配惩罚,并使用用户识别的相关(正面)和不相关(负面)图像更新每次迭代检索的所有相关图像的tile惩罚。在超过10,000张图像的数据集上的性能评估表明了所提出框架的有效性和效率。使用RGB色彩空间中的64种量化颜色,系统只需5次迭代,每次迭代大约需要2秒,就可以在前20张检索(和呈现)图像中实现稳定的平均召回值70%。
{"title":"Content-based sub-image retrieval using relevance feedback","authors":"Jie Luo, M. Nascimento","doi":"10.1145/1032604.1032607","DOIUrl":"https://doi.org/10.1145/1032604.1032607","url":null,"abstract":"This paper presents the use of relevance feedback to the problem of content-based sub-image retrieval (CBsIR). Relevance feedback is used to improve the accuracy of successive retrievals via a tile re-weighting scheme that assigns penalties to each tile of database images and updates the tile penalties for all relevant images retrieved at each iteration using both the relevant (positive) and irrelevant (negative) images identified by the user. Performance evaluation on a dataset of over 10,000 images shows the effectiveness and efficiency of the proposed framework. Using 64 quantized colors in the RGB color space, the system can achieve a stable average recall value of 70% within the top 20 retrieved (and presented) images after only 5 iterations, with each such iteration taking about 2 seconds.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125439604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
A PCA-based similarity measure for multivariate time series 基于pca的多变量时间序列相似性度量
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032616
Kiyoung Yang, C. Shahabi
Multivariate time series (MTS) datasets are common in various multimedia, medical and financial applications. We propose a similarity measure for MTS datasets, Eros Extended Frobenius norm), which is based on Principal Component Analysis (PCA). Eros applies PCA to MTS datasets represented as matrices to generate principal components and associated eigenvalues. These principal components and eigenvalues are then used to compare the similarity between MTS matrices. Though Eros in itself does not satisfy the triangle inequality, without which existing multidimensional indexing structures may not be utilized, the lower and upper bounds to satisfy the triangle inequality are obtained. In order to show the validity of Eros for similarity search on MTS datasets, we performed several experiments on three datasets (2 real-world and 1 synthetic). The results show the superiority of our approaches as compared to the traditional similarity measures for MTS datasets, such as Euclidean Distance (ED), Dynamic Time Warping (DTW), Weighted Sum SVD (WSSVD) and PCA similarity factor (SPCA) in precision/recall.
多元时间序列(MTS)数据集在各种多媒体、医疗和金融应用中很常见。本文提出了一种基于主成分分析(PCA)的MTS数据集相似性度量方法——Eros Extended Frobenius norm。Eros将PCA应用于以矩阵表示的MTS数据集,生成主成分和相关特征值。然后使用这些主成分和特征值来比较MTS矩阵之间的相似性。虽然Eros本身不满足三角不等式,没有它就不能利用现有的多维索引结构,但得到了满足三角不等式的下界和上界。为了证明Eros在MTS数据集上相似性搜索的有效性,我们在三个数据集(2个真实数据集和1个合成数据集)上进行了几个实验。结果表明,与传统的MTS数据集相似度度量方法(如欧氏距离(ED)、动态时间扭曲(DTW)、加权和SVD (WSSVD)和PCA相似因子(SPCA))相比,我们的方法在精度/召回率方面具有优势。
{"title":"A PCA-based similarity measure for multivariate time series","authors":"Kiyoung Yang, C. Shahabi","doi":"10.1145/1032604.1032616","DOIUrl":"https://doi.org/10.1145/1032604.1032616","url":null,"abstract":"Multivariate time series (MTS) datasets are common in various multimedia, medical and financial applications. We propose a similarity measure for MTS datasets, <i>Eros</i> <i>E</i>xtended F<i>ro</i>beniu<i>s</i> norm), which is based on Principal Component Analysis (PCA). <i>Eros</i> applies PCA to MTS datasets represented as matrices to generate principal components and associated eigenvalues. These principal components and eigenvalues are then used to compare the similarity between MTS matrices. Though <i>Eros</i> in itself does not satisfy the triangle inequality, without which existing multidimensional indexing structures may not be utilized, the lower and upper bounds to satisfy the triangle inequality are obtained. In order to show the validity of <i>Eros</i> for similarity search on MTS datasets, we performed several experiments on three datasets (2 real-world and 1 synthetic). The results show the superiority of our approaches as compared to the traditional similarity measures for MTS datasets, such as Euclidean Distance (ED), Dynamic Time Warping (DTW), Weighted Sum SVD (WSSVD) and PCA similarity factor (S<sc>PCA</sc>) in precision/recall.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130798355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 273
Looking at mapping, indexing & querying of MPEG-7 descriptors in RDBMS with SM3 用SM3研究RDBMS中MPEG-7描述符的映射、索引和查询
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032615
Yang Chu, L. Chia, S. Bhowmick
MPEG-7 documents, which are primarily for multimedia information exchange, are also data-centric XML documents. Due to its advantages, the relational DBMS is the best choice for storing such XML documents. Storing XML data in relational DBMS can be classified into two classes of storage model: structure-mapping and model-mapping. However, the structure-mapping model cannot support complex Xpath-based query efficiently and model mapping approach lacks the flexible capability in representing all kinds of datatypes. In this paper, we present a new storage approach, called SM3. As an XML document, MPEG-7 document can be viewed as XML tree. Such a tree graph, where the internal nodes are element type with element contents, represents the structure of document and can be viewed as nodes which are meaningful only for document traversal. The leaf node, which is a single-valued attribute or element type with text content, has little usage for XML tree routing as it is the end-point of Xpath. So it can be viewed as the special node which only holds value. In this paper, SM3 was designed to use model-mapping approach to store all internal nodes and structure-mapping model to store all leaf nodes. SM3 integrate the advantages of those two models and avoid the main drawbacks from each method. Performance studies are conducted by comparing SM3 with XParent (a pure model-mapping method) and SM3 with XML-DBMS (a pure structure-mapping method). The experimental results are presented in the paper and initial results are encouraging.
主要用于多媒体信息交换的MPEG-7文档也是以数据为中心的XML文档。由于其优点,关系DBMS是存储此类XML文档的最佳选择。在关系DBMS中存储XML数据可以分为两类存储模型:结构映射和模型映射。但是,结构映射模型不能有效地支持复杂的基于xpath的查询,模型映射方法在表示各种数据类型方面缺乏灵活的能力。在本文中,我们提出了一种新的存储方法,称为SM3。作为一种XML文档,MPEG-7文档可以看作是一棵XML树。这种树状图的内部节点是具有元素内容的元素类型,它表示文档的结构,并且可以被视为仅对文档遍历有意义的节点。叶节点是具有文本内容的单值属性或元素类型,它很少用于XML树路由,因为它是Xpath的终点。所以它可以被看作是一个特殊的节点,它只保存值。本文设计SM3采用模型映射方法存储所有内部节点,结构映射模型存储所有叶节点。SM3综合了这两种方法的优点,避免了每种方法的主要缺点。通过比较SM3与XParent(一种纯模型映射方法)和SM3与XML-DBMS(一种纯结构映射方法)进行性能研究。本文给出了实验结果,初步结果令人鼓舞。
{"title":"Looking at mapping, indexing & querying of MPEG-7 descriptors in RDBMS with SM3","authors":"Yang Chu, L. Chia, S. Bhowmick","doi":"10.1145/1032604.1032615","DOIUrl":"https://doi.org/10.1145/1032604.1032615","url":null,"abstract":"MPEG-7 documents, which are primarily for multimedia information exchange, are also data-centric XML documents. Due to its advantages, the relational DBMS is the best choice for storing such XML documents. Storing XML data in relational DBMS can be classified into two classes of storage model: structure-mapping and model-mapping. However, the structure-mapping model cannot support complex Xpath-based query efficiently and model mapping approach lacks the flexible capability in representing all kinds of datatypes. In this paper, we present a new storage approach, called SM3. As an XML document, MPEG-7 document can be viewed as XML tree. Such a tree graph, where the internal nodes are element type with element contents, represents the structure of document and can be viewed as nodes which are meaningful only for document traversal. The leaf node, which is a single-valued attribute or element type with text content, has little usage for XML tree routing as it is the end-point of Xpath. So it can be viewed as the special node which only holds value. In this paper, SM3 was designed to use model-mapping approach to store all internal nodes and structure-mapping model to store all leaf nodes. SM3 integrate the advantages of those two models and avoid the main drawbacks from each method. Performance studies are conducted by comparing SM3 with XParent (a pure model-mapping method) and SM3 with XML-DBMS (a pure structure-mapping method). The experimental results are presented in the paper and initial results are encouraging.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127877114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Automatic image annotation and retrieval using subspace clustering algorithm 基于子空间聚类算法的图像自动标注与检索
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032621
Lei Wang, Li Liu, L. Khan
The development of technology generates huge amounts of non-textual information, such as images. An efficient image annotation and retrieval system is highly desired. Clustering algorithms make it possible to represent visual features of images with finite symbols. Based on this, many statistical models, which analyze correspondence between visual features and words and discover hidden semantics, have been published. These models improve the annotation and retrieval of large image databases. However, image data usually have a large number of dimensions. Traditional clustering algorithms assign equal weights to these dimensions, and become confounded in the process of dealing with these dimensions. In this paper, we propose a top-down, subspace clustering algorithm as a solution to this problem. For a given cluster, we determine relevant features based on histogram analysis and assign greater weight to relevant features as compared to less relevant features. We have implemented four different models to link visual tokens with keywords based on the clustering results of our clustering algorithm and K-means algorithm, and evaluated performance using precision, recall and correspondence accuracy using benchmark dataset. The results show that our algorithm is better than traditional ones for automatic image annotation and retrieval.
技术的发展产生了大量的非文本信息,如图像。人们迫切需要一个高效的图像标注和检索系统。聚类算法使得用有限的符号表示图像的视觉特征成为可能。在此基础上,许多分析视觉特征与词汇对应关系、发现隐藏语义的统计模型相继问世。这些模型改进了大型图像数据库的标注和检索。然而,图像数据通常具有大量的维度。传统的聚类算法对这些维度赋予相同的权重,在处理这些维度的过程中容易产生混淆。在本文中,我们提出了一种自顶向下的子空间聚类算法来解决这个问题。对于给定的聚类,我们基于直方图分析确定相关特征,并为相关特征分配比不相关特征更大的权重。基于我们的聚类算法和K-means算法的聚类结果,我们实现了四种不同的模型来链接视觉标记和关键字,并使用基准数据集使用精度、召回率和对应精度来评估性能。结果表明,该算法在图像自动标注和检索方面优于传统算法。
{"title":"Automatic image annotation and retrieval using subspace clustering algorithm","authors":"Lei Wang, Li Liu, L. Khan","doi":"10.1145/1032604.1032621","DOIUrl":"https://doi.org/10.1145/1032604.1032621","url":null,"abstract":"The development of technology generates huge amounts of non-textual information, such as images. An efficient image annotation and retrieval system is highly desired. Clustering algorithms make it possible to represent visual features of images with finite symbols. Based on this, many statistical models, which analyze correspondence between visual features and words and discover hidden semantics, have been published. These models improve the annotation and retrieval of large image databases. However, image data usually have a large number of dimensions. Traditional clustering algorithms assign equal weights to these dimensions, and become confounded in the process of dealing with these dimensions. In this paper, we propose a top-down, subspace clustering algorithm as a solution to this problem. For a given cluster, we determine relevant features based on histogram analysis and assign greater weight to relevant features as compared to less relevant features. We have implemented four different models to link visual tokens with keywords based on the clustering results of our clustering algorithm and K-means algorithm, and evaluated performance using precision, recall and correspondence accuracy using benchmark dataset. The results show that our algorithm is better than traditional ones for automatic image annotation and retrieval.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115962781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Automatic classification of speech and music using neural networks 使用神经网络的语音和音乐自动分类
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032620
M. K. S. Khan, W. Al-Khatib, M. Moinuddin
The importance of automatic discrimination between speech signals and music signals has evolved as a research topic over recent years. The need to classify audio into categories such as speech or music is an important aspect of many multimedia document retrieval systems. Several approaches have been previously used to discriminate between speech and music data. In this paper, we propose the use of the mean and variance of the discrete wavelet transform in addition to other features that have been used previously for audio classification. We have used Multi-Layer Perceptron (MLP) Neural Networks as a classifier. Our initial tests have shown encouraging results that indicate the viability of our approach.
近年来,语音信号与音乐信号自动识别的重要性逐渐成为一个研究课题。需要将音频分类为语音或音乐等类别是许多多媒体文档检索系统的一个重要方面。以前有几种方法被用来区分语音和音乐数据。在本文中,我们建议使用离散小波变换的均值和方差以及之前用于音频分类的其他特征。我们使用多层感知器(MLP)神经网络作为分类器。我们的初步测试显示出令人鼓舞的结果,表明我们的方法是可行的。
{"title":"Automatic classification of speech and music using neural networks","authors":"M. K. S. Khan, W. Al-Khatib, M. Moinuddin","doi":"10.1145/1032604.1032620","DOIUrl":"https://doi.org/10.1145/1032604.1032620","url":null,"abstract":"The importance of automatic discrimination between speech signals and music signals has evolved as a research topic over recent years. The need to classify audio into categories such as speech or music is an important aspect of many multimedia document retrieval systems. Several approaches have been previously used to discriminate between speech and music data. In this paper, we propose the use of the mean and variance of the discrete wavelet transform in addition to other features that have been used previously for audio classification. We have used Multi-Layer Perceptron (MLP) Neural Networks as a classifier. Our initial tests have shown encouraging results that indicate the viability of our approach.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128502205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Semantic retrieval of multimedia data 多媒体数据的语义检索
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032612
Samira Hammiche, S. Benbernou, Mohand-Said Hacid, A. Vakali
This paper deals with the problem of finding multimedia data that fulfill the requirements of user queries. We assume both the user query and the multimedia data are expressed by MPEG-7 standard. The MPEG-7 formalism lacks the semantics and reasoning support in many ways. For example, the search of the implicit data can not be achieved, due to its description based on XML schema. We propose a framework for querying multimedia data based on a tree embedding approximation algorithm, combining the MPEG-7 standard and an ontology.
本文研究了如何查找满足用户查询要求的多媒体数据的问题。我们假设用户查询和多媒体数据都采用MPEG-7标准表示。MPEG-7的形式化在许多方面缺乏语义和推理支持。例如,由于隐式数据的描述基于XML模式,因此无法实现对隐式数据的搜索。结合MPEG-7标准和本体,提出了一种基于树嵌入近似算法的多媒体数据查询框架。
{"title":"Semantic retrieval of multimedia data","authors":"Samira Hammiche, S. Benbernou, Mohand-Said Hacid, A. Vakali","doi":"10.1145/1032604.1032612","DOIUrl":"https://doi.org/10.1145/1032604.1032612","url":null,"abstract":"This paper deals with the problem of finding multimedia data that fulfill the requirements of user queries. We assume both the user query and the multimedia data are expressed by MPEG-7 standard. The MPEG-7 formalism lacks the semantics and reasoning support in many ways. For example, the search of the implicit data can not be achieved, due to its description based on XML schema. We propose a framework for querying multimedia data based on a tree embedding approximation algorithm, combining the MPEG-7 standard and an ontology.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133664483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Web-based multimedia databases: prospects and challenges 基于web的多媒体数据库:前景与挑战
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032605
A. Ghafoor
Development of Web-based multimedia applications is expected to hold central importance for engineering and technological progress during the rest of this decade. It is already opening up new research frontiers in a number of areas such as multimedia data modeling and indexing, data mining, multimedia document management, semantic Web, pervasive computing, distributed sensor networks, computer security, real-time operating systems, human-computer interaction, and storage technology etc. As a result of concerted effort in these areas, many Web-accessible multimedia applications involving different media types, e.g., video, audio, text, images, animation and graphics, are rapidly emerging Examples of such applications abound in the domains of health care, education, entertainment, manufacturing, e-commerce, digital libraries as well as military and critical national infrastructures. The premise is that the integration of Web and multimedia technologies can provide cost effective solutions for management and dissemination of information, which is a primary tool for increasing economic efficiency. Development of Web-based multimedia applications needs a broad range of technological solutions that deal with organizing, storing, and delivering multimedia information in an integrated, secure and timely manner with guaranteed quality of service (QoS). Multimedia database management, when viewed in conjunction with integration of contents from independent Web-based data sources, present formidable research and development challenges. Key challenges include: • content analysis and indexing of distributed multimedia data and documents • semantic modeling and knowledge-based representation of multimedia data • transformation and organization of multimedia data semantics as a part of Semantic Web • security, privacy and QoS related issues concerning Web-based multimedia database applications • emerging Web standards and their role in managing distributed multimedia databases In this talk, we elaborate on these challenges and describe several solutions and tools that have been developed for Web-based multimedia database systems. Acknowledgement: This research was in part supported by the National Science Foundation under the awards IIS0209111 and EIC-9972883. Copyright is held by the author/owner(s). MMDB'04, November 13, 2004, Washington, DC, USA. ACM 1-58113-975-6/04/0011.
基于web的多媒体应用程序的开发预计将在本十年的剩余时间里对工程和技术进步具有中心重要性。它已经在多媒体数据建模与索引、数据挖掘、多媒体文档管理、语义网、普然计算、分布式传感器网络、计算机安全、实时操作系统、人机交互、存储技术等领域开辟了新的研究前沿。由于在这些领域的协同努力,许多涉及不同媒体类型(如视频、音频、文本、图像、动画和图形)的web可访问多媒体应用正在迅速涌现,这类应用的例子在卫生保健、教育、娱乐、制造业、电子商务、数字图书馆以及军事和关键国家基础设施等领域比比皆是。前提是网络和多媒体技术的集成可以为信息的管理和传播提供具有成本效益的解决方案,这是提高经济效率的主要工具。基于web的多媒体应用程序的开发需要广泛的技术解决方案,以集成、安全和及时的方式组织、存储和交付多媒体信息,并保证服务质量(QoS)。多媒体数据库管理,当与来自独立的基于web的数据源的内容集成结合在一起时,提出了艰巨的研究和开发挑战。主要挑战包括:•分布式多媒体数据和文档的内容分析和索引•多媒体数据的语义建模和基于知识的表示•作为语义网一部分的多媒体数据语义的转换和组织•关于基于Web的多媒体数据库应用程序的安全、隐私和QoS相关问题•新兴的Web标准及其在管理分布式多媒体数据库中的作用我们详细阐述了这些挑战,并描述了为基于web的多媒体数据库系统开发的几种解决方案和工具。致谢:本研究得到了美国国家科学基金会IIS0209111和EIC-9972883奖项的部分支持。版权由作者/拥有人持有。MMDB'04, 2004年11月13日,美国华盛顿特区。ACM 1 - 58113 - 975 - 6/04/0011。
{"title":"Web-based multimedia databases: prospects and challenges","authors":"A. Ghafoor","doi":"10.1145/1032604.1032605","DOIUrl":"https://doi.org/10.1145/1032604.1032605","url":null,"abstract":"Development of Web-based multimedia applications is expected to hold central importance for engineering and technological progress during the rest of this decade. It is already opening up new research frontiers in a number of areas such as multimedia data modeling and indexing, data mining, multimedia document management, semantic Web, pervasive computing, distributed sensor networks, computer security, real-time operating systems, human-computer interaction, and storage technology etc. As a result of concerted effort in these areas, many Web-accessible multimedia applications involving different media types, e.g., video, audio, text, images, animation and graphics, are rapidly emerging Examples of such applications abound in the domains of health care, education, entertainment, manufacturing, e-commerce, digital libraries as well as military and critical national infrastructures. The premise is that the integration of Web and multimedia technologies can provide cost effective solutions for management and dissemination of information, which is a primary tool for increasing economic efficiency. Development of Web-based multimedia applications needs a broad range of technological solutions that deal with organizing, storing, and delivering multimedia information in an integrated, secure and timely manner with guaranteed quality of service (QoS). Multimedia database management, when viewed in conjunction with integration of contents from independent Web-based data sources, present formidable research and development challenges. Key challenges include: • content analysis and indexing of distributed multimedia data and documents • semantic modeling and knowledge-based representation of multimedia data • transformation and organization of multimedia data semantics as a part of Semantic Web • security, privacy and QoS related issues concerning Web-based multimedia database applications • emerging Web standards and their role in managing distributed multimedia databases In this talk, we elaborate on these challenges and describe several solutions and tools that have been developed for Web-based multimedia database systems. Acknowledgement: This research was in part supported by the National Science Foundation under the awards IIS0209111 and EIC-9972883. Copyright is held by the author/owner(s). MMDB'04, November 13, 2004, Washington, DC, USA. ACM 1-58113-975-6/04/0011.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133221162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A motion based scene tree for browsing and retrieval of compressed videos 用于浏览和检索压缩视频的基于运动的场景树
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032608
Haoran Yi, D. Rajan, L. Chia
This paper describes a fully automatic content-based approach for browsing and retrieval of MPEG-2 compressed video. The first step of the approach is the detection of shot boundaries based on motion vectors available from the compressed video stream. The next step involves the construction of a scene tree from the shots obtained earlier. The scene tree is shown to capture some semantic information as well as to provide a construct for hierarchical browsing of compressed videos. Finally, we build a new model for video similarity based on global as well as local motion associated with each node in the scene tree. To this end, we propose new approaches to camera motion and object motion estimation. The experimental results demonstrate that the integration of the above techniques results in an efficient framework for browsing and searching large video databases.
本文描述了一种全自动的基于内容的MPEG-2压缩视频浏览和检索方法。该方法的第一步是基于压缩视频流中可用的运动向量检测镜头边界。下一步是根据之前获得的镜头构建场景树。场景树被用来捕获一些语义信息,并为压缩视频的分层浏览提供了一个结构。最后,我们建立了一个新的基于全局和局部运动的视频相似度模型,该模型与场景树中每个节点相关联。为此,我们提出了相机运动和目标运动估计的新方法。实验结果表明,上述技术的融合为大型视频数据库的浏览和搜索提供了一个高效的框架。
{"title":"A motion based scene tree for browsing and retrieval of compressed videos","authors":"Haoran Yi, D. Rajan, L. Chia","doi":"10.1145/1032604.1032608","DOIUrl":"https://doi.org/10.1145/1032604.1032608","url":null,"abstract":"This paper describes a fully automatic content-based approach for browsing and retrieval of MPEG-2 compressed video. The first step of the approach is the detection of shot boundaries based on motion vectors available from the compressed video stream. The next step involves the construction of a scene tree from the shots obtained earlier. The scene tree is shown to capture some semantic information as well as to provide a construct for hierarchical browsing of compressed videos. Finally, we build a new model for video similarity based on global as well as local motion associated with each node in the scene tree. To this end, we propose new approaches to camera motion and object motion estimation. The experimental results demonstrate that the integration of the above techniques results in an efficient framework for browsing and searching large video databases.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117003938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
VRules: an effective association-based classifier for videos VRules:有效的基于关联的视频分类器
Pub Date : 2004-11-13 DOI: 10.1145/1032604.1032619
Ling Chen, S. Bhowmick, L. Chia
Video classification is an important step towards multimedia understanding. Most state-of-the-art approaches which apply HMM to capture the temporal information of videos have the limitation by assuming that the current state of a video depends only on the immediate previous state. Nevertheless, this assumption may not hold for videos of various categories. In this paper, we present an effective video classifier which employs the association rule mining technique to discover the actual dependence relationship between video states. The discriminatory state transition patterns mined from different video categories are then used to perform classification. Besides capturing the association between states in the time space, we also capture the association between low-level features in spatial dimension to further distinguish the semantics of videos. Experimental results show that the performance of our association rule based classifier is quite promising.
视频分类是理解多媒体的重要步骤。大多数应用HMM来捕获视频时间信息的最先进的方法都有一个局限性,即假设视频的当前状态仅取决于前一个状态。然而,这种假设可能不适用于各种类型的视频。本文提出了一种有效的视频分类器,该分类器采用关联规则挖掘技术来发现视频状态之间的实际依赖关系。然后使用从不同视频类别中挖掘的歧视性状态转移模式进行分类。除了在时间空间上捕获状态之间的关联外,我们还在空间维度上捕获低级特征之间的关联,以进一步区分视频的语义。实验结果表明,基于关联规则的分类器具有良好的性能。
{"title":"VRules: an effective association-based classifier for videos","authors":"Ling Chen, S. Bhowmick, L. Chia","doi":"10.1145/1032604.1032619","DOIUrl":"https://doi.org/10.1145/1032604.1032619","url":null,"abstract":"Video classification is an important step towards multimedia understanding. Most state-of-the-art approaches which apply HMM to capture the temporal information of videos have the limitation by assuming that the current state of a video depends only on the immediate previous state. Nevertheless, this assumption may not hold for videos of various categories. In this paper, we present an effective video classifier which employs the association rule mining technique to discover the actual dependence relationship between video states. The discriminatory state transition patterns mined from different video categories are then used to perform classification. Besides capturing the association between states in the time space, we also capture the association between low-level features in spatial dimension to further distinguish the semantics of videos. Experimental results show that the performance of our association rule based classifier is quite promising.","PeriodicalId":415406,"journal":{"name":"ACM International Workshop on Multimedia Databases","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122307200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
ACM International Workshop on Multimedia Databases
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1