Methods of Multi-Modal Data Exploration

Proceedings of the 2019 on International Conference on Multimedia Retrieval Pub Date : 2019-06-05 DOI:10.1145/3323873.3325858

Tomás Grosup

引用次数: 0

Abstract

Techniques and tools designed for information retrieval, data exploration or data analytical tasks are based on the relational and text-search model, and cannot be easily applied to unstructured data such as images or videos. Researcher communities have been trying to reveal the semantics of multimedia in the last decades with ever-improving results in various tasks, dominated by the latest success of deep learning. Limits of object retrieval models drive the need for data exploration methods that support multi-modal data, like multimedia surrounded by structured attributes. In this paper, we describe, implement and evaluate exploration methods using multiple modalities and retrieval models in the context of multimedia. We apply the techniques in e-commerce product search and recommending, and demonstrate benefit for different retrieval scenarios. Lastly, we propose a method for extending database schema by latent visual attributes learned from image data. This enables closing the loop by going back to relational data, and potentially benefiting a range of industrial applications.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

多模态数据探索方法

为信息检索、数据探索或数据分析任务而设计的技术和工具是基于关系和文本搜索模型的，不容易应用于图像或视频等非结构化数据。在过去的几十年里，研究人员一直试图揭示多媒体的语义，并在各种任务中取得了不断改进的成果，其中以深度学习的最新成功为主导。对象检索模型的局限性促使人们需要支持多模态数据的数据探索方法，比如被结构化属性包围的多媒体。在本文中，我们描述、实现和评估了在多媒体背景下使用多种模式和检索模型的探索方法。我们将该技术应用于电子商务产品搜索和推荐中，并在不同的检索场景下展示了其优势。最后，我们提出了一种利用从图像数据中学习到的潜在视觉属性来扩展数据库模式的方法。这可以通过返回关系数据来完成循环，并可能使一系列工业应用程序受益。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊