On the Design of the MITLL Trimodal Dataset for Identity Verification
E. Singer, B. J. Borgstrom, Kenneth Alperin, Trang Nguyen, C. Dagli, Melissa R. Dale, A. Ross
2023 11th International Workshop on Biometrics and Forensics (IWBF), published 2023-04-19
DOI: 10.1109/IWBF57495.2023.10157658
Citations: 0
Abstract
Recent advances in deep learning have led to increased interest in the development of techniques for multimodal identity verification applications, particularly in the area of biometric fusion. Associated with these efforts is a corresponding need for large-scale multimodal datasets to provide the basis for establishing performance baselines for proposed approaches. After examining the characteristics of existing multimodal datasets, this paper describes the development of the MITLL Trimodal dataset, a new triple-modality collection of data comprising parallel samples of audio, image, and text for 553 subjects. The dataset is formed from YouTube videos and Twitter tweets. Baseline single-modality results using a common processing pipeline are presented, along with the results of applying a conventional fusion algorithm to the individual stream scores.
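The abstract does not specify which "conventional fusion algorithm" is applied to the per-modality stream scores. A common baseline for this kind of score-level fusion is a weighted sum of min-max-normalized scores; the sketch below illustrates that generic approach only, and all function names, weights, and example scores are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def minmax_normalize(scores):
    """Map raw scores to [0, 1] so modalities are comparable."""
    lo, hi = scores.min(), scores.max()
    if hi == lo:
        return np.zeros_like(scores)
    return (scores - lo) / (hi - lo)

def fuse_scores(score_lists, weights=None):
    """Weighted-sum fusion of per-modality verification scores.

    score_lists : one array of trial scores per modality
    weights     : per-modality weights (defaults to equal weighting)
    """
    normed = [minmax_normalize(np.asarray(s, dtype=float)) for s in score_lists]
    if weights is None:
        weights = np.ones(len(normed)) / len(normed)
    return sum(w * s for w, s in zip(weights, normed))

# Hypothetical scores for 4 verification trials from three modalities
audio = [0.2, 0.9, 0.4, 0.7]
image = [0.1, 0.8, 0.5, 0.6]
text  = [0.3, 0.7, 0.2, 0.9]
fused = fuse_scores([audio, image, text])
```

In practice the weights would be tuned on a held-out development set, and the fused score compared against a threshold to accept or reject each trial.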