Space semantic aware loss function for embedding creation in case of transaction data

Q4 Mathematics Zhurnal Belorusskogo Gosudarstvennogo Universiteta. Matematika. Informatika Pub Date : 2022-04-14 DOI:10.33581/2520-6508-2022-1-97-102

M. Vatkin, D. A. Vorobey

引用次数: 0

Abstract

Transaction data are the most popular data type of bank domain, they are often represented as sparse vectors with a large number of features. Using sparse vectors in deep learning tasks is computationally inefficient and may lead to overfitting. Аutoencoders are widely applied to extract new useful features in a lower dimensional space. In this paper we propose to use a novel loss function based on the metric that estimates the quality of mapping the semantic structure of the original tabular data to the embedded space. The proposed loss function allows preserving the item relation structure of the original space during the dimension reduction transformation. The obtained results show the improvement of the resulting embedding properties while using the combination of the new loss function and the traditional mean squared error one.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

事务数据情况下嵌入创建的空间语义感知损失函数

交易数据是银行领域最常用的数据类型，它们通常被表示为具有大量特征的稀疏向量。在深度学习任务中使用稀疏向量计算效率低下，并且可能导致过拟合。Аutoencoders被广泛应用于在低维空间中提取新的有用特征。在本文中，我们提出使用一种新的基于度量的损失函数来估计原始表格数据的语义结构映射到嵌入空间的质量。所提出的损失函数允许在降维变换中保留原始空间的项目关系结构。结果表明，将新的损失函数与传统的均方误差函数结合使用后，得到的嵌入性能得到了改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊