GMINN：用于点击率预测的门增强多空间交互神经网络

IF 1.8 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Computational Intelligence Pub Date : 2024-06-09 DOI:10.1111/coin.12645

Xingyu Feng, Xuekang Yang, Boyun Zhou

{"title":"GMINN：用于点击率预测的门增强多空间交互神经网络","authors":"Xingyu Feng, Xuekang Yang, Boyun Zhou","doi":"10.1111/coin.12645","DOIUrl":null,"url":null,"abstract":"Click-through rate (CTR) prediction is a pivotal challenge in recommendation systems. Existing models are prone to disturbances from noise and redundant features, hindering their ability to fully capture implicit and higher-order feature interactions present in sparse feature data. Moreover, conventional dual-tower models overlook the significance of layer-level feature interactions. To address these limitations, this article introduces Gate-enhanced Multi-space Interactive Neural Networks (GMINN), a novel model for CTR prediction. GMINN adopts a dual-tower architecture in which a multi-space interaction layer is introduced after each layer in the dual-tower deep neural network. This layer allocates features into multiple subspaces and employs matrix multiplication to establish layer-level interactions between the dual towers. Simultaneously, a field-aware gate mechanism is proposed to extract crucial latent information from the original features. Experimental validation on publicly available datasets, Criteo and Avazu, demonstrates the superiority of the proposed GMINN model. Comparative analyses against baseline models reveal that GMINN substantially improves up to 4.09% in AUC and a maximum reduction of 7.21% in Logloss. Additionally, ablation experiments provide further validation of the effectiveness of GMINN.","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 3","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GMINN: Gate-enhanced multi-space interaction neural networks for click-through rate prediction\",\"authors\":\"Xingyu Feng, Xuekang Yang, Boyun Zhou\",\"doi\":\"10.1111/coin.12645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Click-through rate (CTR) prediction is a pivotal challenge in recommendation systems. Existing models are prone to disturbances from noise and redundant features, hindering their ability to fully capture implicit and higher-order feature interactions present in sparse feature data. Moreover, conventional dual-tower models overlook the significance of layer-level feature interactions. To address these limitations, this article introduces Gate-enhanced Multi-space Interactive Neural Networks (GMINN), a novel model for CTR prediction. GMINN adopts a dual-tower architecture in which a multi-space interaction layer is introduced after each layer in the dual-tower deep neural network. This layer allocates features into multiple subspaces and employs matrix multiplication to establish layer-level interactions between the dual towers. Simultaneously, a field-aware gate mechanism is proposed to extract crucial latent information from the original features. Experimental validation on publicly available datasets, Criteo and Avazu, demonstrates the superiority of the proposed GMINN model. Comparative analyses against baseline models reveal that GMINN substantially improves up to 4.09% in AUC and a maximum reduction of 7.21% in Logloss. Additionally, ablation experiments provide further validation of the effectiveness of GMINN.\",\"PeriodicalId\":55228,\"journal\":{\"name\":\"Computational Intelligence\",\"volume\":\"40 3\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/coin.12645\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.12645","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

点击率（CTR）预测是推荐系统中的一项关键挑战。现有的模型容易受到噪声和冗余特征的干扰，无法充分捕捉稀疏特征数据中隐含的高阶特征交互。此外，传统的双塔模型忽视了层级特征交互的重要性。为了解决这些局限性，本文介绍了用于 CTR 预测的新型模型--门增强多空间交互神经网络（GMINN）。GMINN 采用双塔结构，在双塔深度神经网络的每一层之后都引入了一个多空间交互层。该层将特征分配到多个子空间，并利用矩阵乘法在双塔之间建立层级交互。同时，还提出了一种场感知门机制，以从原始特征中提取关键的潜在信息。在公开数据集 Criteo 和 Avazu 上进行的实验验证证明了所提出的 GMINN 模型的优越性。与基线模型的对比分析表明，GMINN 的 AUC 大幅提高了 4.09%，Logloss 最大降低了 7.21%。此外，消融实验进一步验证了 GMINN 的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

GMINN: Gate-enhanced multi-space interaction neural networks for click-through rate prediction

Click-through rate (CTR) prediction is a pivotal challenge in recommendation systems. Existing models are prone to disturbances from noise and redundant features, hindering their ability to fully capture implicit and higher-order feature interactions present in sparse feature data. Moreover, conventional dual-tower models overlook the significance of layer-level feature interactions. To address these limitations, this article introduces Gate-enhanced Multi-space Interactive Neural Networks (GMINN), a novel model for CTR prediction. GMINN adopts a dual-tower architecture in which a multi-space interaction layer is introduced after each layer in the dual-tower deep neural network. This layer allocates features into multiple subspaces and employs matrix multiplication to establish layer-level interactions between the dual towers. Simultaneously, a field-aware gate mechanism is proposed to extract crucial latent information from the original features. Experimental validation on publicly available datasets, Criteo and Avazu, demonstrates the superiority of the proposed GMINN model. Comparative analyses against baseline models reveal that GMINN substantially improves up to 4.09% in AUC and a maximum reduction of 7.21% in Logloss. Additionally, ablation experiments provide further validation of the effectiveness of GMINN.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computational Intelligence 工程技术-计算机：人工智能

CiteScore

6.90

自引率

3.60%

发文量

审稿时长

>12 weeks

期刊介绍： This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.