{"title":"GMINN:用于点击率预测的门增强多空间交互神经网络","authors":"Xingyu Feng, Xuekang Yang, Boyun Zhou","doi":"10.1111/coin.12645","DOIUrl":null,"url":null,"abstract":"<p>Click-through rate (CTR) prediction is a pivotal challenge in recommendation systems. Existing models are prone to disturbances from noise and redundant features, hindering their ability to fully capture implicit and higher-order feature interactions present in sparse feature data. Moreover, conventional dual-tower models overlook the significance of layer-level feature interactions. To address these limitations, this article introduces <b>G</b>ate-enhanced <b>M</b>ulti-space <b>I</b>nteractive <b>N</b>eural <b>N</b>etworks (GMINN), a novel model for CTR prediction. GMINN adopts a dual-tower architecture in which a multi-space interaction layer is introduced after each layer in the dual-tower deep neural network. This layer allocates features into multiple subspaces and employs matrix multiplication to establish layer-level interactions between the dual towers. Simultaneously, a field-aware gate mechanism is proposed to extract crucial latent information from the original features. Experimental validation on publicly available datasets, Criteo and Avazu, demonstrates the superiority of the proposed GMINN model. Comparative analyses against baseline models reveal that GMINN substantially improves up to 4.09% in AUC and a maximum reduction of 7.21% in Logloss. Additionally, ablation experiments provide further validation of the effectiveness of GMINN.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 3","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GMINN: Gate-enhanced multi-space interaction neural networks for click-through rate prediction\",\"authors\":\"Xingyu Feng, Xuekang Yang, Boyun Zhou\",\"doi\":\"10.1111/coin.12645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Click-through rate (CTR) prediction is a pivotal challenge in recommendation systems. Existing models are prone to disturbances from noise and redundant features, hindering their ability to fully capture implicit and higher-order feature interactions present in sparse feature data. Moreover, conventional dual-tower models overlook the significance of layer-level feature interactions. To address these limitations, this article introduces <b>G</b>ate-enhanced <b>M</b>ulti-space <b>I</b>nteractive <b>N</b>eural <b>N</b>etworks (GMINN), a novel model for CTR prediction. GMINN adopts a dual-tower architecture in which a multi-space interaction layer is introduced after each layer in the dual-tower deep neural network. This layer allocates features into multiple subspaces and employs matrix multiplication to establish layer-level interactions between the dual towers. Simultaneously, a field-aware gate mechanism is proposed to extract crucial latent information from the original features. Experimental validation on publicly available datasets, Criteo and Avazu, demonstrates the superiority of the proposed GMINN model. Comparative analyses against baseline models reveal that GMINN substantially improves up to 4.09% in AUC and a maximum reduction of 7.21% in Logloss. Additionally, ablation experiments provide further validation of the effectiveness of GMINN.</p>\",\"PeriodicalId\":55228,\"journal\":{\"name\":\"Computational Intelligence\",\"volume\":\"40 3\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/coin.12645\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/coin.12645","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
GMINN: Gate-enhanced multi-space interaction neural networks for click-through rate prediction
Click-through rate (CTR) prediction is a pivotal challenge in recommendation systems. Existing models are prone to disturbances from noise and redundant features, hindering their ability to fully capture implicit and higher-order feature interactions present in sparse feature data. Moreover, conventional dual-tower models overlook the significance of layer-level feature interactions. To address these limitations, this article introduces Gate-enhanced Multi-space Interactive Neural Networks (GMINN), a novel model for CTR prediction. GMINN adopts a dual-tower architecture in which a multi-space interaction layer is introduced after each layer in the dual-tower deep neural network. This layer allocates features into multiple subspaces and employs matrix multiplication to establish layer-level interactions between the dual towers. Simultaneously, a field-aware gate mechanism is proposed to extract crucial latent information from the original features. Experimental validation on publicly available datasets, Criteo and Avazu, demonstrates the superiority of the proposed GMINN model. Comparative analyses against baseline models reveal that GMINN substantially improves up to 4.09% in AUC and a maximum reduction of 7.21% in Logloss. Additionally, ablation experiments provide further validation of the effectiveness of GMINN.
期刊介绍:
This leading international journal promotes and stimulates research in the field of artificial intelligence (AI). Covering a wide range of issues - from the tools and languages of AI to its philosophical implications - Computational Intelligence provides a vigorous forum for the publication of both experimental and theoretical research, as well as surveys and impact studies. The journal is designed to meet the needs of a wide range of AI workers in academic and industrial research.