{"title":"基于多模态学习的Shopee价格匹配保证算法","authors":"Yaxuan Fang, Junhan Wang, Lei Jia, Fung Wai Kin","doi":"10.1109/CSAIEE54046.2021.9543217","DOIUrl":null,"url":null,"abstract":"Shopee has been a popular online shopping website in the Southeast Asia. Customers appreciate its easy, secure, and fast online shopping experience tailored to their region. At the same time, it allows customers to choose the one with the lower price of the same product. It relies on the product matching, that is the same product with the same description image must be removed. The base technology to achieve this function is multimodal learning, in which we focus on the images and text. In our article, we proposed a new multimodal learning model mainly based on transformer and BERT. For image matching, we use NFNet, Swin_Transformer and Efficientnet to get image embeddings. For text matching, we use Distil-Bert, Albert, Multilingual Bert and TF-IDF to get text embeddings. After we get the embedding vector, we choose KNN to classify. We use cosine and distance to measure the similarity of the different models. It is worth mentioning that the loss function is Arcface, not the traditional Softmax, which improve the difficulty of training to ensure the final performance in the test periods. In addition, 7 models vote for the final results ensuring the effect of prediction. To avoid the bad matching result, we add some postprocessing process.","PeriodicalId":376014,"journal":{"name":"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Shopee Price Match Guarantee Algorithm based on multimodal learning\",\"authors\":\"Yaxuan Fang, Junhan Wang, Lei Jia, Fung Wai Kin\",\"doi\":\"10.1109/CSAIEE54046.2021.9543217\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Shopee has been a popular online shopping website in the Southeast Asia. Customers appreciate its easy, secure, and fast online shopping experience tailored to their region. At the same time, it allows customers to choose the one with the lower price of the same product. It relies on the product matching, that is the same product with the same description image must be removed. The base technology to achieve this function is multimodal learning, in which we focus on the images and text. In our article, we proposed a new multimodal learning model mainly based on transformer and BERT. For image matching, we use NFNet, Swin_Transformer and Efficientnet to get image embeddings. For text matching, we use Distil-Bert, Albert, Multilingual Bert and TF-IDF to get text embeddings. After we get the embedding vector, we choose KNN to classify. We use cosine and distance to measure the similarity of the different models. It is worth mentioning that the loss function is Arcface, not the traditional Softmax, which improve the difficulty of training to ensure the final performance in the test periods. In addition, 7 models vote for the final results ensuring the effect of prediction. To avoid the bad matching result, we add some postprocessing process.\",\"PeriodicalId\":376014,\"journal\":{\"name\":\"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSAIEE54046.2021.9543217\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSAIEE54046.2021.9543217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Shopee Price Match Guarantee Algorithm based on multimodal learning
Shopee has been a popular online shopping website in the Southeast Asia. Customers appreciate its easy, secure, and fast online shopping experience tailored to their region. At the same time, it allows customers to choose the one with the lower price of the same product. It relies on the product matching, that is the same product with the same description image must be removed. The base technology to achieve this function is multimodal learning, in which we focus on the images and text. In our article, we proposed a new multimodal learning model mainly based on transformer and BERT. For image matching, we use NFNet, Swin_Transformer and Efficientnet to get image embeddings. For text matching, we use Distil-Bert, Albert, Multilingual Bert and TF-IDF to get text embeddings. After we get the embedding vector, we choose KNN to classify. We use cosine and distance to measure the similarity of the different models. It is worth mentioning that the loss function is Arcface, not the traditional Softmax, which improve the difficulty of training to ensure the final performance in the test periods. In addition, 7 models vote for the final results ensuring the effect of prediction. To avoid the bad matching result, we add some postprocessing process.