通过融合文本和视觉特征从社交媒体中挖掘用户兴趣

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) Pub Date : 2016-12-01 DOI:10.1109/APSIPA.2016.7820713

Fang-Yu Chao, Jia Xu, Chia-Wen Lin

{"title":"通过融合文本和视觉特征从社交媒体中挖掘用户兴趣","authors":"Fang-Yu Chao, Jia Xu, Chia-Wen Lin","doi":"10.1109/APSIPA.2016.7820713","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a framework that fuses textual and visual features of user generated social media data to mine the distribution of user interests. The proposed framework consists of three steps: feature extraction, model training, and user interest mining. We choose boards from popular users on Pinterest to collect training and test data. For each pin we extract the term-document matrices as textual features, bag of visual words as low-level visual features, and attributes as mid-level visual features. Representative features are then selected for training topic models using discriminative latent Dirichlet allocation (DLDA). In performance evaluation, pins collected from popular users are used to evaluate the classification accuracy and pins collected from other common users are used to evaluate the recommendation performance. Our experimental results show the efficacy of the proposed method.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mining user interests from social media by fusing textual and visual features\",\"authors\":\"Fang-Yu Chao, Jia Xu, Chia-Wen Lin\",\"doi\":\"10.1109/APSIPA.2016.7820713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a framework that fuses textual and visual features of user generated social media data to mine the distribution of user interests. The proposed framework consists of three steps: feature extraction, model training, and user interest mining. We choose boards from popular users on Pinterest to collect training and test data. For each pin we extract the term-document matrices as textual features, bag of visual words as low-level visual features, and attributes as mid-level visual features. Representative features are then selected for training topic models using discriminative latent Dirichlet allocation (DLDA). In performance evaluation, pins collected from popular users are used to evaluate the classification accuracy and pins collected from other common users are used to evaluate the recommendation performance. Our experimental results show the efficacy of the proposed method.\",\"PeriodicalId\":409448,\"journal\":{\"name\":\"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSIPA.2016.7820713\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2016.7820713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在本文中，我们提出了一个融合用户生成的社交媒体数据的文本和视觉特征来挖掘用户兴趣分布的框架。该框架包括三个步骤:特征提取、模型训练和用户兴趣挖掘。我们从Pinterest上的热门用户中选择板来收集训练和测试数据。对于每个引脚，我们提取术语-文档矩阵作为文本特征，提取视觉词包作为低级视觉特征，提取属性作为中级视觉特征。然后使用判别潜狄利克雷分配(DLDA)为训练主题模型选择代表性特征。在性能评估中，从热门用户收集的pin用于评估分类准确性，从其他普通用户收集的pin用于评估推荐性能。实验结果表明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Mining user interests from social media by fusing textual and visual features

In this paper, we propose a framework that fuses textual and visual features of user generated social media data to mine the distribution of user interests. The proposed framework consists of three steps: feature extraction, model training, and user interest mining. We choose boards from popular users on Pinterest to collect training and test data. For each pin we extract the term-document matrices as textual features, bag of visual words as low-level visual features, and attributes as mid-level visual features. Representative features are then selected for training topic models using discriminative latent Dirichlet allocation (DLDA). In performance evaluation, pins collected from popular users are used to evaluate the classification accuracy and pins collected from other common users are used to evaluate the recommendation performance. Our experimental results show the efficacy of the proposed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

自引率

0.00%

发文量