Bo Mei, Xiaolu Cheng, Xiaoshuang Xing, Bowu Zhang, Wei Cheng
{"title":"Personal Information Prediction Based on Movie Rating Data","authors":"Bo Mei, Xiaolu Cheng, Xiaoshuang Xing, Bowu Zhang, Wei Cheng","doi":"10.1109/IIKI.2016.84","DOIUrl":null,"url":null,"abstract":"Movies are a major form of entertainment in the US. There are a dozens of websites focusing on movie information. On most of the websites, ratings and reviews from the users play an important role. When a user gives a movie a certain score, the user not only reflects his taste toward that movie but also potentially exposes his personal information. In this paper, we investigated several movie genres. In each genre, movies were classified into different clusters by using expectationmaximization (EM) algorithm. The classification criteria were built upon audience movie rating scores and existing user information. As a result, a new or anonymous users personal information could be predicted when he rated movies on movie-related websites. Moreover, newly released movies could be easily classified into corresponding clusters to assistant user information discovery. The revealed personal information was very useful and could be utilized in different ways such as increasing the accuracy for delivering user-related ads.","PeriodicalId":371106,"journal":{"name":"2016 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIKI.2016.84","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Movies are a major form of entertainment in the US. There are a dozens of websites focusing on movie information. On most of the websites, ratings and reviews from the users play an important role. When a user gives a movie a certain score, the user not only reflects his taste toward that movie but also potentially exposes his personal information. In this paper, we investigated several movie genres. In each genre, movies were classified into different clusters by using expectationmaximization (EM) algorithm. The classification criteria were built upon audience movie rating scores and existing user information. As a result, a new or anonymous users personal information could be predicted when he rated movies on movie-related websites. Moreover, newly released movies could be easily classified into corresponding clusters to assistant user information discovery. The revealed personal information was very useful and could be utilized in different ways such as increasing the accuracy for delivering user-related ads.