Design of large-scale Content-based recommender system using hadoop MapReduce framework

2015 Eighth International Conference on Contemporary Computing (IC3) Pub Date : 2015-08-20 DOI:10.1109/IC3.2015.7346697

S. Saravanan

{"title":"Design of large-scale Content-based recommender system using hadoop MapReduce framework","authors":"S. Saravanan","doi":"10.1109/IC3.2015.7346697","DOIUrl":null,"url":null,"abstract":"Nowadays, providing relevant product recommendations to customers plays an important role in retaining customers and improving their shopping experience. Recommender systems can be applied to industries such as an e-commerce, music, online radio, television, hospitality, finance and many more. It is proved over the years that a simple algorithm with a lot of data can always provide better results than a complex algorithm with an inadequate amount of data. To provide better product recommendations, retail businesses have to analyze huge amount of data. As the recommendation system has to analyze huge amount of data to provide better recommendations, it is considered as a data intensive application. Hadoop distributed cluster platform is developed by Apache Software Foundation to address the issues which are involved in designing data intensive applications. In this paper, the improved MapReduce based data preprocessing and Content based recommendation algorithms are proposed and implemented using hadoop framework. Also, graphical user interfaces are developed to interact with the recommender system. Experimental results on Amazon product co-purchasing network metadata show that Hadoop distributed cluster environment is an efficient and scalable platform for implementing large scale recommender system.","PeriodicalId":217950,"journal":{"name":"2015 Eighth International Conference on Contemporary Computing (IC3)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Eighth International Conference on Contemporary Computing (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2015.7346697","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

Nowadays, providing relevant product recommendations to customers plays an important role in retaining customers and improving their shopping experience. Recommender systems can be applied to industries such as an e-commerce, music, online radio, television, hospitality, finance and many more. It is proved over the years that a simple algorithm with a lot of data can always provide better results than a complex algorithm with an inadequate amount of data. To provide better product recommendations, retail businesses have to analyze huge amount of data. As the recommendation system has to analyze huge amount of data to provide better recommendations, it is considered as a data intensive application. Hadoop distributed cluster platform is developed by Apache Software Foundation to address the issues which are involved in designing data intensive applications. In this paper, the improved MapReduce based data preprocessing and Content based recommendation algorithms are proposed and implemented using hadoop framework. Also, graphical user interfaces are developed to interact with the recommender system. Experimental results on Amazon product co-purchasing network metadata show that Hadoop distributed cluster environment is an efficient and scalable platform for implementing large scale recommender system.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于hadoop MapReduce框架的大规模内容推荐系统的设计

如今，为顾客提供相关的产品推荐对于留住顾客和改善顾客的购物体验起着重要的作用。推荐系统可以应用于电子商务、音乐、在线广播、电视、酒店、金融等行业。多年来的事实证明，数据量大的简单算法总是比数据量不足的复杂算法提供更好的结果。为了提供更好的产品推荐，零售企业必须分析大量的数据。由于推荐系统需要分析大量的数据来提供更好的推荐，因此被认为是一个数据密集型应用。Hadoop分布式集群平台是由Apache软件基金会开发的，用于解决设计数据密集型应用程序所涉及的问题。本文提出了改进的基于MapReduce的数据预处理算法和基于内容的推荐算法，并在hadoop框架下实现。此外，还开发了图形用户界面与推荐系统进行交互。在亚马逊产品共购网络元数据上的实验结果表明，Hadoop分布式集群环境是实现大规模推荐系统的高效、可扩展的平台。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 Eighth International Conference on Contemporary Computing (IC3)

自引率

0.00%

发文量