Implementation Of ETL E-Commerce For Customer Clustering Using RFM And K-Means Clustering

F. Alzami, Fikri Diva Sambasri, Rifqi Mulya Kiswanto, Rama Aria Megantara, Ahmad Akrom, R. A. Pramunendar, D. P. Prabowo, Puri Sulistiyawati
{"title":"Implementation Of ETL E-Commerce For Customer Clustering Using RFM And K-Means Clustering","authors":"F. Alzami, Fikri Diva Sambasri, Rifqi Mulya Kiswanto, Rama Aria Megantara, Ahmad Akrom, R. A. Pramunendar, D. P. Prabowo, Puri Sulistiyawati","doi":"10.24843/jim.2022.v10.i03.p05","DOIUrl":null,"url":null,"abstract":"E-commerce is the activity of selling and buying goods through an online system or online. One of the business models in which consumers sell products to other consumers is the Customer to Customer (C2C) business model. One of the things that need to be considered in this business model is knowing the level of customer loyalty. By knowing the level of customer loyalty, the company can provide several different treatments to its customers so that they can maintain good relations with customers and can increase product purchase revenue. In this study, the author wants to segment customers on data in E-commerce companies in Brazil using the K-Means clustering algorithm using the RFM (Recency, Frequency, Monetary) feature. There are also several ETL stages of research that must be carried out, namely taking data from the open public data site (Kaggle), which consist of more than 9 tables (extract), then merging the data to select some data that needs to be used (transform and load), understanding data by displaying it in graphic form, conducting data selection to select features / attributes. which is in accordance with the proposed method, performs data preprocessing, and creates a model to get the cluster. Based on the results of the research that has been done, the number of clusters is 4 clusters with the evaluation value of the model using the silhouette score is 0.470.","PeriodicalId":32334,"journal":{"name":"Jurnal Ilmiah Merpati Menara Penelitian Akademika Teknologi Informasi","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Ilmiah Merpati Menara Penelitian Akademika Teknologi Informasi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24843/jim.2022.v10.i03.p05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

E-commerce is the activity of selling and buying goods through an online system or online. One of the business models in which consumers sell products to other consumers is the Customer to Customer (C2C) business model. One of the things that need to be considered in this business model is knowing the level of customer loyalty. By knowing the level of customer loyalty, the company can provide several different treatments to its customers so that they can maintain good relations with customers and can increase product purchase revenue. In this study, the author wants to segment customers on data in E-commerce companies in Brazil using the K-Means clustering algorithm using the RFM (Recency, Frequency, Monetary) feature. There are also several ETL stages of research that must be carried out, namely taking data from the open public data site (Kaggle), which consist of more than 9 tables (extract), then merging the data to select some data that needs to be used (transform and load), understanding data by displaying it in graphic form, conducting data selection to select features / attributes. which is in accordance with the proposed method, performs data preprocessing, and creates a model to get the cluster. Based on the results of the research that has been done, the number of clusters is 4 clusters with the evaluation value of the model using the silhouette score is 0.470.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于RFM和k -均值聚类的ETL电子商务客户聚类实现
电子商务是通过在线系统或网上买卖商品的活动。消费者向其他消费者销售产品的商业模式之一是客户对客户(C2C)商业模式。在这种商业模式中需要考虑的一件事是了解客户忠诚度的水平。通过了解客户忠诚度的高低,公司可以为客户提供几种不同的待遇,这样他们就可以与客户保持良好的关系,增加产品购买收入。在这项研究中,作者希望利用RFM (recent, Frequency, Monetary)特征,使用K-Means聚类算法对巴西电子商务公司的数据进行客户细分。还有几个必须进行的ETL研究阶段,即从开放的公共数据站点(Kaggle)获取数据,该站点由9个以上的表组成(提取),然后合并数据以选择需要使用的数据(转换和加载),通过图形形式显示数据来理解数据,进行数据选择以选择特征/属性。根据所提出的方法,对数据进行预处理,并建立模型得到聚类。根据已有的研究结果,聚类数量为4个,模型的剪影评分评价值为0.470。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
24 weeks
期刊最新文献
Helpdesk Ticket Classification for Technician Assignment Routes Using BiLSTM Development of Service-Oriented Architecture-Based Microservices Management as a Data Integration Service (Case Study: Udayana University) Implementation of a Supply chain Management System Blockchain-Based in Red Onion Farming Data Visualization Of House Of Worship Distribution In The IKN Nusantara Region Using Python Implementation Enterprise Resource Planning Sales and Purchase of Goods Using WebERP Fushia Clothing Store Denpasar
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1