PRODUCT CLUSTERING ANALYSIS ON THE MARKETPLACE USING K-MEANS APPROACH (CASE STUDY: SHOPEE)

Maria Arista Ulfa, S. Sulistyo, M. Hidayat
{"title":"PRODUCT CLUSTERING ANALYSIS ON THE MARKETPLACE USING K-MEANS APPROACH (CASE STUDY: SHOPEE)","authors":"Maria Arista Ulfa, S. Sulistyo, M. Hidayat","doi":"10.22146/ajse.v5i2.69217","DOIUrl":null,"url":null,"abstract":"The business world has experienced a paradigm shift towards a more modern concept. Many business processes are carried out through the internet or commonly known as e-commerce, by utilizing a platform known as Marketplace. One of the marketplaces that are quite well-known and in great demand in Indonesia is Shopee. The high online shopping activity in the current marketplace indirectly encourages business actors to understand the online market. However, one of the obstacles that are quite often faced by sellers, especially new sellers who are starting to enter the digital realm, is the emergence of confusion in the selection of products to be sold due to a lack of information regarding the demand for what products are in demand in the market.The process of searching for information related to the demand for products of interest is carried out through clustering analysis to find out the groups of products that are of interest to those that are less attractive to the public. The data used is product data from 6 categories in the Shopee market which was taken using web scraping techniques. The clustering processes used the K-means approach by determining the number of K and the optimal center point through the calculation of Sum Square Error (SSE) by looking at the elbow graph. The final results show the optimal number of K clusters that are different in each category, namely in category women’s clothing, men’s clothing, and electronics are at K=4 then for products in the category of Muslim fashion, care & beauty and household appliances are at K=3. Based on the validation results using the Davies Bouldin Index, values were obtained in6 categories, namely 0.391, 0.438, 0.414, 0.357, 0.387, and 0.377, which means that the cluster structure and the level of information formed in each category using the K-Means method is quite good.","PeriodicalId":280593,"journal":{"name":"ASEAN Journal of Systems Engineering","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ASEAN Journal of Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22146/ajse.v5i2.69217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The business world has experienced a paradigm shift towards a more modern concept. Many business processes are carried out through the internet or commonly known as e-commerce, by utilizing a platform known as Marketplace. One of the marketplaces that are quite well-known and in great demand in Indonesia is Shopee. The high online shopping activity in the current marketplace indirectly encourages business actors to understand the online market. However, one of the obstacles that are quite often faced by sellers, especially new sellers who are starting to enter the digital realm, is the emergence of confusion in the selection of products to be sold due to a lack of information regarding the demand for what products are in demand in the market.The process of searching for information related to the demand for products of interest is carried out through clustering analysis to find out the groups of products that are of interest to those that are less attractive to the public. The data used is product data from 6 categories in the Shopee market which was taken using web scraping techniques. The clustering processes used the K-means approach by determining the number of K and the optimal center point through the calculation of Sum Square Error (SSE) by looking at the elbow graph. The final results show the optimal number of K clusters that are different in each category, namely in category women’s clothing, men’s clothing, and electronics are at K=4 then for products in the category of Muslim fashion, care & beauty and household appliances are at K=3. Based on the validation results using the Davies Bouldin Index, values were obtained in6 categories, namely 0.391, 0.438, 0.414, 0.357, 0.387, and 0.377, which means that the cluster structure and the level of information formed in each category using the K-Means method is quite good.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于k -均值方法的市场产品聚类分析(以shopee为例)
商业世界已经经历了向更现代概念的范式转变。许多业务流程都是通过互联网或通常称为电子商务的平台,利用称为Marketplace的平台来执行的。Shopee是印尼一个非常有名且需求量很大的市场。当前市场上的高在线购物活动间接地鼓励了商业参与者了解在线市场。然而,卖家经常面临的障碍之一,尤其是那些开始进入数字领域的新卖家,是由于缺乏市场对什么产品需求的信息,在选择要销售的产品时出现混乱。通过聚类分析对感兴趣的产品需求相关信息进行搜索的过程,找出对公众吸引力较低的产品感兴趣的产品组。使用的数据是Shopee市场上6个类别的产品数据,这些数据是通过网络抓取技术获取的。聚类过程采用K-means方法,通过观察肘形图计算和方误差(Sum Square Error, SSE)来确定K的个数和最优中心点。最终结果表明,每个品类中K类的最优数量是不同的,即女装、男装、电子类的K=4,而穆斯林时尚、护理美容、家电类的K=3。根据Davies Bouldin Index的验证结果,得到了0.391、0.438、0.414、0.357、0.387、0.377 6个类别的值,说明使用K-Means方法在每个类别中形成的聚类结构和信息水平都比较好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FACTOR ANALYSIS OF HEALTHY FOOD PHOTOGRAPH BIOREMEDIATION OF TOFU INDUSTRY LIQUID WASTE USING EFFECTIVE MICROORGANISM-4 (EM4) SOLUTION (CASE STUDY OF TOFU SENTOSA INDUSTRY, YOGYAKARTA) FORECASTING ANALYSIS ON ELECTRICITY DEMAND IN THE SPECIAL REGION OF YOGYAKARTA UNDER THE IMPACT OF THE COVID-19 PANDEMIC POROUS CARBON FROM PINEAPPLE PEEL AS ELECTRODE MATERIAL OF SUPERCAPACITOR IMPLEMENTATION OF IMAGE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORK (CNN) ALGORITHM ON VEHICLES IMAGES
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1