A Fast Clustering Algorithm for Analyzing Big Data Generated in Ubiquitous Sensor Networks

O. Zahwe, O. Majed, Hassan Harb, Mohamad Hamze, A. Nasser
{"title":"A Fast Clustering Algorithm for Analyzing Big Data Generated in Ubiquitous Sensor Networks","authors":"O. Zahwe, O. Majed, Hassan Harb, Mohamad Hamze, A. Nasser","doi":"10.1109/ACIT.2018.8672680","DOIUrl":null,"url":null,"abstract":"Recently, The emergence of wireless sensor networks (WSNs) plays a major role in the rise of big data as thousands of their applications collect huge amounts of data that require processing. Consequently, WSN faces two major challenges. First, it handles the big data collection, and second, the energy of sensors will be depleted quickly due to the huge volume of data collection and transmission. Hence, current research has been focused on data classification as an efficient technique to reduce big data collection in WSNs thus enhancing their lifetime. This paper proposes a fast data clustering technique called FKmeans, i.e. Fast Kmeans, dedicated to periodic applications in WSNs. FKmeans consists of two stage algorithm and aims to enhance the time cost of distance calculation of traditional Kmeans algorithm thus, ensure fast data delivery to the sink node. The first stage, i.e. center selection stage, selects a small portion of datasets in order to find the best possible location of the centers. The second stage, i.e. cluster formation stage, uses the traditional Kmeans algorithm adopted to the Euclidean distance where the initial centers used are taken from the first stage. Our proposed technique is validated via simulations on real sensor data and comparison with the traditional Kmeans algorithm. The obtained results show the effectiveness of our technique in terms of improving the energy consumption and data delivery delay, without loss in data fidelity.","PeriodicalId":443170,"journal":{"name":"2018 International Arab Conference on Information Technology (ACIT)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Arab Conference on Information Technology (ACIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIT.2018.8672680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Recently, The emergence of wireless sensor networks (WSNs) plays a major role in the rise of big data as thousands of their applications collect huge amounts of data that require processing. Consequently, WSN faces two major challenges. First, it handles the big data collection, and second, the energy of sensors will be depleted quickly due to the huge volume of data collection and transmission. Hence, current research has been focused on data classification as an efficient technique to reduce big data collection in WSNs thus enhancing their lifetime. This paper proposes a fast data clustering technique called FKmeans, i.e. Fast Kmeans, dedicated to periodic applications in WSNs. FKmeans consists of two stage algorithm and aims to enhance the time cost of distance calculation of traditional Kmeans algorithm thus, ensure fast data delivery to the sink node. The first stage, i.e. center selection stage, selects a small portion of datasets in order to find the best possible location of the centers. The second stage, i.e. cluster formation stage, uses the traditional Kmeans algorithm adopted to the Euclidean distance where the initial centers used are taken from the first stage. Our proposed technique is validated via simulations on real sensor data and comparison with the traditional Kmeans algorithm. The obtained results show the effectiveness of our technique in terms of improving the energy consumption and data delivery delay, without loss in data fidelity.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
泛在传感器网络大数据分析的快速聚类算法
最近,无线传感器网络(wsn)的出现在大数据的兴起中发挥了重要作用,因为成千上万的应用程序收集了大量需要处理的数据。因此,无线传感器网络面临两大挑战。首先,它处理大数据收集,其次,由于数据的大量收集和传输,传感器的能量会很快耗尽。因此,目前的研究重点是将数据分类作为一种有效的技术来减少无线传感器网络的大数据收集,从而提高其使用寿命。本文提出了一种快速数据聚类技术,称为FKmeans,即fast Kmeans,用于无线传感器网络的周期性应用。FKmeans由两阶段算法组成,旨在提高传统Kmeans算法计算距离的时间成本,从而保证数据快速传递到汇聚节点。第一阶段,即中心选择阶段,选择一小部分数据集,以找到中心的最佳可能位置。第二阶段,即聚类形成阶段,使用传统的Kmeans算法对欧几里得距离进行计算,其中使用的初始中心取自第一阶段。通过对真实传感器数据的仿真和与传统Kmeans算法的比较,验证了我们提出的技术。得到的结果表明,我们的技术在提高能耗和数据传输延迟方面是有效的,而数据保真度没有损失。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Feature Selection for Android Keystroke Dynamics Proposed Method for Automatic Segmentation of Medical Images Feature-Based Opinion Summarization for Arabic Reviews Arabic Semantic Similarity Approaches - Review IoT: Architecture, Challenges, and Solutions Using Fog Network and Application Classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1