使用K Means和R Shiny对灾害准备进行可视化和聚类:印度尼西亚省级灾害、医务人员和卫生设施数据的案例研究

R. Kusumawardani, I. Hafidz, Septa Firmansyah Putra
{"title":"使用K Means和R Shiny对灾害准备进行可视化和聚类:印度尼西亚省级灾害、医务人员和卫生设施数据的案例研究","authors":"R. Kusumawardani, I. Hafidz, Septa Firmansyah Putra","doi":"10.1109/ICTS.2016.7910295","DOIUrl":null,"url":null,"abstract":"The open data movement has led us into immensely useful applications and innovations for decision making, both for individual citizen as well as government. This study aims to create a web application called BencanaVis which provide innovative visualization of disaster government open data using Shiny, a web framework from R programming language. The datasets being used are available from Indonesian National Disaster Management Authority agency (or BNPB), the official Indonesian Open Data government portal and the Indonesian National Statistical Bureau (or BPS) website. We create three types of scenarios or experiments for the dataset. After that, we normalize the data using min-max use normalization. Then, we employ PCA (principal component analysis) to reduce feature dimensionality. Furthermore, we apply K-Means clustering techniques and calculate the cluster validity using Sum of Square Error (SSE), Davis-Bouldin Index (DBI), Dunn Index, Connectivity Index and Silhouettes Index. The cluster member from optimal number of k are then being analyzed to create a score for disaster readiness. We shall analyze this disaster readiness using the scoring produced by weighting the attributes values with weights from the AHP methods. Furthermore, we provide two visualizations; they are 3D scatter plot and cluster distribution using leaflet library from R. There are two other visualizations provided in the web application use heatmap and streamgraph library. The heatmap visualization shows the pattern distribution of all attributes and streamgraph visualization which refers to stacked area chart shows the number of 21 types disaster which recorded from BNPB data in 16 years during the year 2000 – 2016.","PeriodicalId":177275,"journal":{"name":"2016 International Conference on Information & Communication Technology and Systems (ICTS)","volume":"179 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"BencanaVis visualization and clustering of disaster readiness using K Means with R Shiny A case study for Disaster, Medical Personnel and Health Facilities data at Province level in Indonesia\",\"authors\":\"R. Kusumawardani, I. Hafidz, Septa Firmansyah Putra\",\"doi\":\"10.1109/ICTS.2016.7910295\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The open data movement has led us into immensely useful applications and innovations for decision making, both for individual citizen as well as government. This study aims to create a web application called BencanaVis which provide innovative visualization of disaster government open data using Shiny, a web framework from R programming language. The datasets being used are available from Indonesian National Disaster Management Authority agency (or BNPB), the official Indonesian Open Data government portal and the Indonesian National Statistical Bureau (or BPS) website. We create three types of scenarios or experiments for the dataset. After that, we normalize the data using min-max use normalization. Then, we employ PCA (principal component analysis) to reduce feature dimensionality. Furthermore, we apply K-Means clustering techniques and calculate the cluster validity using Sum of Square Error (SSE), Davis-Bouldin Index (DBI), Dunn Index, Connectivity Index and Silhouettes Index. The cluster member from optimal number of k are then being analyzed to create a score for disaster readiness. We shall analyze this disaster readiness using the scoring produced by weighting the attributes values with weights from the AHP methods. Furthermore, we provide two visualizations; they are 3D scatter plot and cluster distribution using leaflet library from R. There are two other visualizations provided in the web application use heatmap and streamgraph library. The heatmap visualization shows the pattern distribution of all attributes and streamgraph visualization which refers to stacked area chart shows the number of 21 types disaster which recorded from BNPB data in 16 years during the year 2000 – 2016.\",\"PeriodicalId\":177275,\"journal\":{\"name\":\"2016 International Conference on Information & Communication Technology and Systems (ICTS)\",\"volume\":\"179 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Information & Communication Technology and Systems (ICTS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTS.2016.7910295\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Information & Communication Technology and Systems (ICTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTS.2016.7910295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

开放数据运动已经引导我们进入了非常有用的应用程序和决策创新,无论是对个人公民还是政府。本研究旨在创建一个名为BencanaVis的web应用程序,该应用程序使用Shiny(来自R编程语言的web框架)提供灾难政府开放数据的创新可视化。所使用的数据集可从印度尼西亚国家灾害管理局(或BNPB)、印度尼西亚官方开放数据政府门户网站和印度尼西亚国家统计局(或BPS)网站获得。我们为数据集创建了三种类型的场景或实验。之后,我们使用最小最大使用归一化对数据进行归一化。然后,我们使用主成分分析(PCA)来降维特征。此外,我们采用K-Means聚类技术,并使用平方误差和(SSE)、Davis-Bouldin指数(DBI)、Dunn指数、连通性指数和轮廓指数计算聚类有效性。然后分析k个最优数量的集群成员,以创建灾难准备得分。我们将使用由AHP方法中的权重对属性值进行加权所产生的评分来分析这种灾难准备情况。此外,我们提供了两种可视化;它们是3D散点图和聚类分布,使用r的传单库。web应用程序中还提供了另外两个可视化工具,使用热图和流图库。热图可视化显示了所有属性的模式分布,流图可视化是指堆叠面积图,显示了2000 - 2016年16年间BNPB数据记录的21种灾害的数量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
BencanaVis visualization and clustering of disaster readiness using K Means with R Shiny A case study for Disaster, Medical Personnel and Health Facilities data at Province level in Indonesia
The open data movement has led us into immensely useful applications and innovations for decision making, both for individual citizen as well as government. This study aims to create a web application called BencanaVis which provide innovative visualization of disaster government open data using Shiny, a web framework from R programming language. The datasets being used are available from Indonesian National Disaster Management Authority agency (or BNPB), the official Indonesian Open Data government portal and the Indonesian National Statistical Bureau (or BPS) website. We create three types of scenarios or experiments for the dataset. After that, we normalize the data using min-max use normalization. Then, we employ PCA (principal component analysis) to reduce feature dimensionality. Furthermore, we apply K-Means clustering techniques and calculate the cluster validity using Sum of Square Error (SSE), Davis-Bouldin Index (DBI), Dunn Index, Connectivity Index and Silhouettes Index. The cluster member from optimal number of k are then being analyzed to create a score for disaster readiness. We shall analyze this disaster readiness using the scoring produced by weighting the attributes values with weights from the AHP methods. Furthermore, we provide two visualizations; they are 3D scatter plot and cluster distribution using leaflet library from R. There are two other visualizations provided in the web application use heatmap and streamgraph library. The heatmap visualization shows the pattern distribution of all attributes and streamgraph visualization which refers to stacked area chart shows the number of 21 types disaster which recorded from BNPB data in 16 years during the year 2000 – 2016.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Detecting source code plagiarism on introductory programming course assignments using a bytecode approach Indonesia scholarship selection framework using fuzzy inferences system approach: Case study: “Bidik Misi” scholarship selection Face recognition based on Extended Symmetric Local Graph Structure Early detection study of Kidney Organ Complication caused by Diabetes Mellitus using iris image color constancy Heart murmurs extraction using the complete Ensemble Empirical Mode Decomposition and the Pearson distance metric
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1