首页 > 最新文献

Media Statistika最新文献

英文 中文
PREDICTION OF FOREST FIRE USING NEURAL NETWORKS WITH BACKPROPAGATION LEARNING AND EXREME LEARNING MACHINE APPROACH USING METEOROLOGICAL AND WEATHER INDEX VARIABLES 基于神经网络和反向传播学习的森林火灾预测以及基于气象和天气指标变量的EXREME学习机方法
Pub Date : 2021-12-28 DOI: 10.14710/medstat.14.2.118-124
D. Rosadi, D. Arisanty, D. Agustina
Forest fire is one of important catastrophic events and have great impact on environment, infrastructure and human life. In this study, we discuss the method for prediction of the size of the forest fire using the hybrid approach between Fuzzy-C-Means clustering (FCM) and Neural Networks (NN) classification with backpropagation learning and extreme learning machine approach. For comparison purpose, we consider a similar hybrid approach, i.e., FCM with the classical Support Vector Machine (SVM) classification approach. In the empirical study, we apply the considered methods using several meteorological and Forest Weather Index (FWI) variables. We found that the best approach will be obtained using hybrid FCM-SVM for data training, where the best performance obtains for hybrid FCM-NN-backpropagation for data testing.
森林火灾是一种重要的灾害性事件,对环境、基础设施和人类生活都有重大影响。在本研究中,我们讨论了使用模糊C均值聚类(FCM)和神经网络(NN)分类的混合方法以及反向传播学习和极限学习机器方法预测森林火灾规模的方法。为了进行比较,我们考虑了一种类似的混合方法,即FCM和经典的支持向量机(SVM)分类方法。在实证研究中,我们使用了几个气象和森林天气指数(FWI)变量来应用所考虑的方法。我们发现,使用混合FCM-SVM进行数据训练将获得最佳方法,其中使用混合FCM-NN反向传播进行数据测试将获得最佳性能。
{"title":"PREDICTION OF FOREST FIRE USING NEURAL NETWORKS WITH BACKPROPAGATION LEARNING AND EXREME LEARNING MACHINE APPROACH USING METEOROLOGICAL AND WEATHER INDEX VARIABLES","authors":"D. Rosadi, D. Arisanty, D. Agustina","doi":"10.14710/medstat.14.2.118-124","DOIUrl":"https://doi.org/10.14710/medstat.14.2.118-124","url":null,"abstract":"Forest fire is one of important catastrophic events and have great impact on environment, infrastructure and human life. In this study, we discuss the method for prediction of the size of the forest fire using the hybrid approach between Fuzzy-C-Means clustering (FCM) and Neural Networks (NN) classification with backpropagation learning and extreme learning machine approach. For comparison purpose, we consider a similar hybrid approach, i.e., FCM with the classical Support Vector Machine (SVM) classification approach. In the empirical study, we apply the considered methods using several meteorological and Forest Weather Index (FWI) variables. We found that the best approach will be obtained using hybrid FCM-SVM for data training, where the best performance obtains for hybrid FCM-NN-backpropagation for data testing.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47208874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MODELING OF SEA SURFACE TEMPERATURE BASED ON PARTIAL LEAST SQUARE - STRUCTURAL EQUATION 基于偏最小二乘结构方程的海面温度模拟
Pub Date : 2021-12-28 DOI: 10.14710/medstat.14.2.170-182
M. Miftahuddin, Retno Wahyuni Putri, I. Setiawan, R. S. Oktari
Variability of Sea Surface Temperature (SST) is one of the climatic features that influence global and regional climate dynamics. Missing data (gaps) in the SST dataset are worth investigating since they may statistically alter the value of the SST change. The partial least square-structural equation modeling (PLS-SEM) approach is used in this work to estimate the causality relationships between exogenous and endogenous latent variables. The findings of this study, which are significant indicators that have a loading factor value > 0.7 are as follows: i) sea surface temperature (oC) as a measure of the latent variable changes in SST, ii) wind speed (m/s) and relative humidity (%) as a measure of the latent variable of weather, and iii) air temperature (oC), long-wave solar radiation (w/m2) as a measure of climate latent variables. The size of the Rsquare value is influenced by the number of gaps. The results of the boostrapping show that the latent variables of weather and climate have a significant effect on changes in SST which are indicated by the value of tstatistics > ttabel. The structural model obtained Changes in SST (η) = -0.330 weather + 0.793 climate + ζ. The model shows that the weather has a negative coefficient, which means that the better the weather conditions, the lower the SST changes. Climate has a positive coefficient, which means that the better the climate, the SST changes will also increase. Rising sea surface temperatures caused by an increase in climate can lead to global warming, impacting El-Nino and La-Nina events.
海面温度的变化是影响全球和区域气候动力学的气候特征之一。SST数据集中的缺失数据(缺口)值得研究,因为它们可能会在统计上改变SST变化的值。本文采用偏最小二乘结构方程建模(PLS-SEM)方法来估计外生和内生潜在变量之间的因果关系。这项研究的结果是负荷因子值>0.7的重要指标,如下所示:i)海面温度(oC),作为SST潜在变量变化的衡量标准;ii)风速(m/s)和相对湿度(%),作为天气潜在变量的衡量标准,以及iii)气温(oC),长波太阳辐射(w/m2)作为气候潜在变量的一种度量。Rsquare值的大小受间隙数量的影响。增压结果表明,天气和气候的潜变量对SST的变化有显著影响,其值用tstatistics>ttabel表示。该结构模型得到SST的变化(η)=-0.330天气+0.793气候+ζ。该模型表明,天气具有负系数,这意味着天气条件越好,SST变化越低。气候具有正系数,这意味着气候越好,SST的变化也会增加。气候变化导致的海面温度上升可能导致全球变暖,影响厄尔尼诺和拉尼娜事件。
{"title":"MODELING OF SEA SURFACE TEMPERATURE BASED ON PARTIAL LEAST SQUARE - STRUCTURAL EQUATION","authors":"M. Miftahuddin, Retno Wahyuni Putri, I. Setiawan, R. S. Oktari","doi":"10.14710/medstat.14.2.170-182","DOIUrl":"https://doi.org/10.14710/medstat.14.2.170-182","url":null,"abstract":"Variability of Sea Surface Temperature (SST) is one of the climatic features that influence global and regional climate dynamics. Missing data (gaps) in the SST dataset are worth investigating since they may statistically alter the value of the SST change. The partial least square-structural equation modeling (PLS-SEM) approach is used in this work to estimate the causality relationships between exogenous and endogenous latent variables. The findings of this study, which are significant indicators that have a loading factor value > 0.7 are as follows: i) sea surface temperature (oC) as a measure of the latent variable changes in SST, ii) wind speed (m/s) and relative humidity (%) as a measure of the latent variable of weather, and iii) air temperature (oC), long-wave solar radiation (w/m2) as a measure of climate latent variables. The size of the Rsquare value is influenced by the number of gaps. The results of the boostrapping show that the latent variables of weather and climate have a significant effect on changes in SST which are indicated by the value of tstatistics > ttabel. The structural model obtained Changes in SST (η) = -0.330 weather + 0.793 climate + ζ. The model shows that the weather has a negative coefficient, which means that the better the weather conditions, the lower the SST changes. Climate has a positive coefficient, which means that the better the climate, the SST changes will also increase. Rising sea surface temperatures caused by an increase in climate can lead to global warming, impacting El-Nino and La-Nina events.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44406867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
THE APPLICATION OF THE SEMIPARAMETRIC GSTAR MODEL IN DETERMINING GAMMA-RAY LOG DATA ON SOIL LAYERS 半参数GSTAR模型在确定土层GAMMA-RAY测井资料中的应用
Pub Date : 2021-12-28 DOI: 10.14710/medstat.14.2.108-117
Yundari Yundari, Shantika Martha
This research examines the semiparametric Generalized Space-Time Autoregressive (GSTAR) spacetime modeling and determines its spatial weight. In general, the spatial weights used are uniform, binary weights, and based on the distance, the result is a fixed weight. The GSTAR model is a stochastic model that takes into account its random variables. Thus, it is necessary to study the random spatial weights. This study introduced a new method to estimate the observed value of the GSTAR model semiparametric with a uniform kernel. The data involved the Gamma Ray (GR) log data on four coal drill holes. The semiparametric GSTAR modeling aimed to predict the amount of log GR in the unobserved soil layer based on the observation data information on the layer above it and its surrounding location. The results revealed that semiparametric GSTAR modeling could predict the presence of coal seams and their thickness of drill holes. The results also highlight the validity test on the out-sample data that the error in each borehole results in a small error. In addition, the error tends to approach the actual observed value at a depth of 1 meter down.
本研究考察了半参数广义时空自回归(GSTAR)时空建模,并确定了其空间权重。通常,使用的空间权重是统一的二进制权重,并且基于距离,结果是固定的权重。GSTAR模型是一个考虑其随机变量的随机模型。因此,有必要对随机空间权重进行研究。本文介绍了一种新的方法来估计具有一致核的GSTAR模型的观测值。数据涉及四个煤钻孔的伽马射线测井数据。半参数GSTAR建模旨在根据其上方土层及其周围位置的观测数据信息,预测未观测土层中log GR的数量。结果表明,半参数GSTAR模型可以预测煤层的存在及其钻孔厚度。结果还强调了对样本外数据的有效性测试,即每个钻孔中的误差导致小误差。此外,误差倾向于在向下1米的深度处接近实际观测值。
{"title":"THE APPLICATION OF THE SEMIPARAMETRIC GSTAR MODEL IN DETERMINING GAMMA-RAY LOG DATA ON SOIL LAYERS","authors":"Yundari Yundari, Shantika Martha","doi":"10.14710/medstat.14.2.108-117","DOIUrl":"https://doi.org/10.14710/medstat.14.2.108-117","url":null,"abstract":"This research examines the semiparametric Generalized Space-Time Autoregressive (GSTAR) spacetime modeling and determines its spatial weight. In general, the spatial weights used are uniform, binary weights, and based on the distance, the result is a fixed weight. The GSTAR model is a stochastic model that takes into account its random variables. Thus, it is necessary to study the random spatial weights. This study introduced a new method to estimate the observed value of the GSTAR model semiparametric with a uniform kernel. The data involved the Gamma Ray (GR) log data on four coal drill holes. The semiparametric GSTAR modeling aimed to predict the amount of log GR in the unobserved soil layer based on the observation data information on the layer above it and its surrounding location. The results revealed that semiparametric GSTAR modeling could predict the presence of coal seams and their thickness of drill holes. The results also highlight the validity test on the out-sample data that the error in each borehole results in a small error. In addition, the error tends to approach the actual observed value at a depth of 1 meter down.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46456315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MODELING OF LOCAL POLYNOMIAL KERNEL NONPARAMETRIC REGRESSION FOR COVID DAILY CASES IN SEMARANG CITY, INDONESIA 印度尼西亚三宝朗市covid日病例的局部多项式核非参数回归建模
Pub Date : 2021-12-28 DOI: 10.14710/medstat.14.2.206-215
T. W. Utami, Aisyah Lahdji
Coronavirus disease 2019 (COVID-19) is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) which was recently discovered. Coronavirus disease is now a pandemic that occurs in many countries in the world, one of which is Indonesia. One of the cities in Indonesia that has found many COVID cases is Semarang city, located in Central Java. Data on cases of COVID patients in Semarang City which are measured daily do not form a certain distribution pattern. We can build a model with a flexible statistical approach without any assumptions that must be used, namely the nonparametric regression. The nonparametric regression in this research using Local Polynomial Kernel approach. Determination of the polynomial order and optimal bandwidth in Local Polynomial Kernel Regression modeling use the GCV (Generalized Cross Validation) method. The data used this research are data on the number of COVID patients daily cases in Semarang, Indonesia. Based on the results of the application of the COVID patient daily cases in Semarang City, the optimal bandwidth value is 0.86 and the polynomial order is 4 with the minimum GCV is 3179.568 so that the model estimation results the MSE is 2922.22 and the determination coefficient is 97%. The estimation results show the highest number of Corona in the Semarang City at the beginning of July 2020. After the corona case increased in July, while the corona case in August decreased.
冠状病毒病2019 (COVID-19)是由最近发现的严重急性呼吸综合征冠状病毒2 (SARS-CoV-2)引起的传染病。冠状病毒病现在是一种大流行,在世界上许多国家都有发生,印度尼西亚就是其中之一。位于中爪哇的三宝垄市是印度尼西亚发现许多COVID病例的城市之一。三宝垄市每日统计的新冠肺炎病例数据没有形成一定的分布格局。我们可以用灵活的统计方法建立一个模型,而不需要任何必须使用的假设,即非参数回归。本研究采用局部多项式核方法进行非参数回归。利用GCV(广义交叉验证)方法确定局部多项式核回归模型中的多项式阶数和最优带宽。本研究使用的数据是印度尼西亚三宝垄市每日新增病例数的数据。根据三宝垄市新冠肺炎患者日病例的应用结果,最优带宽值为0.86,多项式阶数为4,最小GCV为3179.568,则模型估计结果的MSE为2922.22,确定系数为97%。估计结果显示,2020年7月初三宝垄市的冠状病毒数量最多。7月冠状病毒病例增加,8月冠状病毒病例减少。
{"title":"MODELING OF LOCAL POLYNOMIAL KERNEL NONPARAMETRIC REGRESSION FOR COVID DAILY CASES IN SEMARANG CITY, INDONESIA","authors":"T. W. Utami, Aisyah Lahdji","doi":"10.14710/medstat.14.2.206-215","DOIUrl":"https://doi.org/10.14710/medstat.14.2.206-215","url":null,"abstract":"Coronavirus disease 2019 (COVID-19) is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) which was recently discovered. Coronavirus disease is now a pandemic that occurs in many countries in the world, one of which is Indonesia. One of the cities in Indonesia that has found many COVID cases is Semarang city, located in Central Java. Data on cases of COVID patients in Semarang City which are measured daily do not form a certain distribution pattern. We can build a model with a flexible statistical approach without any assumptions that must be used, namely the nonparametric regression. The nonparametric regression in this research using Local Polynomial Kernel approach. Determination of the polynomial order and optimal bandwidth in Local Polynomial Kernel Regression modeling use the GCV (Generalized Cross Validation) method. The data used this research are data on the number of COVID patients daily cases in Semarang, Indonesia. Based on the results of the application of the COVID patient daily cases in Semarang City, the optimal bandwidth value is 0.86 and the polynomial order is 4 with the minimum GCV is 3179.568 so that the model estimation results the MSE is 2922.22 and the determination coefficient is 97%. The estimation results show the highest number of Corona in the Semarang City at the beginning of July 2020. After the corona case increased in July, while the corona case in August decreased.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46879883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A STUDY OF GENERALIZED LINEAR MIXED MODEL FOR COUNT DATA USING HIERARCHICAL BAYES METHOD 用层次贝叶斯方法研究计数数据的广义线性混合模型
Pub Date : 2021-12-12 DOI: 10.14710/medstat.14.2.194-205
E. Sunandi, K. Notodiputro, B. Sartono
Poisson Log-Normal Model is one of the hierarchical mixed models that can be used for count data. Several estimation methods can be used to estimate the model parameters. The first objective of this study was to examine the performance of the parameter estimator and model built using the Hierarchical Bayes method via Markov Chain Monte Carlo (MCMC) with simulation. The second objective was applied the Poisson Log-Normal model to the West Java illiteracy Cases data which is sourced from the Susenas data on March 2019. In 2019, the incidence of illiteracy is a very rare occurrence in West Java Province. So that, it is suitable as an application case in this study. The simulation results showed that the Hierarchical Bayes parameter estimator through MCMC has the smallest Root Mean Squared Error of Prediction (RMSEP) value and the absolute bias is relatively mostly similar when compared to the Maximum Likelihood (ML) and Penalized Quasi-Likelihood (PQL) methods. Meanwhile, the empirical results showed that the fixed variable is the number of respondents who have a maximum education of elementary school have the greatest risk of illiteracy. Also, the diversity of census blocks significantly affects illiteracy cases in West Java 2019.
泊松对数正态模型是一种可用于计数数据的分层混合模型。可以使用几种估计方法来估计模型参数。本研究的第一个目的是通过模拟检验使用马尔可夫链蒙特卡罗(MCMC)的分层贝叶斯方法建立的参数估计器和模型的性能。第二个目标是将泊松对数正态模型应用于西爪哇文盲案例数据,该数据来源于2019年3月的Susenas数据。2019年,西爪哇省的文盲率非常罕见。因此,它适合作为本研究的应用案例。仿真结果表明,通过MCMC的分层贝叶斯参数估计器具有最小的预测均方根误差(RMSEP)值,并且与最大似然(ML)和惩罚准似然(PQL)方法相比,绝对偏差相对基本相似。同时,实证结果表明,固定变量是受小学教育程度最高的受访者中文盲风险最大的人数。此外,人口普查区块的多样性显著影响了2019年西爪哇的文盲率。
{"title":"A STUDY OF GENERALIZED LINEAR MIXED MODEL FOR COUNT DATA USING HIERARCHICAL BAYES METHOD","authors":"E. Sunandi, K. Notodiputro, B. Sartono","doi":"10.14710/medstat.14.2.194-205","DOIUrl":"https://doi.org/10.14710/medstat.14.2.194-205","url":null,"abstract":"Poisson Log-Normal Model is one of the hierarchical mixed models that can be used for count data. Several estimation methods can be used to estimate the model parameters. The first objective of this study was to examine the performance of the parameter estimator and model built using the Hierarchical Bayes method via Markov Chain Monte Carlo (MCMC) with simulation. The second objective was applied the Poisson Log-Normal model to the West Java illiteracy Cases data which is sourced from the Susenas data on March 2019. In 2019, the incidence of illiteracy is a very rare occurrence in West Java Province. So that, it is suitable as an application case in this study. The simulation results showed that the Hierarchical Bayes parameter estimator through MCMC has the smallest Root Mean Squared Error of Prediction (RMSEP) value and the absolute bias is relatively mostly similar when compared to the Maximum Likelihood (ML) and Penalized Quasi-Likelihood (PQL) methods. Meanwhile, the empirical results showed that the fixed variable is the number of respondents who have a maximum education of elementary school have the greatest risk of illiteracy. Also, the diversity of census blocks significantly affects illiteracy cases in West Java 2019.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44741861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
COMPARATIVE STUDY OF DISTANCE MEASURES ON FUZZY SUBTRACTIVE CLUSTERING 模糊减法聚类中距离度量的比较研究
Pub Date : 2021-12-12 DOI: 10.14710/medstat.14.2.137-145
Anisa Eka Haryati, Sugiyarto Surono
Clustering is a data analysis process which applied to classify the unlabeled data. Fuzzy clustering is a clustering method based on membership value which enclosing set of fuzzy as a measurement base for classification process. Fuzzy Subtractive Clustering (FSC) is included in one of fuzzy clustering method. This research applies Hamming distance and combined Minkowski Chebysev distance as a distance parameter in Fuzzy Subtractive Clustering. The objective of this research is to compare the output quality of the cluster from Fuzzy Subtractive Clustering by using Hamming distance and combine Minkowski Chebysev distance. The comparison of the two distances aims to see how well the clusters are produced from two different distances. The data used is data on hypertension. The variables used are age, gender, systolic pressure, diastolic pressure, and body weight. This research shows that the Partition Coefficient value resulted on Fuzzy Subtractive Clustering by applying combined Minkowski Chebysev distance is higher than the application of Hamming distance. Based on this, it can be concluded that in this study the quality of the cluster output using the combined Minkowski Chebysev distance is better.
聚类是一种用于对未标记数据进行分类的数据分析过程。模糊聚类是一种基于隶属度值的聚类方法,它将模糊集合作为分类过程的度量基础。模糊减法聚类(FSC)是模糊聚类方法中的一种。本研究将Hamming距离和联合Minkowski Chebysev距离作为模糊减法聚类的距离参数。本研究的目的是利用Hamming距离和结合Minkowski Chebysev距离对模糊减法聚类的聚类输出质量进行比较。比较这两个距离的目的是看在两个不同的距离上产生集群的效果如何。使用的数据是关于高血压的数据。使用的变量包括年龄、性别、收缩压、舒张压和体重。研究表明,采用联合Minkowski Chebysev距离进行模糊减法聚类得到的分割系数值高于采用Hamming距离进行模糊减法聚类得到的分割系数值。基于此,可以得出结论,在本研究中,使用联合Minkowski Chebysev距离的聚类输出质量更好。
{"title":"COMPARATIVE STUDY OF DISTANCE MEASURES ON FUZZY SUBTRACTIVE CLUSTERING","authors":"Anisa Eka Haryati, Sugiyarto Surono","doi":"10.14710/medstat.14.2.137-145","DOIUrl":"https://doi.org/10.14710/medstat.14.2.137-145","url":null,"abstract":"Clustering is a data analysis process which applied to classify the unlabeled data. Fuzzy clustering is a clustering method based on membership value which enclosing set of fuzzy as a measurement base for classification process. Fuzzy Subtractive Clustering (FSC) is included in one of fuzzy clustering method. This research applies Hamming distance and combined Minkowski Chebysev distance as a distance parameter in Fuzzy Subtractive Clustering. The objective of this research is to compare the output quality of the cluster from Fuzzy Subtractive Clustering by using Hamming distance and combine Minkowski Chebysev distance. The comparison of the two distances aims to see how well the clusters are produced from two different distances. The data used is data on hypertension. The variables used are age, gender, systolic pressure, diastolic pressure, and body weight. This research shows that the Partition Coefficient value resulted on Fuzzy Subtractive Clustering by applying combined Minkowski Chebysev distance is higher than the application of Hamming distance. Based on this, it can be concluded that in this study the quality of the cluster output using the combined Minkowski Chebysev distance is better.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42320589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
VARIANCE GAMMA PROCESS WITH MONTE CARLO SIMULATION AND CLOSED FORM APPROACH FOR EUROPEAN CALL OPTION PRICE DETERMINATION 方差伽玛过程与蒙特卡罗模拟和封闭形式方法欧式看涨期权的价格确定
Pub Date : 2021-12-12 DOI: 10.14710/medstat.14.2.183-193
A. Hoyyi, Abdurakhman Abdurakhman, D. Rosadi
The Option is widely applied in the financial sector.  The Black-Scholes-Merton model is often used in calculating option prices on a stock price movement. The model uses geometric Brownian motion which assumes that the data is normally distributed. However, in reality, stock price movements can cause sharp spikes in data, resulting in nonnormal data distribution. So we need a stock price model that is not normally distributed. One of the fastest growing stock price models today is the  process exponential model. The  process has the ability to model data that has excess kurtosis and a longer tail (heavy tail) compared to the normal distribution. One of the members of the  process is the Variance Gamma (VG) process. The VG process has three parameters which each of them, to control volatility, kurtosis and skewness. In this research, the secondary data samples of options and stocks of two companies were used, namely zoom video communications, Inc. (ZM) and Nokia Corporation (NOK).  The price of call options is determined by using closed form equations and Monte Carlo simulation. The Simulation was carried out for various  values until convergent result was obtained.
期权广泛应用于金融领域。Black-Scholes-Merton模型经常用于计算股票价格变动的期权价格。该模型采用几何布朗运动,假设数据是正态分布。然而,在现实中,股票价格的变动会导致数据的急剧飙升,从而导致数据的非正态分布。所以我们需要一个非正态分布的股价模型。目前增长最快的股票价格模型之一是过程指数模型。与正态分布相比,该过程能够对具有过量峰度和较长尾(重尾)的数据进行建模。该流程的成员之一是方差伽马(VG)流程。VG过程有三个参数,分别用于控制挥发性、峰度和偏度。本研究采用zoom video communications, Inc. (ZM)和Nokia Corporation (NOK)两家公司的期权和股票二级数据样本。采用封闭形式方程和蒙特卡罗模拟方法确定看涨期权的价格。对不同的数值进行了模拟,直到得到收敛的结果。
{"title":"VARIANCE GAMMA PROCESS WITH MONTE CARLO SIMULATION AND CLOSED FORM APPROACH FOR EUROPEAN CALL OPTION PRICE DETERMINATION","authors":"A. Hoyyi, Abdurakhman Abdurakhman, D. Rosadi","doi":"10.14710/medstat.14.2.183-193","DOIUrl":"https://doi.org/10.14710/medstat.14.2.183-193","url":null,"abstract":"The Option is widely applied in the financial sector.  The Black-Scholes-Merton model is often used in calculating option prices on a stock price movement. The model uses geometric Brownian motion which assumes that the data is normally distributed. However, in reality, stock price movements can cause sharp spikes in data, resulting in nonnormal data distribution. So we need a stock price model that is not normally distributed. One of the fastest growing stock price models today is the  process exponential model. The  process has the ability to model data that has excess kurtosis and a longer tail (heavy tail) compared to the normal distribution. One of the members of the  process is the Variance Gamma (VG) process. The VG process has three parameters which each of them, to control volatility, kurtosis and skewness. In this research, the secondary data samples of options and stocks of two companies were used, namely zoom video communications, Inc. (ZM) and Nokia Corporation (NOK).  The price of call options is determined by using closed form equations and Monte Carlo simulation. The Simulation was carried out for various  values until convergent result was obtained.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42169991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
UTILIZATION OF STUDENT’S T DISTRIBUTION TO HANDLE OUTLIERS IN TECHNICAL EFFICIENCY MEASUREMENT 利用学生t分布处理技术效率测量中的异常值
Pub Date : 2021-06-30 DOI: 10.14710/medstat.14.1.56-67
R. Zulkarnain, A. Djuraidah, I. Sumertajaya, Indahwati Indahwati
Stochastic frontier analysis (SFA) is the favorite method for measuring technical efficiency. SFA decomposes the error term into noise and inefficiency components. The noise component is generally assumed to have a normal distribution, while the inefficiency component is assumed to have half normal distribution. However, in the presence of outliers, the normality assumption of noise is not sufficient and can produce implausible technical efficiency scores. This paper aims to explore the use of Student’s t distribution for handling outliers in technical efficiency measurement. The model was applied in paddy rice production in East Java. Output variable was the quantity of production, while the input variables were land, seed, fertilizer, labor and capital. To link the output and inputs, Cobb-Douglas or Translog production functions was chosen using likelihood ratio test, where the parameters were estimated using maximum simulated likelihood. Furthermore, the technical efficiency scores were calculated using Jondrow method. The results showed that Student’s t distribution for noise can reduce the outliers in technical efficiency scores. Student’s t distribution revised the extremely high technical efficiency scores downward and the extremely low technical efficiency scores upward. The performance of model was improved after the outliers were handled, indicated by smaller AIC value.
随机前沿分析(SFA)是衡量技术效率的常用方法。SFA将误差项分解为噪声和低效率分量。噪声分量一般假定为正态分布,而低效率分量一般假定为半正态分布。然而,在异常值存在的情况下,噪声的正态性假设是不充分的,可能会产生令人难以置信的技术效率分数。本文旨在探讨在技术效率测量中使用Student 's t分布来处理异常值。该模型在东爪哇水稻生产中得到了应用。产出变量是产量,投入变量是土地、种子、肥料、劳动力和资本。为了连接输出和输入,使用似然比检验选择Cobb-Douglas或Translog生产函数,其中使用最大模拟似然估计参数。采用Jondrow法计算技术效率得分。结果表明,噪声的Student 's t分布可以降低技术效率得分的异常值。学生的t分布向下修正了极高的技术效率分数,向上修正了极低的技术效率分数。对异常值进行处理后,模型的性能得到了提高,AIC值变小。
{"title":"UTILIZATION OF STUDENT’S T DISTRIBUTION TO HANDLE OUTLIERS IN TECHNICAL EFFICIENCY MEASUREMENT","authors":"R. Zulkarnain, A. Djuraidah, I. Sumertajaya, Indahwati Indahwati","doi":"10.14710/medstat.14.1.56-67","DOIUrl":"https://doi.org/10.14710/medstat.14.1.56-67","url":null,"abstract":"Stochastic frontier analysis (SFA) is the favorite method for measuring technical efficiency. SFA decomposes the error term into noise and inefficiency components. The noise component is generally assumed to have a normal distribution, while the inefficiency component is assumed to have half normal distribution. However, in the presence of outliers, the normality assumption of noise is not sufficient and can produce implausible technical efficiency scores. This paper aims to explore the use of Student’s t distribution for handling outliers in technical efficiency measurement. The model was applied in paddy rice production in East Java. Output variable was the quantity of production, while the input variables were land, seed, fertilizer, labor and capital. To link the output and inputs, Cobb-Douglas or Translog production functions was chosen using likelihood ratio test, where the parameters were estimated using maximum simulated likelihood. Furthermore, the technical efficiency scores were calculated using Jondrow method. The results showed that Student’s t distribution for noise can reduce the outliers in technical efficiency scores. Student’s t distribution revised the extremely high technical efficiency scores downward and the extremely low technical efficiency scores upward. The performance of model was improved after the outliers were handled, indicated by smaller AIC value.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48369977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AUTOREGRESSIVE FRACTIONAL INTEGRATED MOVING AVERAGE (ARFIMA) MODEL TO PREDICT COVID-19 PANDEMIC CASES IN INDONESIA 自回归分数积分移动平均(ARFIMA)模型预测印度尼西亚新冠肺炎疫情
Pub Date : 2021-06-25 DOI: 10.14710/medstat.14.1.44-55
P. Kartikasari, H. Yasin, D. A. I. Maruddani
Currently the emergence of the novel coronavirus (Sars-Cov-2), which causes the COVID-19 pandemic and has become a serious health problem because of the high risk causes of death. Therefore, fast and appropriate action is needed to reduce the spread of the COVID-19 pandemic. One of the way is to build a prediction model so that it can be a reference in taking steps to overcome them. Because of the nature of transmission of this disease which is so fast and massive cause extreme data fluctuations and between objects whose observational distances are far enough correlated with each other (long memory). The result of this determination is the best ARFIMA model obtained to predict additional of recovering cases of COVID-19 is (1,0,489.0) with an SMAPE value of 12,44%, while the case of death is (1.0.429.0) with SMAPE value of 13,52%. This shows that the ARFIMA model can accommodate well the long memory effect, resulting in a small bias. Also in estimating model parameters, it is also simpler. For cases of recovery and death, the number is increasing even though the case of death is still very high compared to cases of recovery.
目前,新型冠状病毒(Sars-Cov-2)的出现,引发了COVID-19大流行,并因其高风险死亡原因而成为严重的健康问题。因此,需要采取迅速和适当的行动,以减少COVID-19大流行的传播。其中一种方法是建立一个预测模型,以便它可以作为采取措施克服它们的参考。由于这种疾病传播的性质是如此迅速和大规模,导致极端的数据波动,并且在观测距离足够远的物体之间相互关联(长记忆)。结果表明,ARFIMA模型预测新增病例数为(1,0,489.0),SMAPE值为12.44%;死亡病例数为(1.0.429.0),SMAPE值为13.52%。这表明ARFIMA模型可以很好地适应长记忆效应,导致偏差较小。在估计模型参数方面也更简单。就康复和死亡病例而言,尽管死亡病例与康复病例相比仍然很高,但数量仍在增加。
{"title":"AUTOREGRESSIVE FRACTIONAL INTEGRATED MOVING AVERAGE (ARFIMA) MODEL TO PREDICT COVID-19 PANDEMIC CASES IN INDONESIA","authors":"P. Kartikasari, H. Yasin, D. A. I. Maruddani","doi":"10.14710/medstat.14.1.44-55","DOIUrl":"https://doi.org/10.14710/medstat.14.1.44-55","url":null,"abstract":"Currently the emergence of the novel coronavirus (Sars-Cov-2), which causes the COVID-19 pandemic and has become a serious health problem because of the high risk causes of death. Therefore, fast and appropriate action is needed to reduce the spread of the COVID-19 pandemic. One of the way is to build a prediction model so that it can be a reference in taking steps to overcome them. Because of the nature of transmission of this disease which is so fast and massive cause extreme data fluctuations and between objects whose observational distances are far enough correlated with each other (long memory). The result of this determination is the best ARFIMA model obtained to predict additional of recovering cases of COVID-19 is (1,0,489.0) with an SMAPE value of 12,44%, while the case of death is (1.0.429.0) with SMAPE value of 13,52%. This shows that the ARFIMA model can accommodate well the long memory effect, resulting in a small bias. Also in estimating model parameters, it is also simpler. For cases of recovery and death, the number is increasing even though the case of death is still very high compared to cases of recovery.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45968491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GSTARI-ARCH MODEL AND APPLICATION ON POSITIVE CONFIRMED DATA FOR COVID-19 IN WEST JAVA Gstari-arch模型及其在西爪哇地区COVID-19阳性确诊数据中的应用
Pub Date : 2021-06-07 DOI: 10.14710/medstat.14.2.146-157
M. Alawiyah, D. A. Kusuma, B. N. Ruchjana
Time series model that is commonly used is the Box-Jenkins based time series model. Time series data phenomena based on Box-Jenkins can be combined with spatial data, it is called the space time model One model based on Box-Jenkins model with heterogeneous location characteristics is the Generalized Space Time Autoregressive Integrated (GSTARI) model for a model that assumes data is not stationary or has a trend. This paper discusses the development of the GSTARI model with the assumption that the error variance is not constant which is applied to positive data confirmed by Covid-19 in West Java Province, especially in 4 regencies/cities that have cases in the high category from 6 March 2020 until 31 December 2020. Four regencies/cities are Depok City, Bekasi City, Bekasi Regency, and Karawang Regency. Parameter estimation method for the assumption of non-constant error variance can use Autoregressive Conditional Heteroscedasticity (ARCH) method. GSTARI-ARCH modeling procedure followed three Box-Jenkins stages, namely the identification process, parameter estimation and checking diagnostic. Application of the GSTARI-ARCH Model to Covid-19 positive confirmed data in 4 regencies/cities has a minimum value of RMSE in Bekasi City. The plot of forecast results for the four regencies/cities has a similar pattern to the actual data only applicable for a short time for 1-2 days.
常用的时间序列模型是基于Box-Jenkins的时间序列模型。基于Box-Jenkins的时间序列数据现象可以与空间数据相结合,称为时空模型。基于具有异构位置特征的Box-Jenkins模型的一种模型是假定数据不平稳或有趋势的广义时空自回归集成(GSTARI)模型。本文在假设误差方差不恒定的情况下讨论了GSTARI模型的发展,并将其应用于西爪哇省Covid-19确认的阳性数据,特别是在2020年3月6日至2020年12月31日期间出现高类别病例的4个县/城市。四个摄政/城市是德波市,勿加西市,勿加西摄政和卡拉旺摄政。对于非恒定误差方差假设的参数估计方法,可以采用自回归条件异方差(ARCH)方法。GSTARI-ARCH建模过程分为三个Box-Jenkins阶段,即识别过程、参数估计和检查诊断。将GSTARI-ARCH模型应用于4个县/城市的Covid-19阳性确诊数据,勿加西市的RMSE值最小。四个县/市的预测结果图与实际数据的模式相似,只适用于1-2天的短时间。
{"title":"GSTARI-ARCH MODEL AND APPLICATION ON POSITIVE CONFIRMED DATA FOR COVID-19 IN WEST JAVA","authors":"M. Alawiyah, D. A. Kusuma, B. N. Ruchjana","doi":"10.14710/medstat.14.2.146-157","DOIUrl":"https://doi.org/10.14710/medstat.14.2.146-157","url":null,"abstract":"Time series model that is commonly used is the Box-Jenkins based time series model. Time series data phenomena based on Box-Jenkins can be combined with spatial data, it is called the space time model One model based on Box-Jenkins model with heterogeneous location characteristics is the Generalized Space Time Autoregressive Integrated (GSTARI) model for a model that assumes data is not stationary or has a trend. This paper discusses the development of the GSTARI model with the assumption that the error variance is not constant which is applied to positive data confirmed by Covid-19 in West Java Province, especially in 4 regencies/cities that have cases in the high category from 6 March 2020 until 31 December 2020. Four regencies/cities are Depok City, Bekasi City, Bekasi Regency, and Karawang Regency. Parameter estimation method for the assumption of non-constant error variance can use Autoregressive Conditional Heteroscedasticity (ARCH) method. GSTARI-ARCH modeling procedure followed three Box-Jenkins stages, namely the identification process, parameter estimation and checking diagnostic. Application of the GSTARI-ARCH Model to Covid-19 positive confirmed data in 4 regencies/cities has a minimum value of RMSE in Bekasi City. The plot of forecast results for the four regencies/cities has a similar pattern to the actual data only applicable for a short time for 1-2 days.","PeriodicalId":34146,"journal":{"name":"Media Statistika","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44970229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Media Statistika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1