Japanese Journal of Statistics and Data Science最新文献

英文中文

Real world data and data science in medical research: present and future 医学研究中的真实世界数据与数据科学：现状与未来

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-04-13 DOI: 10.1007/s42081-022-00156-0

K. Togo, N. Yonemoto

引用次数: 6

Experience of distance education for project-based learning in data science 远程教育在数据科学项目学习中的经验

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-04-09 DOI: 10.1007/s42081-022-00154-2

Kentaro Sakamaki, Masataka Taguri, Hiromu Nishiuchi, Yoshitomo Akimoto, Kazuyuki Koizumi

引用次数: 2

Correction to: Spatial analysis and visualization of global data on multi-resolution hexagonal grids 更正：多分辨率六边形网格上全球数据的空间分析和可视化

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-04-09 DOI: 10.1007/s42081-022-00157-z

T. Stough, N. Cressie, E. Kang, A. Michalak, K. Sahr

引用次数: 0

Correction to: Statistical data integration in survey sampling: a review 更正：调查抽样中的统计数据整合：综述

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-28 DOI: 10.1007/s42081-022-00152-4

Shu Yang, Jae Kwang Kim

引用次数: 0

Partitioned Design Matrix Method for Two Factors Multivariate Design 两因素多元设计的分块设计矩阵法

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-15 DOI: 10.33369/jsds.v1i1.21010

Renny Alvionita, S. Nugroho, M. Chozin

Factorial experiment often involves large data sets and the use of generalized inverse for the data analysis. It becomes less manageable as the data increased. The objective of this study is to evaluate the accuracy of partitioned design matrix method for two factors multivariate design. The design matrix is partitioned into several sub-matrices based on their source of variation. The partitioned design matrix method in two factors multivariate is much simpler than usual sigma summation method in calculating the sum of product matrix and the degrees of freedom. This method could also be used in explaining the derivation of the statistics for testing the hypothesis of the equality of the means which corresponds to the source of variation.

析因实验通常涉及大数据集，并使用广义逆法对数据进行分析。随着数据的增加，它变得越来越难以管理。本研究的目的是评估分割设计矩阵法在两因素多变量设计中的准确性。设计矩阵根据其变异源划分为若干子矩阵。两因素多元分割设计矩阵法在计算乘积矩阵与自由度之和方面比一般的求和法简单得多。这种方法也可用于解释统计量的推导，以检验与变异源相对应的均值相等的假设。

引用次数: 0

Agglomerative Nesting (AGNES) Method and Divisive Analysis (DIANA) Method For Hierarchical Clustering On Some Distance Measurement Concepts 在一些距离测量概念上的聚类方法的凝聚嵌套法和分裂分析法

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-15 DOI: 10.33369/jsds.v1i1.21009

Susi Wijuniamurti, S. Nugroho, R. Rachmawati

Clustering data through hierarchical approach could be performed by Agglomerative Nesting (AGNES) Method and Divisive Analysis (DIANA) Method. The objective of this research is to compare both the methods based on Euclid and Manhattan distance measurements. Of this research the clustering procedures of agglomerative method are conducted by exploring all techniques including single linkage, complete linkage, average linkage, and Ward. The data used are the National Socio-Economic Survey (SUSENAS) data which are selected specifically for the percentage of over 5 year old residents in each province, for both living in urban or rural, who access the internet in the last 3 months in 2017 but classified according purpose of accessing. By applying Mean Square Error (MSE) for 2 and 3 clusters, it can be concluded that the single linkage technique is the best performance of clustering procedure for both Euclidean and Manhattan distances.

分层聚类数据可采用聚类嵌套法(AGNES)和分裂分析法(DIANA)进行。本研究的目的是比较基于欧几里得和曼哈顿距离测量的两种方法。在本研究中，通过探索单链接、完全链接、平均链接和Ward等所有技术，进行了聚类方法的聚类过程。使用的数据是国家社会经济调查(SUSENAS)数据，这些数据是专门为2017年最后3个月访问互联网的各省5岁以上城市或农村居民的百分比选择的，但根据访问目的进行分类。通过对2类和3类聚类的均方误差(MSE)分析，得出单链接聚类技术在欧氏距离和曼哈顿距离的聚类过程中性能最好的结论。

引用次数: 0

Forecasting A Weekly Red Chilli Price in Bengkulu City Using Autoregressive Integrated Moving Average (ARIMA) and Singular Spectrum Analysis (SSA) Methods 利用自回归综合移动平均(ARIMA)和奇异谱分析(SSA)方法预测明库鲁市红辣椒周价格

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-15 DOI: 10.33369/jsds.v1i1.21007

Novi Putriasari, Sigit Nugroho, R. Rachmawati, Winalia Agwil, Y. O. Sitohang

Red chili occupies a strategic position in the Indonesian economic structure because its use applies to almost all Indonesian dishes. Therefore, controlling the price of red chili is anecessity to maintain national economic stability. The purpose of this research is to forecast a red chili weekly price using ARIMA and SSA based on the weekly data of chili prices from January 2016 - December 2019 sourced from Statistics Indonseia (BPS) Branch Office of Bengkulu Province. The data have been analyzed using software R. Based on MAPE, ARIMA (2,1,2) provides the best forecasting with value 0.49% while SSA 10.64%.

红辣椒在印尼经济结构中占有战略地位，因为它几乎适用于所有印尼菜肴。因此，控制红辣椒的价格是维护国家经济稳定的必要条件。本研究的目的是基于2016年1月至2019年12月印度尼西亚统计局(BPS)明古鲁省分公司提供的每周辣椒价格数据，使用ARIMA和SSA预测红辣椒每周价格。在MAPE基础上，ARIMA(2,1,2)的预测值为0.49%，SSA的预测值为10.64%。

引用次数: 1

Simulation of Sample Determination Quick Count Legislative Elections In Bengkulu City 白古鲁市立法选举的样本测定和快速计数模拟

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-15 DOI: 10.33369/jsds.v1i1.21012

A. Gumilar, Sigit Nugroho, Buyung Keraman

In this research illustrates the simulation of quick count of sampling for the year 2014 Legislative Election in Bengkulu City, which has a data acquisition result for 589 TPS. The problem in this research is how to know the sample size and the right sampling method for Legislative Election in Bengkulu City on Year 2014. The purpose of this research is to know the sample size and the quick count calculation sampling method that can predict the actual vote result for Legislative Election. The method used in the calculation of fast calculation consists of three methods, simple random sampling, cluster random sampling and multistage random sampling. From the population data of 589 polling stations (TPS) into the population, the sample size was taken as much as 120 TPS or about 20% of the population, based on the results of calculations for sample sizes in a limited population. After the sample was selected, a sample simulation of 100 times for each method and simulation results was tested for compatibility with the chi-squared test. Based on the test results, it can be concluded that for sample size 120 TPS taken by simple random sampling method, cluster random sampling or multistage random sampling can predict the actual vote result in Legislative Election Year 2014 in Bengkulu with margin of error 5%. For efficiency consideration simple random sampling method can be selected.

本研究以蚌埠市2014年立法选举为例，进行了抽样快速计数模拟，获得了589个TPS的数据采集结果。本研究的问题是如何知道2014年蚌埠市立法选举的样本量和正确的抽样方法。本研究的目的是了解能够预测立法选举实际投票结果的样本量和快速点票计算抽样方法。快速计算中使用的计算方法包括简单随机抽样、聚类随机抽样和多阶段随机抽样三种方法。从589个投票站(TPS)进入人口的人口数据中，根据有限人口中样本量的计算结果，样本量高达120 TPS，约占人口的20%。样品选定后，对每种方法进行100次的样品模拟，并对模拟结果进行卡方检验的相容性检验。从检验结果可以得出，对于样本量为120 TPS的简单随机抽样方法，整群随机抽样或多阶段随机抽样都可以预测蚌库鲁2014年立法选举年的实际投票结果，误差幅度为5%。出于效率考虑，可以选择简单的随机抽样方法。

{"title":"Simulation of Sample Determination Quick Count Legislative Elections In Bengkulu City","authors":"A. Gumilar, Sigit Nugroho, Buyung Keraman","doi":"10.33369/jsds.v1i1.21012","DOIUrl":"https://doi.org/10.33369/jsds.v1i1.21012","url":null,"abstract":"In this research illustrates the simulation of quick count of sampling for the year 2014 Legislative Election in Bengkulu City, which has a data acquisition result for 589 TPS. The problem in this research is how to know the sample size and the right sampling method for Legislative Election in Bengkulu City on Year 2014. The purpose of this research is to know the sample size and the quick count calculation sampling method that can predict the actual vote result for Legislative Election. The method used in the calculation of fast calculation consists of three methods, simple random sampling, cluster random sampling and multistage random sampling. From the population data of 589 polling stations (TPS) into the population, the sample size was taken as much as 120 TPS or about 20% of the population, based on the results of calculations for sample sizes in a limited population. After the sample was selected, a sample simulation of 100 times for each method and simulation results was tested for compatibility with the chi-squared test. Based on the test results, it can be concluded that for sample size 120 TPS taken by simple random sampling method, cluster random sampling or multistage random sampling can predict the actual vote result in Legislative Election Year 2014 in Bengkulu with margin of error 5%. For efficiency consideration simple random sampling method can be selected.","PeriodicalId":29911,"journal":{"name":"Japanese Journal of Statistics and Data Science","volume":"2 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2022-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82270186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Comparison of Weighted Least Square and Quantile Regression for Solving Heteroscedasticity in Simple Linear Regression 加权最小二乘与分位数回归求解简单线性回归异方差的比较

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-15 DOI: 10.33369/jsds.v1i1.21011

Welly Fransiska, S. Nugroho, R. Rachmawati

Regression analysis is the study of the relationship between dependent variable and one or more independent variables. One of the important assumption that must be fulfilled to get the regression coefficient estimator Best Linear Unbiased Estimator (BLUE) is homoscedasticity. If the homoscedasticity assumption is violated then it is called heteroscedasticity. The consequences of heteroscedasticity are the estimator remain linear and unbiased, but it can cause estimator haven‘t a minimum variance so the estimator is no longer BLUE. The purpose of this study is to analyze and resolve the violation of heteroscedasticity assumption with Weighted Least Square(WLS) and Quantile Regression. Based on the results of the comparison between WLS and Quantile Regression obtained the most precise method used to overcome heteroscedasticity in this research is the WLS method because it produces that is greater (98%).

回归分析是研究因变量与一个或多个自变量之间的关系。得到回归系数估计量最佳线性无偏估计量(BLUE)必须满足的一个重要假设是均方差。如果违背了同方差假设，则称之为异方差。异方差的结果是估计量保持线性和无偏，但它可能导致估计量没有最小方差，因此估计量不再是BLUE。本研究的目的是利用加权最小二乘法和分位数回归分析和解决异方差假设的违反。根据WLS和分位数回归的比较结果得出，本研究中用于克服异方差的最精确的方法是WLS方法，因为它产生的异方差更大(98%)。

引用次数: 0

A k-means method for trends of time series 时间序列趋势的k-均值方法

IF 1.3 Q3 STATISTICS & PROBABILITY

Japanese Journal of Statistics and Data Science

Pub Date : 2022-03-03 DOI: 10.1007/s42081-022-00148-0

N. Watanabe

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Japanese Journal of Statistics and Data Science

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀