Constructing Groupings by Use of STATISTICA Software Package

V. S. Fetisov
{"title":"Constructing Groupings by Use of STATISTICA Software Package","authors":"V. S. Fetisov","doi":"10.31767/su.4(83)2018.04.14","DOIUrl":null,"url":null,"abstract":"STATISTICA software package for statistical analysis incorporates a wide range of advanced statistical methods. Quite often they are preceded by aggregating statistical survey data, which main component is their grouping. Although this phase of statistical data processing is relatively simple, the manual process of aggregation can be time-consuming given the need to process large data arrays, not mentioning a high probability of errors. Therefore, the all-purpose STATISTICA software package is a logical and reasonable tool for grouping of data.     \nThe article shows the grouping algorithm in STATISTICA software package, with focus on setup when constructing tables of frequencies of discrete and continual characters. Various options of grouping are scrutinized, with providing examples of their visualization.     \nA large number of STATISTICA parameters offers ample opportunities for constructing user tables, but users often are not aware of these options or do not know how they can be applied. Yet, the apparently simple grouping process in STATISTICA software package can sometimes require the knowledge of fine mechanisms for its setup. The article gives a detailed description of the mechanisms for creating interval margins when applying the parameter “approximate number of intervals”. \nThe standard algorithm for selection is analyzed, allowing a user to limit the number of groups in a grouping. STATISTICA allows for using a number of grouping parameters, enabling to produce more convenient results or filter them. Thus, setting the clicker for label field “Grouping” in the position “Integer Categories” (integer intervals (categories)) initiates the grouping only for integer values of a variable, by excluding the observations containing its fractional values. \nWhen only standard parameters are used, it will be impossible to form uneven or open intervals.  This issue is out of focus in specialized literature and Internet sources. The article shows the algorithm for constructing open intervals by user-set conditions and the process of creating these conditions. This option allows for forming both closed and open intervals by solving all the problems in time of grouping. Because creating such conditions is time consuming, they should be preserved if they are required for further use. \nSetting up of STATISTICA software with missing data is analyzed. Its application will be advisable when a grouping for two or more variables is constructed. In this case, a separate sheet with a grouping is to be created in the worksheet for each variable.      ","PeriodicalId":52812,"journal":{"name":"Statistika Ukrayini","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistika Ukrayini","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31767/su.4(83)2018.04.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

STATISTICA software package for statistical analysis incorporates a wide range of advanced statistical methods. Quite often they are preceded by aggregating statistical survey data, which main component is their grouping. Although this phase of statistical data processing is relatively simple, the manual process of aggregation can be time-consuming given the need to process large data arrays, not mentioning a high probability of errors. Therefore, the all-purpose STATISTICA software package is a logical and reasonable tool for grouping of data.     The article shows the grouping algorithm in STATISTICA software package, with focus on setup when constructing tables of frequencies of discrete and continual characters. Various options of grouping are scrutinized, with providing examples of their visualization.     A large number of STATISTICA parameters offers ample opportunities for constructing user tables, but users often are not aware of these options or do not know how they can be applied. Yet, the apparently simple grouping process in STATISTICA software package can sometimes require the knowledge of fine mechanisms for its setup. The article gives a detailed description of the mechanisms for creating interval margins when applying the parameter “approximate number of intervals”. The standard algorithm for selection is analyzed, allowing a user to limit the number of groups in a grouping. STATISTICA allows for using a number of grouping parameters, enabling to produce more convenient results or filter them. Thus, setting the clicker for label field “Grouping” in the position “Integer Categories” (integer intervals (categories)) initiates the grouping only for integer values of a variable, by excluding the observations containing its fractional values. When only standard parameters are used, it will be impossible to form uneven or open intervals.  This issue is out of focus in specialized literature and Internet sources. The article shows the algorithm for constructing open intervals by user-set conditions and the process of creating these conditions. This option allows for forming both closed and open intervals by solving all the problems in time of grouping. Because creating such conditions is time consuming, they should be preserved if they are required for further use. Setting up of STATISTICA software with missing data is analyzed. Its application will be advisable when a grouping for two or more variables is constructed. In this case, a separate sheet with a grouping is to be created in the worksheet for each variable.      
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用STATISTICA软件包构造分组
用于统计分析的STATISTICA软件包包含了广泛的先进统计方法。通常情况下,在他们之前汇总统计调查数据,其主要组成部分是他们的分组。虽然统计数据处理的这一阶段相对简单,但考虑到需要处理大型数据数组,更不用说错误的高概率,手动聚合过程可能会很耗时。因此,通用的STATISTICA软件包是一个逻辑合理的数据分组工具。本文介绍了STATISTICA软件包中的分组算法,重点介绍了在构造离散和连续字符频率表时的设置。详细介绍了分组的各种选项,并提供了可视化的示例。大量的STATISTICA参数为构造用户表提供了充分的机会,但是用户通常不知道这些选项,或者不知道如何应用它们。然而,STATISTICA软件包中看似简单的分组过程有时需要了解其设置的良好机制。本文详细描述了应用参数“近似区间数”时创建区间边际的机制。分析了选择的标准算法,允许用户限制分组中的组数。STATISTICA允许使用许多分组参数,从而能够生成更方便的结果或过滤它们。因此,将标签字段“Grouping”的点击器设置在“Integer Categories”(整数间隔(类别))位置,通过排除包含其小数值的观测值,只对变量的整数值进行分组。当只使用标准参数时,将不可能形成不均匀或开放的间隔。这个问题在专业文献和网络资源中没有得到关注。本文展示了通过用户设置条件构造开区间的算法以及创建这些条件的过程。此选项允许通过在分组时间内解决所有问题来形成封闭和开放区间。由于创建这样的条件非常耗时,因此如果需要进一步使用,则应保留这些条件。分析了缺失数据下STATISTICA软件的设置问题。当构造两个或多个变量的分组时,它的应用将是可取的。在这种情况下,将在工作表中为每个变量创建具有分组的单独工作表。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
10
审稿时长
12 weeks
期刊最新文献
The Ukrainian Trace on the Way of Development of the International Statistical Institute Information and Analytical Support for the Management of Law Enforcement and Socio-Economic Activities (on the Basis of Methodologies and Practices of Applied Statistics) The Mortality from External Causes: Impact of the COVID-19 Pandemic and the War in Ukraine Interaction of Social Capital Forms in the Structure of Civil Society Networks: Managerial Aspect Counteracting the Risks of International Investment in the Conditions of War
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1