Based on the academic performance grades of university students, various high-stakes decisions are made, including determinations of pass/fail status, the awarding of diplomas, and eligibility for placement in graduate education programs. According to the criteria used, assessments are divided into two types: criterion-referenced and norm-referenced. An examination of the grading systems of state universities in Turkish higher education shows that some universities use criterion-referenced assessment, some use norm-referenced assessment, and some use both. The purpose of this research is to examine whether inter-university grading systems show significant concordance in the context of university students' letter grades; in other words, to reveal whether there is skew in the grading systems of public universities. In this context, 250 individuals were simulated such that their class/group achievement level followed a normal distribution. Among the public universities in the 2021-2022 University Ranking by Academic Performance (URAP), four state universities were selected, one from each quartile (first, second, third, and last). Each simulated student's academic success grade was converted into the letter grade it would receive at each of these universities, and it was examined whether the resulting letter grades showed significant concordance. The study concluded that, in the context of university students' letter grades, inter-university grading systems generally do not show significant concordance. The findings are expected to contribute to the work of the Council of Higher Education and the University Education Commissions.
Recep Gür & Mustafa Köroğlu, "The complexity of the grading system in Turkish higher education," International Journal of Assessment Tools in Education, 2023-12-23. doi:10.21449/ijate.1266808
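The two grading approaches the abstract contrasts can be illustrated with a short sketch: criterion-referenced grading maps a score to a letter via fixed cut-offs, while norm-referenced grading maps it via the score's standing within the group. The cut-off table and z-score bands below are illustrative assumptions, not any university's actual grading scheme.

```python
import statistics

def criterion_referenced(score,
                         cutoffs=((90, "AA"), (80, "BA"), (70, "BB"),
                                  (60, "CB"), (50, "CC"))):
    """Map a 0-100 score to a letter grade using fixed cut-off points
    (the table is a hypothetical example)."""
    for cutoff, letter in cutoffs:
        if score >= cutoff:
            return letter
    return "FF"

def norm_referenced(scores):
    """Map each score in a group to a letter grade from its z-score
    (the z bands are hypothetical)."""
    mean = statistics.mean(scores)
    sd = statistics.pstdev(scores)
    def letter(z):
        if z >= 1.5:
            return "AA"
        if z >= 0.5:
            return "BA"
        if z >= -0.5:
            return "BB"
        if z >= -1.5:
            return "CC"
        return "FF"
    return [letter((s - mean) / sd) for s in scores]
```

Under these assumed rules the same score of 85 earns "BA" criterion-referenced, but its norm-referenced letter depends entirely on the rest of the class, which is exactly the source of the discordance the study investigates.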
Global competence is a comprehensive term referring to the interconnectedness of various constructs, ranging from knowledge to values, required to communicate, cooperate, and work towards the well-being of not only the local but also the global community. Teacher education has an important role in preparing teachers equipped with global competences. Therefore, tools that can validly and reliably measure whether, and to what extent, pre-service teachers are globally competent are a requisite. Hence, this study aimed at adapting the Global Competence scale developed by Liu et al. (2020) into Turkish to measure pre-service English language teachers’ global competences, and at obtaining evidence on the psychometric properties of the scale for measuring global competences in teaching and teacher education. The data collected from pre-service English language teachers (N=351) studying at various universities in Türkiye were divided into two equal halves. The first half was used for exploratory factor analysis, which revealed an eight-factor, 29-item structure. The second half, used for confirmatory factor analysis, yielded a good fit for a 25-item, eight-factor structure. Cronbach’s alpha (α = .88) and McDonald’s omega (ω = .89) indicated good internal consistency in the CFA dataset, and excellent internal consistency (α = .90, ω = .91) in a further independent dataset. Thus, the study showed that the Global Competence scale has sound psychometric properties and reliability for measuring pre-service English language teachers’ global competences in the Turkish context.
İsmail Emre Köş & H. Çelik, "Global competence scale: An adaptation to measure pre-service English teachers’ global competences," International Journal of Assessment Tools in Education, 2023-12-23. doi:10.21449/ijate.1260245
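The internal-consistency coefficients reported above have standard closed forms; as a minimal sketch, Cronbach's alpha can be computed from the item variances and the variance of the total score (each inner list below is one item's scores across respondents, invented for illustration):

```python
def cronbach_alpha(items):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / variance of totals).

    items: list of k item-score columns, each a list of n respondent scores.
    """
    k = len(items)
    n = len(items[0])

    def var(xs):  # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)

    item_vars = sum(var(col) for col in items)
    totals = [sum(items[i][p] for i in range(k)) for p in range(n)]
    return k / (k - 1) * (1 - item_vars / var(totals))
```

For two perfectly parallel items (identical columns), the formula returns exactly 1.0, which is a quick sanity check for any implementation.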
The proliferation of large language models represents a paradigm shift in the landscape of automated essay scoring (AES) systems, fundamentally elevating their accuracy and efficacy. This study presents an extensive examination of large language models, with a particular emphasis on the transformative influence of transformer-based models, such as BERT, mBERT, LaBSE, and GPT, in augmenting the accuracy of multilingual AES systems. The exploration of these advancements within the context of the Turkish language serves as a compelling illustration of the potential for harnessing large language models to elevate AES performance in low-resource linguistic environments. Our study provides valuable insights for the ongoing discourse on the intersection of artificial intelligence and educational assessment.
Tahereh Firoozi, Okan Bulut & Mark J. Gierl, "Language Models in Automated Essay Scoring: Insights for the Turkish Language," International Journal of Assessment Tools in Education, 2023-12-17. doi:10.21449/ijate.1394194
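AES systems such as those surveyed above are typically evaluated by their score agreement with human raters, most often with quadratic weighted kappa (QWK). A minimal sketch of QWK on an integer score scale (the scale bounds are assumptions for illustration):

```python
def quadratic_weighted_kappa(rater_a, rater_b, min_score, max_score):
    """QWK: 1 - (weighted observed disagreement / weighted expected disagreement),
    with quadratic weights w_ij = (i - j)^2 / (n - 1)^2."""
    n = max_score - min_score + 1
    obs = [[0.0] * n for _ in range(n)]
    for a, b in zip(rater_a, rater_b):
        obs[a - min_score][b - min_score] += 1
    total = len(rater_a)
    hist_a = [sum(row) for row in obs]                       # rater A marginals
    hist_b = [sum(obs[i][j] for i in range(n)) for j in range(n)]  # rater B marginals
    num = den = 0.0
    for i in range(n):
        for j in range(n):
            w = (i - j) ** 2 / (n - 1) ** 2
            num += w * obs[i][j]
            den += w * hist_a[i] * hist_b[j] / total  # chance-expected counts
    return 1 - num / den
```

Perfect agreement gives QWK = 1, chance-level agreement gives 0, and the quadratic weights penalize a two-point scoring error four times as heavily as a one-point error.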
The main purpose of this study is to examine the Type I error and statistical power rates of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted using the Mantel-Haenszel (MH), Logistic Regression (LR), Lord’s χ2, and Raju’s Area Measures techniques. In the simulation-based research model, the two-parameter item response model, the groups’ ability distributions, and the DIF type were fixed conditions, while sample size (1800, 3000), sample size ratio (0.50, 1), test length (20, 80), and rate of DIF-containing items (0, 0.05, 0.10) were manipulated conditions. The total number of conditions is 24 (2×2×2×3), and statistical analysis was performed in the R software. The study found that the Type I error rates in all conditions were higher than the nominal error level. It also showed that MH had the highest error rate while Raju’s Area Measures had the lowest, and that MH produced the highest statistical power rates. Analysis of the Type I error and statistical power findings illustrated that techniques based on both theories performed better with the sample size of 1800. Furthermore, the increase in sample size affected techniques based on CTT more than those based on IRT. The findings also demonstrated that the techniques’ Type I error rates were lower, and their statistical power rates higher, under conditions where the test length was 80 and the sample sizes were not equal.
Ayşe Bilicioğlu Güneş & Bayram Biçak, "Type I error and power rates: A comparative analysis of techniques in differential item functioning," International Journal of Assessment Tools in Education, 2023-12-09. doi:10.21449/ijate.1368341
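Of the techniques compared above, Mantel-Haenszel has the simplest closed form: examinees are stratified by matched total score, a 2×2 table (group × correct/incorrect) is built per stratum, and the common odds ratio is transformed to the ETS delta scale. A minimal sketch, with the per-stratum tuple layout as an assumed input format:

```python
import math

def mantel_haenszel_delta(strata):
    """MH DIF statistic on the ETS delta scale.

    strata: list of (ref_correct, ref_wrong, foc_correct, foc_wrong)
    tuples, one per matched score level.
    """
    num = den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue
        num += a * d / n
        den += b * c / n
    alpha = num / den            # MH common odds ratio
    return -2.35 * math.log(alpha)  # |delta| >= 1.5 marks ETS "C"-level DIF
```

When the odds of success are equal in both groups at every score level, alpha is 1 and delta is 0; a large negative delta indicates the item favors the reference group.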
Mathematics, by its nature, has a spiral structure: when students fail to attain one achievement in the mathematics course, subsequent achievements are negatively affected, so students' failures in mathematics compound over time. In this context, it is important to identify and eliminate students’ weaknesses, if any, through formative assessment during the learning process. The present study aimed to improve students' level of success through a formative assessment-based teaching practice on the circle and circular region in seventh-grade secondary school mathematics, and to implement an exemplary application of formative assessment-based teaching. Since the study sought to evaluate students' learning process and eliminate the weaknesses identified along the way, an action research design, one of the qualitative research methods, was used. The study was carried out with 34 seventh-grade secondary school students. Data were collected through observation, interviews, and the various formative assessment tools used in the process, and were analyzed descriptively. At the end of the three-week implementation period, substantial improvement was observed in the achievements of low- and medium-achieving students, and the overall success of the students changed positively.
Hilal Özcan & Aytaç Kurtulus, "An examplary application of mathematics teaching based on formative assessment," International Journal of Assessment Tools in Education, 2023-11-20. doi:10.21449/ijate.1352605
The increasing volume of large-scale assessment data poses a challenge for testing organizations seeking to manage data and conduct psychometric analysis efficiently. Traditional psychometric software presents barriers, such as a lack of functionality for managing data and for conducting various standard psychometric analyses efficiently. These challenges have resulted in high costs to achieve the desired research and analysis outcomes. To address them, we have designed and implemented a modernized data pipeline that allows psychometricians and statisticians to efficiently manage the data, conduct psychometric analysis, generate technical reports, and perform quality assurance to validate the required outputs. This modernized pipeline has proven to scale with large databases, decrease human error by reducing manual processes, make complex workloads efficiently repeatable, ensure high quality of the outputs, and reduce the overall costs of psychometric analysis of large-scale assessment data. This paper aims to provide information to support the modernization of current psychometric analysis practices. We share details on the workflow design and functionalities of our modernized data pipeline, which provides a universal interface to large-scale assessments. The methods for developing non-technical and user-friendly interfaces are also discussed.
Ryan Schwarz, H. Bulut & Charles Anifowose, "A data pipeline for e-large-scale assessments: Better automation, quality assurance, and efficiency," International Journal of Assessment Tools in Education, 2023-11-20. doi:10.21449/ijate.1321061
This study aims to investigate the impact of visual arts activities on the socialization and stress management of individuals with special needs. It is a qualitative study employing an action research design, with data collected from teachers' observations. Over a 20-week period, visual arts activities were carried out with 27 individuals with special needs, including six with autism, seven with Down syndrome, and 14 with moderate to severe intellectual disabilities, who received education at the third level of the Fehmi Cerrahoğlu Special Education Practice School in Ordu province during the 2020-2021 and 2021-2022 academic years. The study group included a counselling teacher and 19 special education teachers, who observed the activities and their effects on the socialization levels and stress management of educable individuals with special needs. The data obtained from semi-structured interviews were analyzed using content analysis. Most of the participating teachers agreed that visual arts activities contributed to the socialization and stress management of individuals with special needs, and the study found that these activities played an important role in including individuals with special needs in society and led to a decrease in stress symptoms.
Kıymet Bayer & Seda Liman Turan, "The effect of visual art activities on socialization and stress management of individuals with special needs," International Journal of Assessment Tools in Education, 2023-11-06. doi:10.21449/ijate.1269977
In the current study, differential item functioning (DIF) detection was carried out on real data using the Mantel-Haenszel (MH), Simultaneous Item Bias Test (SIBTEST), Lord's chi-square, and Raju's area methods, both when item purification was performed and when it was not. After gender-related DIF was detected, expert opinions were obtained for the bias study. Conducting this gender bias research on the English test with and without purification is important because, while DIF studies exist in the literature, closely comparable bias studies do not. The sample of the research consists of 7389 students who took the Transition from Primary to Secondary Education Exam (TPSEE, known as "TEOG" in Turkey) administered in April 2017. When gender-related DIF analysis was performed with the four methods, the results were found to differ partially, and the DIF analysis results differed depending on whether item purification was performed. Detection of DIF was indicative of possible bias. In the second stage of the study, the opinions of seven experts were obtained for item 11, for which DIF was detected at at least the B level by MH and SIBTEST. Based on the expert opinions, no item in the English test was found to be biased with respect to gender. It is recommended that similar bias studies be conducted so that test developers become aware of the features that may lead to item bias and construct unbiased items.
Serap Büyükkidik, "Purification procedures used for the detection of gender DIF: Item bias in a foreign language test," International Journal of Assessment Tools in Education, 2023-11-03. doi:10.21449/ijate.1250358
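The purification procedure named in the title is typically an iterative loop: flag DIF items, drop them from the matching criterion, and repeat until the anchor set stabilizes. A sketch of that loop, where `flag_dif` is a hypothetical callback standing in for any of the four DIF tests above:

```python
def purify_items(items, flag_dif):
    """Iteratively remove DIF-flagged items from the matching criterion.

    items: collection of item ids.
    flag_dif(item, anchor) -> bool is a user-supplied DIF test (hypothetical
    signature) that matches examinees on the anchor items only; the studied
    item itself is excluded from its own anchor.
    Returns the set of items still flagged once the anchor set is stable.
    """
    anchor = set(items)
    while True:
        flagged = {i for i in items if flag_dif(i, anchor - {i})}
        new_anchor = set(items) - flagged
        if new_anchor == anchor:
            return flagged
        anchor = new_anchor
```

The design choice purification addresses is contamination of the matching score: if DIF items stay in the criterion, group differences on those items distort the matching and can mask or inflate DIF on other items.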
The purpose of this study is to generate non-verbal items for a visual reasoning test using template-based automatic item generation (AIG). The research followed the three stages of template-based AIG. An item from the 2016 4th-grade entrance exam of the Science and Art Center (known as BİLSEM) was chosen as the parent item. A cognitive model and an item model were developed for non-verbal reasoning, and the items were then generated using computer algorithms. For the first item model, 112 items were generated; for the second item model, 1728 items were produced. The items were evaluated by subject matter experts (SMEs). The SMEs indicated that the items met the criteria of having one right answer, a single content and behavior, non-trivial content, and homogeneous choices. The SMEs' opinions also indicated that the items vary in difficulty. The results demonstrate the feasibility of AIG for creating an extensive repository of non-verbal visual reasoning items.
Ayfer Sayin, Sabiha Bozdağ & Mark J. Gierl, "Automatic item generation for non-verbal reasoning items," International Journal of Assessment Tools in Education, 2023-10-31. doi:10.21449/ijate.1359348
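Template-based AIG of the kind described can be sketched as a combinatorial expansion of an item template over its manipulated elements. The template and element values below are invented for illustration (they are not the BİLSEM parent item), though a 4×4×7 element design does yield the 112 items reported for the first item model:

```python
from itertools import product

# Hypothetical item template with three manipulated elements.
template = ("A {shape} rotated {angle} degrees with {count} dots; "
            "which figure completes the pattern?")
elements = {
    "shape": ["square", "triangle", "hexagon", "circle"],
    "angle": [45, 90, 135, 180],
    "count": [1, 2, 3, 4, 5, 6, 7],
}

def generate_items(template, elements):
    """Instantiate the template once per combination of element values."""
    keys = list(elements)
    return [template.format(**dict(zip(keys, combo)))
            for combo in product(*(elements[k] for k in keys))]

items = generate_items(template, elements)  # 4 x 4 x 7 = 112 distinct items
```

In practice the cognitive model constrains which combinations are valid, so a generator like this is usually followed by filters that discard combinations violating the model.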
Students from elementary to tertiary levels commonly hold misunderstandings and face challenges in acquiring statistical concepts and skills. However, existing statistics assessment frameworks are difficult to put into practice in a classroom setting. The purpose of this research is to develop and validate a tool for assessing the statistical thinking of form one (Grade 7) students. The SOLO model was applied to develop five testlet tasks, each involving four components. The study employed a survey methodology to assess the statistical thinking of 356 form one students. Content validity was established using the Content Validity Index (CVI), and construct validity was established through Rasch analysis. The results demonstrated that the instrument was valid and reliable for assessing form one students' statistical thinking. The findings also provided new evidence that the hierarchical ordering of item levels within the testlet format allowed teachers to track students' progress effectively. The instrument was useful in identifying students' statistical thinking levels: a student's ability to respond appropriately to a task at a particular level reveals their degree of cognitive development. The testlet tasks also made it easy to diagnose strengths and weaknesses in learning statistics topics.
{"title":"Determining the psychometric properties of middle school statistical thinking testlet-based assessment tool","authors":"Lim Hooi Lian, W. Yew","doi":"10.21449/ijate.1255859","journal":"International Journal of Assessment Tools in Education","publicationDate":"2023-10-23"}
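The Content Validity Index used in the study above has a simple arithmetic form. The sketch below assumes the common 4-point relevance scale, where ratings of 3 or 4 count as "relevant"; the expert ratings shown are illustrative, not the study's data.

```python
# Minimal sketch of the Content Validity Index (CVI) computation,
# assuming a 4-point relevance scale (3 or 4 = relevant).

def item_cvi(ratings):
    """I-CVI: proportion of experts rating the item 3 or 4."""
    return sum(r >= 3 for r in ratings) / len(ratings)

def scale_cvi_ave(ratings_by_item):
    """S-CVI/Ave: mean of the item-level CVIs."""
    cvis = [item_cvi(r) for r in ratings_by_item]
    return sum(cvis) / len(cvis)

expert_ratings = [  # rows = items, columns = five hypothetical experts
    [4, 4, 3, 4, 3],
    [3, 4, 4, 2, 4],
    [4, 3, 4, 4, 4],
]
print([round(item_cvi(r), 2) for r in expert_ratings])  # [1.0, 0.8, 1.0]
print(round(scale_cvi_ave(expert_ratings), 2))          # 0.93
```

An I-CVI at or above a cutoff (often 0.78 or higher with several experts) is typically taken as evidence of item-level content validity; the Rasch analysis reported in the abstract addresses construct validity separately.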