评估德国国际学生评估项目分层设计:模拟研究

IF 2.6 Q1 EDUCATION & EDUCATIONAL RESEARCH Large-Scale Assessments in Education Pub Date : 2024-05-08 DOI:10.1186/s40536-024-00203-0
Julia Mang, Helmut Küchenhoff, Sabine Meinck
{"title":"评估德国国际学生评估项目分层设计:模拟研究","authors":"Julia Mang, Helmut Küchenhoff, Sabine Meinck","doi":"10.1186/s40536-024-00203-0","DOIUrl":null,"url":null,"abstract":"<p>Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the <i>Programme for International Student Assessment</i> (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance can improve the efficiency of a survey design. Second, stratification allows one to, explicitly or implicitly, control for sample sizes across subpopulations. It ensures that some parts of a population are in the sample in predetermined proportions. In this study, we determine through simulation which stratification scheme is best for PISA in Germany. For this, we consider the constraints imposed by the international sampling design, the available information about schools, and specific national characteristics of the German educational system. We examine seven different stratification designs selected based on scenarios used in past LSAs in Germany and theoretical considerations for future implementations. The chosen scenarios were compared with two reference scenarios: (1) an unstratified design and (2) a synthetic optimal stratification design. The simulation study reveals that the stratification design currently applied in PISA produces satisfactory results regarding sampling precision. The present stratification design is based on Germany's federal states and school types. However, this approach leads to small strata, which has been problematic for estimating sampling variance in previous cycles. Therefore, alternative stratification scenarios were considered and, in addition to overcoming the small-strata problem, also led to smaller standard errors for estimates of student mean performance in mathematics, science, and reading. As a result of this study, we recommend considering three different stratification designs for Germany in future cycles of PISA. These recommendations aim to: (1) improve the sampling efficiency while keeping the sample size constant, (2) follow a sound methodological approach, and (3) make conservative and cautious changes while maintaining a reflection of the structure of the German federal school system with different school types. These suggestions include a reinvented stratification of grouped German federal states and designs with school types as explicit stratifiers and federal states as implicit stratifiers.</p>","PeriodicalId":37009,"journal":{"name":"Large-Scale Assessments in Education","volume":"94 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating German PISA stratification designs: a simulation study\",\"authors\":\"Julia Mang, Helmut Küchenhoff, Sabine Meinck\",\"doi\":\"10.1186/s40536-024-00203-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the <i>Programme for International Student Assessment</i> (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance can improve the efficiency of a survey design. Second, stratification allows one to, explicitly or implicitly, control for sample sizes across subpopulations. It ensures that some parts of a population are in the sample in predetermined proportions. In this study, we determine through simulation which stratification scheme is best for PISA in Germany. For this, we consider the constraints imposed by the international sampling design, the available information about schools, and specific national characteristics of the German educational system. We examine seven different stratification designs selected based on scenarios used in past LSAs in Germany and theoretical considerations for future implementations. The chosen scenarios were compared with two reference scenarios: (1) an unstratified design and (2) a synthetic optimal stratification design. The simulation study reveals that the stratification design currently applied in PISA produces satisfactory results regarding sampling precision. The present stratification design is based on Germany's federal states and school types. However, this approach leads to small strata, which has been problematic for estimating sampling variance in previous cycles. Therefore, alternative stratification scenarios were considered and, in addition to overcoming the small-strata problem, also led to smaller standard errors for estimates of student mean performance in mathematics, science, and reading. As a result of this study, we recommend considering three different stratification designs for Germany in future cycles of PISA. These recommendations aim to: (1) improve the sampling efficiency while keeping the sample size constant, (2) follow a sound methodological approach, and (3) make conservative and cautious changes while maintaining a reflection of the structure of the German federal school system with different school types. These suggestions include a reinvented stratification of grouped German federal states and designs with school types as explicit stratifiers and federal states as implicit stratifiers.</p>\",\"PeriodicalId\":37009,\"journal\":{\"name\":\"Large-Scale Assessments in Education\",\"volume\":\"94 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Large-Scale Assessments in Education\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s40536-024-00203-0\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Large-Scale Assessments in Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40536-024-00203-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

摘要

分层是许多采用复杂抽样设计的研究的一个重要设计特征,它经常用于大规模评估 (LSA)研究,如国际学生评估项目(PISA),主要有两个原因。首先,分层变量的层间方差大、层内方差小,可以提高调查设计的效率。其次,分层可以或明或暗地控制各子人群的样本量。它能确保人口中的某些部分按预定比例进入样本。在本研究中,我们通过模拟确定哪种分层方案最适合德国国际学生评估项目。为此,我们考虑了国际抽样设计所带来的限制、现有的学校信息以及德国教育系统的具体国家特征。我们研究了七种不同的分层设计,这些设计是根据德国过去的 LSA 所使用的方案以及对未来实施的理论考虑而选定的。我们将所选方案与两种参考方案进行了比较:(1) 无分层设计和 (2) 合成最优分层设计。模拟研究表明,PISA 目前采用的分层设计在抽样精度方面取得了令人满意的结果。目前的分层设计基于德国的联邦州和学校类型。然而,这种方法导致分层较小,这在以前的周期中对抽样方差的估计造成了问题。因此,我们考虑了其他分层方案,除了克服小分层问题外,还缩小了数学、科学和阅读方面学生平均成绩估计值的标准误差。通过这项研究,我们建议德国在未来的国际学生评估项目中考虑三种不同的分层设计。这些建议旨在(1) 在保持样本量不变的情况下,提高抽样效率;(2) 遵循合理的方法论;(3) 在保持反映德国联邦学校系统不同学校类型结构的情况下,做出保守而谨慎的改变。这些建议包括对德国联邦州进行重新分层,以及将学校类型作为显性分层和联邦州作为隐性分层的设计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Evaluating German PISA stratification designs: a simulation study

Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the Programme for International Student Assessment (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance can improve the efficiency of a survey design. Second, stratification allows one to, explicitly or implicitly, control for sample sizes across subpopulations. It ensures that some parts of a population are in the sample in predetermined proportions. In this study, we determine through simulation which stratification scheme is best for PISA in Germany. For this, we consider the constraints imposed by the international sampling design, the available information about schools, and specific national characteristics of the German educational system. We examine seven different stratification designs selected based on scenarios used in past LSAs in Germany and theoretical considerations for future implementations. The chosen scenarios were compared with two reference scenarios: (1) an unstratified design and (2) a synthetic optimal stratification design. The simulation study reveals that the stratification design currently applied in PISA produces satisfactory results regarding sampling precision. The present stratification design is based on Germany's federal states and school types. However, this approach leads to small strata, which has been problematic for estimating sampling variance in previous cycles. Therefore, alternative stratification scenarios were considered and, in addition to overcoming the small-strata problem, also led to smaller standard errors for estimates of student mean performance in mathematics, science, and reading. As a result of this study, we recommend considering three different stratification designs for Germany in future cycles of PISA. These recommendations aim to: (1) improve the sampling efficiency while keeping the sample size constant, (2) follow a sound methodological approach, and (3) make conservative and cautious changes while maintaining a reflection of the structure of the German federal school system with different school types. These suggestions include a reinvented stratification of grouped German federal states and designs with school types as explicit stratifiers and federal states as implicit stratifiers.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Large-Scale Assessments in Education
Large-Scale Assessments in Education Social Sciences-Education
CiteScore
4.30
自引率
6.50%
发文量
16
审稿时长
13 weeks
期刊最新文献
Assessment of student ICT competence according to mathematics, science, and reading literacy: evidence from PISA 2018 Teacher-centered analysis with TIMSS and PIRLS data: weighting approaches, accuracy, and precision Secondary school students’ attitudes of tolerance towards minorities Participation rates, characteristics, and differential effects on reading literacy of extracurricular tutoring in a German large-scale assessment Teaching practices and organisational aspects associated with the use of ICT
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1