Educational Assessment Evaluation and Accountability最新文献

英文中文

Enhanced conceptual understanding through formative assessment: results of a randomized controlled intervention study in physics classes. 通过形成性评价增强概念理解：物理课堂随机对照干预研究结果。

IF 2.8 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2025-01-01 Epub Date: 2024-12-27 DOI: 10.1007/s11092-024-09445-6

Andreas Lichtenberger, Sarah I Hofer, Elsbeth Stern, Andreas Vaterlaus

While formative assessment is a widely valued instructional approach to support meaningful learning, putting it into classroom practice remains a challenge, also because the time resources required may conflict with other goals. In a cluster-randomized controlled intervention study with 29 teachers and 604 students (mean age 15.6 years) at secondary school level, we examined the yield of formative assessment with regard to students' conceptual understanding and quantitative problem-solving skills in physics. Ten teachers applied formative assessment (FA group) in a 14-lesson curriculum on kinematics after having undergone a training that focused on the implementation of multiple-choice concept questions together with monitoring tools, clicker sessions, and reflective lessons. In the frequent testing group (FT group), ten teachers had no training on formative assessment but implemented the same concept questions as those used in the FA group. Nine teachers taught kinematics in their traditional way (TT group). The results revealed that students in the FA group outperformed students in the other two groups in a test on conceptual understanding immediately after the intervention as well as 3 months later, whereas students from the FT group and the TT group did not differ. Importantly, a better conceptual understanding in the FA group was not at the expense of performance in quantitative problem solving, as students of this group better integrated both kinds of knowledge. Our study has shown that a short but well-structured formative assessment teacher training could unfold its potential in terms of students' learning of challenging content.

Supplementary information: The online version contains supplementary material available at 10.1007/s11092-024-09445-6.

虽然形成性评估是一种广泛重视的教学方法，以支持有意义的学习，但将其付诸课堂实践仍然是一个挑战，因为所需的时间资源可能与其他目标相冲突。在一项对29名教师和604名中学生（平均年龄15.6岁）进行的集群随机对照干预研究中，我们检验了形成性评估对学生物理概念理解和定量解决问题能力的影响。10名教师在14课的运动学课程中应用了形成性评估（FA组），他们接受了侧重于多项选择概念问题的实施以及监控工具、点击器会话和反思性课程的培训。在频繁测试组（FT组）中，10名教师没有接受过形成性评估方面的培训，但使用了与FA组相同的概念题。9位老师以传统方式教授运动学（TT组）。结果显示，在干预后立即和3个月后的概念理解测试中，FA组的学生表现优于其他两组的学生，而FT组和TT组的学生没有差异。重要的是，在FA组中，更好的概念理解并没有以牺牲定量问题解决的表现为代价，因为这个组的学生更好地整合了两种知识。我们的研究表明，一个简短但结构良好的形成性评估教师培训可以在学生学习具有挑战性的内容方面发挥其潜力。补充信息：在线版本包含补充资料，可在10.1007/s11092-024-09445-6获得。

{"title":"Enhanced conceptual understanding through formative assessment: results of a randomized controlled intervention study in physics classes.","authors":"Andreas Lichtenberger, Sarah I Hofer, Elsbeth Stern, Andreas Vaterlaus","doi":"10.1007/s11092-024-09445-6","DOIUrl":"10.1007/s11092-024-09445-6","url":null,"abstract":"While formative assessment is a widely valued instructional approach to support meaningful learning, putting it into classroom practice remains a challenge, also because the time resources required may conflict with other goals. In a cluster-randomized controlled intervention study with 29 teachers and 604 students (mean age 15.6 years) at secondary school level, we examined the yield of formative assessment with regard to students' conceptual understanding and quantitative problem-solving skills in physics. Ten teachers applied formative assessment (FA group) in a 14-lesson curriculum on kinematics after having undergone a training that focused on the implementation of multiple-choice concept questions together with monitoring tools, clicker sessions, and reflective lessons. In the frequent testing group (FT group), ten teachers had no training on formative assessment but implemented the same concept questions as those used in the FA group. Nine teachers taught kinematics in their traditional way (TT group). The results revealed that students in the FA group outperformed students in the other two groups in a test on conceptual understanding immediately after the intervention as well as 3 months later, whereas students from the FT group and the TT group did not differ. Importantly, a better conceptual understanding in the FA group was not at the expense of performance in quantitative problem solving, as students of this group better integrated both kinds of knowledge. Our study has shown that a short but well-structured formative assessment teacher training could unfold its potential in terms of students' learning of challenging content.Supplementary information: The online version contains supplementary material available at 10.1007/s11092-024-09445-6.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"37 1","pages":"5-33"},"PeriodicalIF":2.8,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11958380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143774607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IRTrees for skipping items in PIRLS 跳过 PIRLS 项目的 IRTrees

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-07-24 DOI: 10.1007/s11092-024-09439-4

Andrés Christiansen, Rianne Janssen

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect the country’s score. In the Progress in International Reading Literacy Study (PIRLS), incorrect answer substitution is used. This means that skipped and omitted items are treated as incorrect responses. In the present study, the effect of this approach is investigated. The data of 2006, 2011, and 2016 cycles of PIRLS were analyzed using IRTree models in which a sequential tree structure is estimated to model the full response process. Item difficulty, students’ ability, and country means were estimated and compared with results from a Rasch model using the standard PIRLS approach to missing values. Results showed that the IRTree model was able to disentangle the students’ ability and their propensity to skip items, reducing the correlation between ability and the proportion of skipped items in comparison to the Rasch model. Nevertheless, at the country level, the aggregated scores showed no important differences between models for the pooled sample, but some differences within countries across cycles.

在国际大规模评估中，学生可能不会被迫回答每一个测试项目：学生可能决定跳过一个看似困难的项目，也可能在测试结束前退出。如何处理这些缺失的回答将影响对项目难度和学生能力的估计，并最终影响国家的得分。在国际阅读能力进展研究（PIRLS）中，使用了错误答案替换法。这意味着跳过和遗漏的项目将被视为错误答案。本研究调查了这种方法的效果。我们使用 IRTree 模型对 2006、2011 和 2016 年 PIRLS 的数据进行了分析。对项目难度、学生能力和国家平均值进行了估计，并与使用标准 PIRLS 方法处理缺失值的 Rasch 模型的结果进行了比较。结果表明，与 Rasch 模型相比，IRTree 模型能够将学生的能力与其跳题倾向区分开来，降低了能力与跳题比例之间的相关性。尽管如此，在国家层面上，总分显示，在汇总样本中，不同模型之间没有重要差异，但在不同周期的国家内部存在一些差异。

{"title":"IRTrees for skipping items in PIRLS","authors":"Andrés Christiansen, Rianne Janssen","doi":"10.1007/s11092-024-09439-4","DOIUrl":"https://doi.org/10.1007/s11092-024-09439-4","url":null,"abstract":"In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect the country’s score. In the Progress in International Reading Literacy Study (PIRLS), incorrect answer substitution is used. This means that skipped and omitted items are treated as incorrect responses. In the present study, the effect of this approach is investigated. The data of 2006, 2011, and 2016 cycles of PIRLS were analyzed using IRTree models in which a sequential tree structure is estimated to model the full response process. Item difficulty, students’ ability, and country means were estimated and compared with results from a Rasch model using the standard PIRLS approach to missing values. Results showed that the IRTree model was able to disentangle the students’ ability and their propensity to skip items, reducing the correlation between ability and the proportion of skipped items in comparison to the Rasch model. Nevertheless, at the country level, the aggregated scores showed no important differences between models for the pooled sample, but some differences within countries across cycles.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"9 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

How representative is the Swedish PISA sample? A comparison of PISA and register data 瑞典国际学生评估项目（PISA）样本的代表性如何？PISA 与登记数据的比较

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-07-06 DOI: 10.1007/s11092-024-09438-5

Linda Borger, Stefan Johansson, Rolf Strietholt

PISA aims to serve as a “global yardstick” for educational success, as measured by student performance. For comparisons to be meaningful across countries or over time, PISA samples must be representative of the population of 15-year-old students in each country. Exclusions and non-response can undermine this representativeness and potentially bias estimates of student performance. Unfortunately, testing the representativeness of PISA samples is typically infeasible due to unknown population parameters. To address this issue, we integrate PISA 2018 data with comprehensive Swedish registry data, which includes all students in Sweden. Utilizing various achievement measures, we find that the Swedish PISA sample significantly overestimates the achievement levels in Sweden. The observed difference equates to standardized effect sizes ranging from d = .19 to .28, corresponding to approximately 25 points on the PISA scale or an additional year of schooling. The paper concludes with a plea for more rigorous quality standards and their strict enforcement.

国际学生评估项目旨在作为衡量教育成功与否的 "全球尺度"，以学生的表现来衡量。为了使各国之间或不同时期的比较具有意义，PISA 的样本必须能够代表每个国家 15 岁的学生群体。排除和不响应会破坏这种代表性，并可能使对学生成绩的估计出现偏差。遗憾的是，由于人口参数未知，测试 PISA 样本的代表性通常是不可行的。为了解决这个问题，我们将 PISA 2018 数据与瑞典的综合登记数据相结合，其中包括瑞典的所有学生。利用各种成绩衡量标准，我们发现瑞典 PISA 样本大大高估了瑞典的成绩水平。观察到的差异相当于 d = 0.19 至 0.28 的标准化效应大小，相当于 PISA 量表上的约 25 分或额外一年的学校教育。本文最后呼吁制定更严格的质量标准并严格执行。

引用次数: 0

Dimensions of teachers’ data literacy: A systematic review of literature from 1990 to 2021 教师的数据素养：对 1990 年至 2021 年文献的系统回顾

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-05-06 DOI: 10.1007/s11092-024-09435-8

Jihyun Lee, Dennis Alonzo, Kim Beswick, Jan Michael Vincent Abril, Adrian W. Chew, Cherry Zin Oo

The current study presents a systematic review of teachers’ data literacy, arising from a synthesis of 83 empirical studies published between 1990 to 2021. Our review identified 95 distinct indicators across five dimensions: (a) knowledge about data, (b) skills in using data, (c) dispositions towards data use, (d) data application for various purposes, and (e) data-related behaviors. Our findings indicate that teachers' data literacy goes beyond addressing the needs of supporting student learning and includes elements such as teacher reflection, collaboration, communication, and participation in professional development. Considering these findings, future policies should acknowledge the significance of teacher dispositions and behaviors in relation to data, recognizing that they are as important as knowledge and skills acquisition. Additionally, prioritizing the provision of system-level support to foster teacher collaboration within in-school professional development programs may prove useful in enhancing teachers’ data literacy.

本研究综述了 1990 年至 2021 年间发表的 83 项实证研究，对教师的数据素养进行了系统回顾。我们从五个方面确定了 95 个不同的指标：（a）数据知识，（b）使用数据的技能，（c）使用数据的态度，（d）为各种目的应用数据，以及（e）与数据相关的行为。我们的研究结果表明，教师的数据素养不仅仅是满足支持学生学习的需要，还包括教师的反思、合作、交流和参与专业发展等要素。考虑到这些发现，未来的政策应承认教师在数据方面的处置和行为的重要性，承认它们与知识和技能的学习同等重要。此外，优先提供系统层面的支持，促进教师在校内专业发展项目中的合作，可能会有助于提高教师的数据素养。

{"title":"Dimensions of teachers’ data literacy: A systematic review of literature from 1990 to 2021","authors":"Jihyun Lee, Dennis Alonzo, Kim Beswick, Jan Michael Vincent Abril, Adrian W. Chew, Cherry Zin Oo","doi":"10.1007/s11092-024-09435-8","DOIUrl":"https://doi.org/10.1007/s11092-024-09435-8","url":null,"abstract":"The current study presents a systematic review of teachers’ data literacy, arising from a synthesis of 83 empirical studies published between 1990 to 2021. Our review identified 95 distinct indicators across five dimensions: (a) knowledge about data, (b) skills in using data, (c) dispositions towards data use, (d) data application for various purposes, and (e) data-related behaviors. Our findings indicate that teachers' data literacy goes beyond addressing the needs of supporting student learning and includes elements such as teacher reflection, collaboration, communication, and participation in professional development. Considering these findings, future policies should acknowledge the significance of teacher dispositions and behaviors in relation to data, recognizing that they are as important as knowledge and skills acquisition. Additionally, prioritizing the provision of system-level support to foster teacher collaboration within in-school professional development programs may prove useful in enhancing teachers’ data literacy.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"51 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140884447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Examining pre-service teachers’ feedback on low- and high-quality written assignments 研究职前教师对低质量和高质量书面作业的反馈意见

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-04-16 DOI: 10.1007/s11092-024-09432-x

Ignacio Máñez, Anastasiya A. Lipnevich, Carolina Lopera-Oquendo, Raquel Cerdán

Assessing student writing assignments and providing effective feedback are a complex pedagogical skill teacher candidates need to master. Scarce literature has closely examined the type of feedback that pre-service high-school teachers spontaneously deliver when assessing student writings, which is the main goal of our study. In a sample of 255 high school teacher candidates, we examined the type of feedback that they provided when assessing two writing assignments that differed in quality. One thousand eight hundred thirty-five comments were analyzed and coded into 11 sub-categories. Results showed that candidates’ feedback not only focused on task performance but also on the writing process. Although candidates provided critical and past-oriented evaluations frequently, they also crafted feedback in a neutral tone and included future-oriented suggestions. Further, feedback varied as a function of candidates’ gender, academic discipline, and students’ quality of writing. Teacher training programs may use this information to design resources to address nuances of feedback provision.

评估学生的写作作业并提供有效反馈是师范生需要掌握的一项复杂的教学技能。很少有文献仔细研究职前高中教师在评估学生写作时自发提供的反馈类型，而这正是我们研究的主要目标。我们对 255 名高中教师候选人进行了抽样调查，研究了他们在评价两份质量不同的写作作业时所提供的反馈类型。我们对 1835 条评论进行了分析，并将其编码为 11 个子类别。结果显示，候选人的反馈意见不仅关注任务的完成情况，也关注写作过程。虽然考生经常提供批判性和面向过去的评价，但他们也以中性的语气提出反馈意见，并包含面向未来的建议。此外，反馈也因考生的性别、学科和学生的写作质量而有所不同。教师培训项目可以利用这些信息来设计资源，以解决提供反馈时的细微差别。

{"title":"Examining pre-service teachers’ feedback on low- and high-quality written assignments","authors":"Ignacio Máñez, Anastasiya A. Lipnevich, Carolina Lopera-Oquendo, Raquel Cerdán","doi":"10.1007/s11092-024-09432-x","DOIUrl":"https://doi.org/10.1007/s11092-024-09432-x","url":null,"abstract":"Assessing student writing assignments and providing effective feedback are a complex pedagogical skill teacher candidates need to master. Scarce literature has closely examined the type of feedback that pre-service high-school teachers spontaneously deliver when assessing student writings, which is the main goal of our study. In a sample of 255 high school teacher candidates, we examined the type of feedback that they provided when assessing two writing assignments that differed in quality. One thousand eight hundred thirty-five comments were analyzed and coded into 11 sub-categories. Results showed that candidates’ feedback not only focused on task performance but also on the writing process. Although candidates provided critical and past-oriented evaluations frequently, they also crafted feedback in a neutral tone and included future-oriented suggestions. Further, feedback varied as a function of candidates’ gender, academic discipline, and students’ quality of writing. Teacher training programs may use this information to design resources to address nuances of feedback provision.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"15 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140614642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Legitimising capital: parent organisations and their resistance to testing in England 资本合法化：英格兰家长组织及其对考试的抵制

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-03-06 DOI: 10.1007/s11092-024-09431-y

This paper presents findings from a year-long network ethnography into the strategies, networks, and outcomes of More Than a Score (MTAS)—a campaign against standardised testing in UK primary schools. Focusing specifically on the parent-based groups of the organisation, we use theorisations of symbolic capital to challenge traditional understandings of how capital can be leveraged for group advancement. We argue that MTAS frames itself as a grassroots organisation, using this image to promote its agenda amongst possible allies. Parent groups serve a critical role in accentuating the ‘grassroots’ image, as they bring a level of credibility to this claim. At the same time, the individuals who run these groups also bring technical skills, professional experiences, and connections that provide logistical and expert capital to the range of MTAS’s strategies and agendas. In doing so, their political and social capital as ‘parents’ provide a sort of legitimising capital to MTAS.

本文介绍了对 "比分数更重要"（MTAS）--一项反对英国小学标准化考试的运动--的战略、网络和成果进行为期一年的网络人种学研究的结果。我们特别关注该组织中以家长为基础的团体，利用象征资本的理论来挑战关于如何利用资本促进团体进步的传统理解。我们认为，MTAS 将自己塑造成一个草根组织，利用这一形象在可能的盟友中推广其议程。家长团体在突出 "草根 "形象方面发挥了关键作用，因为他们为这一主张带来了一定的可信度。同时，管理这些团体的个人也带来了技术技能、专业经验和人脉关系，为 MTAS 的一系列战略和议程提供了后勤和专家资本。这样，他们作为 "家长 "的政治和社会资本就为 MTAS 提供了某种合法化资本。

引用次数: 0

Signal, error, or bias? exploring the uses of scores from observation systems 信号、误差还是偏差？探索观察系统评分的用途

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-01-30 DOI: 10.1007/s11092-024-09427-8

Mark White, Kirsti Klette

Scores from observational measures of teaching have recently been put to many uses within school systems, including communicating a standard of practice and providing teacher feedback, identifying teachers for professional development, monitoring system equity, and making employment decisions. In each of these uses, observation scores are interpreted as representing some aspect of the enacted instruction or teachers’ capacity to enact instruction, as seen through the observation systems lens for understanding teaching quality. The quality of these interpretations, or the extent to which observation scores are composed of a signal that accurately reflects the interpretation, has important implications for the overall validity of uses of observation systems. Starting from an explicit conceptualization of instruction, this paper combines generalizability theory and hierarchical linear modelling approaches to decompose observation scores to explore the extent to which scores from observation systems are composed of signal, error, and bias across four different uses (i.e., teacher feedback, professional development, monitoring system equity, and employment decisions) of scores. We show that the quality of observation scores may depend more on what scores are interpreted as representing (i.e., the proposed use) than on the specific observation rubric being used. Further, we show that rater errors and biases are a major threat to any attempt to interpret observation scores as capturing the observation system’s understanding of teaching quality. We discuss implications for using scores from observation systems.

近来，教学观测的分数在学校系统中被广泛应用，包括传达实践标准和提供教师反馈、确定教师的专业发展、监督系统的公平性以及做出聘用决定。在上述每一种用途中，观察分数都被解释为代表了教学的某些方面或教师开展教学的能力，正如通过观察系统的视角来理解教学质量一样。这些解释的质量，或者说观察分数在多大程度上是由能准确反映解释的信号组成的，对观察系统使用的整体有效性有着重要影响。本文从明确的教学概念出发，结合可概括性理论和分层线性建模方法，对观察评分进行分解，以探讨观察系统的评分在四种不同用途（即教师反馈、专业发展、监测系统公平性和就业决策）中由信号、误差和偏差组成的程度。我们的研究表明，观察评分的质量可能更多地取决于评分被解释为代表什么（即建议的用途），而不是所使用的具体观察评分标准。此外，我们还表明，任何试图将观察评分解释为观察系统对教学质量的理解的做法，都会受到评分者误差和偏见的严重威胁。我们讨论了使用观察系统评分的意义。

{"title":"Signal, error, or bias? exploring the uses of scores from observation systems","authors":"Mark White, Kirsti Klette","doi":"10.1007/s11092-024-09427-8","DOIUrl":"https://doi.org/10.1007/s11092-024-09427-8","url":null,"abstract":"Scores from observational measures of teaching have recently been put to many uses within school systems, including communicating a standard of practice and providing teacher feedback, identifying teachers for professional development, monitoring system equity, and making employment decisions. In each of these uses, observation scores are interpreted as representing some aspect of the enacted instruction or teachers’ capacity to enact instruction, as seen through the observation systems lens for understanding teaching quality. The quality of these interpretations, or the extent to which observation scores are composed of a signal that accurately reflects the interpretation, has important implications for the overall validity of uses of observation systems. Starting from an explicit conceptualization of instruction, this paper combines generalizability theory and hierarchical linear modelling approaches to decompose observation scores to explore the extent to which scores from observation systems are composed of signal, error, and bias across four different uses (i.e., teacher feedback, professional development, monitoring system equity, and employment decisions) of scores. We show that the quality of observation scores may depend more on what scores are interpreted as representing (i.e., the proposed use) than on the specific observation rubric being used. Further, we show that rater errors and biases are a major threat to any attempt to interpret observation scores as capturing the observation system’s understanding of teaching quality. We discuss implications for using scores from observation systems.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"15 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139648079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Key stakeholder voices: Investigating student perceptions of teachers’ use of assessment for learning 主要利益相关者的声音：调查学生对教师使用学习评价的看法

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-01-15 DOI: 10.1007/s11092-024-09428-7

Christel H. D. Wolterinck-Broekhuis, Cindy L. Poortman, Kim Schildkamp, Adrie J. Visscher

Many schools aim to implement Assessment for Learning (AfL) to stimulate students to take more ownership of their learning and develop self-regulatory skills. This survey-based study is among the few in the field showing how students experience the extent of implementation of AfL, here in English language and mathematics classes in 12 Dutch secondary schools. Analysis showed no differences between the subjects. Students (N = 685) experienced activities to clarify the learning intentions and success criteria and eliciting evidence on their learning progress regularly (between 50–74% of the lessons). Students hardly ever experienced activities aimed at peer- and self-assessment (less than 25% of the lessons). However, cluster analysis revealed three distinct clusters related to the extent of AfL strategy use students experienced. Overall, we can conclude that AfL is not yet fully integrated into teaching practices. Teachers’ skills, knowledge and attitudes required to increase student engagement in AfL practices and strengthen students’ self-regulated learning need more attention in future teacher professional development trajectories.

许多学校都希望通过实施 "学习评价"（AfL）来激发学生的学习自主性，并培养他们的自我调节能力。这项基于调查的研究是该领域为数不多的研究之一，它以荷兰 12 所中学的英语和数学课为例，展示了学生如何体验学习评价的实施程度。分析表明，不同科目之间没有差异。学生（人数 = 685）经常体验到明确学习意图和成功标准的活动，以及获取学习进展证据的活动（占课时的 50-74% 之间）。学生几乎没有经历过旨在进行互评和自评的活动（少于 25%）。然而，聚类分析显示，与学生使用 AfL 策略的程度相关的有三个不同的聚类。总之，我们可以得出这样的结论，即 AfL 还没有完全融入教学实践。在今后的教师专业发展过程中，我们需要更多关注教师的技能、知识和态度，以提高学生在非语言学习实践中的参与度，加强学生的自我调节学习。

{"title":"Key stakeholder voices: Investigating student perceptions of teachers’ use of assessment for learning","authors":"Christel H. D. Wolterinck-Broekhuis, Cindy L. Poortman, Kim Schildkamp, Adrie J. Visscher","doi":"10.1007/s11092-024-09428-7","DOIUrl":"https://doi.org/10.1007/s11092-024-09428-7","url":null,"abstract":"Many schools aim to implement Assessment for Learning (AfL) to stimulate students to take more ownership of their learning and develop self-regulatory skills. This survey-based study is among the few in the field showing how students experience the extent of implementation of AfL, here in English language and mathematics classes in 12 Dutch secondary schools. Analysis showed no differences between the subjects. Students (N = 685) experienced activities to clarify the learning intentions and success criteria and eliciting evidence on their learning progress regularly (between 50–74% of the lessons). Students hardly ever experienced activities aimed at peer- and self-assessment (less than 25% of the lessons). However, cluster analysis revealed three distinct clusters related to the extent of AfL strategy use students experienced. Overall, we can conclude that AfL is not yet fully integrated into teaching practices. Teachers’ skills, knowledge and attitudes required to increase student engagement in AfL practices and strengthen students’ self-regulated learning need more attention in future teacher professional development trajectories.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"15 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139469655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparing the teaching quality of alternatively certified teachers and traditionally certified teachers: findings from a large-scale study 比较另类认证教师和传统认证教师的教学质量：一项大规模研究的结果

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2024-01-12 DOI: 10.1007/s11092-023-09426-1

Christin Lucksnat, Eric Richter, Sofie Henschel, Lars Hoffmann, Stefan Schipolowski, Dirk Richter

The present study investigates whether alternatively and traditionally certified teachers differ in their teaching quality. We conducted doubly latent multilevel analysis using a large-scale dataset with student ratings on the quality of instruction provided by 1550 traditionally and 135 alternatively certified secondary school mathematics teachers in Germany. Findings show no evidence of differences in teaching quality between alternatively and traditionally certified teachers. Independent of the type of certification, novice teachers scored lower on classroom management than experienced teachers. Findings suggest that beginning teachers—whether alternatively or traditionally certified—need to be better prepared for classroom management before starting to teach.

本研究调查了获得替代认证的教师和获得传统认证的教师在教学质量上是否存在差异。我们利用大规模数据集对德国 1550 名获得传统认证的中学数学教师和 135 名获得替代认证的中学数学教师的教学质量进行了双重潜在多层次分析。研究结果表明，没有证据表明另类认证教师和传统认证教师的教学质量存在差异。与认证类型无关，新手教师在课堂管理方面的得分低于经验丰富的教师。研究结果表明，初任教师--不论是通过另类认证还是传统认证--在开始教学之前都需要在课堂管理方面做好更充分的准备。

引用次数: 0

Re-contextualization of school quality assessment policies: an ethnography approach to SIMCE on Chilean disadvantaged schools 学校质量评估政策的再语境化：智利贫困学校 SIMCE 的人种学方法

IF 3.9 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability

Pub Date : 2023-12-28 DOI: 10.1007/s11092-023-09425-2

Abstract

Accountability policies through standardized testing are widespread in diverse educational systems. Based on an ethnographic research study, we sought to understand how a learning assessment policy, used for over three decades in primary and lower secondary education, is lived and interpreted in daily school life in Chile. This article analyzes how learning assessment policies are re-contextualized (translated and interpreted) in three elementary and high schools located in poor neighborhoods with different performance categorization according to students’ test results. We conducted interviews and observations with teachers and school leadership team members. Results show that school actors experience the effects of testing both as a menace and as a pressure despite their different school categorization performance. They consider the tests a decontextualized and unjust measurement and therefore respond with similar strategies that seek to avoid these negative consequences. This study contributes to understanding the effects of tight accountability systems that attempt to place different levels of pressure through categorization.

摘要通过标准化考试实行问责制的政策在不同的教育体制中普遍存在。基于一项人种学调查研究，我们试图了解在智利小学和初中教育中使用了三十多年的学习评估政策是如何在日常学校生活中被应用和诠释的。本文分析了学习评估政策是如何在三所位于贫困街区的小学和中学中被重新语境化（翻译和解释）的，这些学校根据学生的考试成绩进行了不同的成绩分类。我们对教师和学校领导团队成员进行了访谈和观察。结果表明，尽管学校的分类表现不同，但学校的参与者都将考试的影响视为一种威胁和压力。他们认为考试是一种脱离实际的、不公正的衡量标准，因此采取了类似的策略来避免这些负面影响。本研究有助于理解严格的问责制度的影响，这种制度试图通过分类施加不同程度的压力。

{"title":"Re-contextualization of school quality assessment policies: an ethnography approach to SIMCE on Chilean disadvantaged schools","authors":"","doi":"10.1007/s11092-023-09425-2","DOIUrl":"https://doi.org/10.1007/s11092-023-09425-2","url":null,"abstract":"<h3>Abstract</h3> Accountability policies through standardized testing are widespread in diverse educational systems. Based on an ethnographic research study, we sought to understand how a learning assessment policy, used for over three decades in primary and lower secondary education, is lived and interpreted in daily school life in Chile. This article analyzes how learning assessment policies are re-contextualized (translated and interpreted) in three elementary and high schools located in poor neighborhoods with different performance categorization according to students’ test results. We conducted interviews and observations with teachers and school leadership team members. Results show that school actors experience the effects of testing both as a menace and as a pressure despite their different school categorization performance. They consider the tests a decontextualized and unjust measurement and therefore respond with similar strategies that seek to avoid these negative consequences. This study contributes to understanding the effects of tight accountability systems that attempt to place different levels of pressure through categorization.","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"25 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139078118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Educational Assessment Evaluation and Accountability

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀