Using Pictorial Glossaries as an Accommodation for English Learners: An Exploratory Study
Pub Date: 2019-05-24 | DOI: 10.1080/10627197.2019.1615371 | Educational Assessment, 24(1), 235–265
Sultan Turkan, Alexis A. López, René Lawless, Florencia Tolentino
ABSTRACT In this article we explore the use of pictorial glossaries as an accommodation for English learners (ELs) at entry and emerging levels of English language proficiency. Drawing on survey responses from 98 middle school ELs and cognitive interviews with 10 of the survey participants, we examined the participants’ preferences and experiences with using accommodations and explored how some of them responded to NAEP mathematics items using pictorial glossaries. Our findings showed that the participants viewed the use of pictures, videos, and translations as useful, but they had little experience using these types of accommodations. We also found that the pictorial glosses sometimes helped ELs understand the local meaning of words in the mathematics problems but not the global meaning conveyed at the sentence level.
{"title":"Using Pictorial Glossaries as an Accommodation for English Learners: An Exploratory Study","authors":"Sultan Turkan, Alexis A. López, René Lawless, Florencia Tolentino","doi":"10.1080/10627197.2019.1615371","DOIUrl":"https://doi.org/10.1080/10627197.2019.1615371","url":null,"abstract":"ABSTRACT In this article we explore the use of pictorial glossaries as an accommodation for English learners (ELs) with entry and emerging levels of English language proficiency. Drawing on survey responses from 98 middle school ELs and cognitive interviews with 10 of the survey participants, we examined the participants’ preferences and experiences with using accommodations and explored how some of them responded to NAEP mathematics items using pictorial glossaries. Our findings showed that the participants viewed the use of pictures, videos, and translations as useful, but they did not have a lot of experience using these types of accommodations. Also, we found that the pictorial glosses sometimes helped ELs with understanding the local meaning of words in the mathematics problems but not with the global meaning conveyed at the sentence level in the problems.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"235 - 265"},"PeriodicalIF":1.5,"publicationDate":"2019-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2019.1615371","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48707267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Generalized Scoring Process to Measure Collaborative Problem Solving in Online Environments
Pub Date: 2019-05-16 | DOI: 10.1080/10627197.2019.1615372 | Educational Assessment, 24(1), 213–234
C. Scoular, E. Care
ABSTRACT Recent educational and psychological research has highlighted shifting workplace requirements and the changes required to equip the emerging workforce with skills for the 21st century. These shifts highlight the issues surrounding, and drive the importance of, new methods of assessment. This study addresses some of these issues by describing a scoring process for measuring collaborative problem solving (CPS) in online environments. The method presented, from conceptualization to implementation, centers on its generalizable application, presenting a systematic process of identifying, coding, and scoring behavior patterns in log-stream data generated from assessments. Item Response Theory was used to investigate the psychometric properties of the behavior patterns. The goal of this study was to present an approach that informs new measurement practices in relation to sociocognitive latent traits and their processes. The generalized scoring process provides an efficient approach for developing measures of social and cognitive skills in online environments.
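The identify → code → score pipeline the abstract describes can be illustrated with a minimal sketch. The event names, pattern definitions, and one-occurrence scoring rule below are hypothetical placeholders, not the authors' instrument; only the overall sequence applied to log-stream data is what the sketch shows.

```python
# Minimal sketch of coding and scoring behavior patterns from log-stream
# data. Event names, pattern mappings, and the scoring rule are invented.
import pandas as pd

# Each row is one logged event from the collaborative task.
log = pd.DataFrame({
    "student": ["s1", "s1", "s1", "s2", "s2"],
    "event":   ["chat_send", "resource_share", "chat_send",
                "chat_send", "action_attempt"],
})

# 1. Identify: map raw events to theoretically defined behavior patterns.
patterns = {
    "chat_send":      "communication",
    "resource_share": "cooperation",
    "action_attempt": "task_regulation",
}
log["pattern"] = log["event"].map(patterns)

# 2. Code: tally how often each student exhibits each pattern.
counts = log.pivot_table(index="student", columns="pattern",
                         aggfunc="size", fill_value=0)

# 3. Score: dichotomize each pattern (shown / not shown) so the resulting
# indicators can be treated as items and calibrated with an IRT model.
scored = (counts >= 1).astype(int)
print(scored)
```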
{"title":"A Generalized Scoring Process to Measure Collaborative Problem Solving in Online Environments","authors":"C. Scoular, E. Care","doi":"10.1080/10627197.2019.1615372","DOIUrl":"https://doi.org/10.1080/10627197.2019.1615372","url":null,"abstract":"ABSTRACT Recent educational and psychological research has highlighted shifting workplace requirements and change required to equip the emerging workforce with skills for the 21st century. The emergence of these highlights the issues, and drives the importance, of new methods of assessment. This study addresses some of the issues by describing a scoring process for measuring collaborative problem solving (CPS) in online environments. The method presented, from conceptualization to implementation, centers on its generalizable application, presenting a systematic process of identifying, coding, and scoring behavior patterns in log stream data generated from assessments. Item Response Theory was used to investigate the psychometric properties of behavior patterns. The goal of this study was to present an approach that informs new measurement practices in relation to sociocognitive latent traits and their processes. The generalized scoring process provides an efficient approach to develop measures of social and cognitive skills in online environments.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"213 - 234"},"PeriodicalIF":1.5,"publicationDate":"2019-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2019.1615372","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46399201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Validity of a Special Education Teacher Observation System
Pub Date: 2019-05-01 | DOI: 10.1080/10627197.2019.1702461 | Educational Assessment, 25(1), 31–46
Evelyn S. Johnson, Angela R. Crawford, Laura A. Moylan, Yuzhu Z. Zheng
ABSTRACT This manuscript describes the comprehensive validation work undertaken to develop the Recognizing Effective Special Education Teachers (RESET) observation system, which was designed to provide evaluations of special education teachers’ ability to effectively implement evidence-based practices and to provide specific, actionable feedback to teachers on how to improve instruction. Following the guidance for developing effective educator evaluation systems, we employed the Evidence-Centered Design framework, articulated the claims and inferences to be made with RESET, and conducted a series of studies to collect evidence to evaluate its validity. Our efforts and results to date are described, and implications for practice and further research are discussed.
{"title":"Validity of a Special Education Teacher Observation System","authors":"Evelyn S. Johnson, Angela R. Crawford, Laura A. Moylan, Yuzhu Z. Zheng","doi":"10.1080/10627197.2019.1702461","DOIUrl":"https://doi.org/10.1080/10627197.2019.1702461","url":null,"abstract":"ABSTRACT This manuscript describes the comprehensive validation work undertaken to develop the Recognizing Effective Special Education Teachers (RESET) observation system, which was designed to provide evaluations of special education teachers’ ability to effectively implement evidence-based practices and to provide specific, actionable feedback to teachers on how to improve instruction. Following the guidance for developing effective educator evaluation systems, we employed the Evidence-Centered Design framework, articulated the claims and inferences to be made with RESET, and conducted a series of studies to collect evidence to evaluate its validity. Our efforts and results to date are described, and implications for practice and further research are discussed.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"25 1","pages":"31 - 46"},"PeriodicalIF":1.5,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2019.1702461","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48948656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Are Test and Academic Disengagement Related? Implications for Measurement and Practice
Pub Date: 2019-03-13 | DOI: 10.1080/10627197.2019.1575723 | Educational Assessment, 24(1), 119–134
J. Soland, N. Jensen, Tran D. Keys, Sharon Bi, Emily Wolk
ABSTRACT A vast literature investigates academic disengagement among students, including its ultimate manifestation, dropping out of school. Research also shows that test disengagement can be a problem for many inferences educators and policymakers wish to draw from test scores. However, few studies consider whether academic and test disengagement are related. In this study, we examine whether behaviors indicative of academic disengagement like chronic absenteeism and course failures are related to behaviors indicative of test disengagement like rapidly guessing on items. We also examine whether social-emotional factors like low academic self-efficacy and self-management, which research suggests are the root causes of academic disengagement, are also related to rapid guessing behavior. Our results provide evidence that academic and test disengagement are related, including through a common association with poor self-management. The implications of this connection for measurement and practice are discussed.
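Rapid guessing is typically operationalized with a response-time threshold. The sketch below shows one common form of that computation; the 3-second threshold, the 10% flag rate, and the data are illustrative assumptions, not values from the study.

```python
# Sketch of flagging rapid guesses with a response-time threshold, one
# common operationalization of test disengagement. Threshold and cutoff
# values here are illustrative only.
import pandas as pd

responses = pd.DataFrame({
    "student":    ["s1", "s1", "s2", "s2"],
    "item":       ["i1", "i2", "i1", "i2"],
    "rt_seconds": [45.0, 2.1, 38.5, 52.0],
})

THRESHOLD = 3.0  # seconds; real studies often use item-specific thresholds
responses["rapid_guess"] = responses["rt_seconds"] < THRESHOLD

# Per-student rate of rapid guessing, which can then be related to
# academic-disengagement indicators such as absences or course failures.
rg_rate = responses.groupby("student")["rapid_guess"].mean()
flagged = rg_rate[rg_rate > 0.10]  # students above an illustrative cutoff
print(rg_rate, flagged, sep="\n")
```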
{"title":"Are Test and Academic Disengagement Related? Implications for Measurement and Practice","authors":"J. Soland, N. Jensen, Tran D. Keys, Sharon Bi, Emily Wolk","doi":"10.1080/10627197.2019.1575723","DOIUrl":"https://doi.org/10.1080/10627197.2019.1575723","url":null,"abstract":"ABSTRACT A vast literature investigates academic disengagement among students, including its ultimate manifestation, dropping out of school. Research also shows that test disengagement can be a problem for many inferences educators and policymakers wish to draw from test scores. However, few studies consider whether academic and test disengagement are related. In this study, we examine whether behaviors indicative of academic disengagement like chronic absenteeism and course failures are related to behaviors indicative of test disengagement like rapidly guessing on items. We also examine whether social-emotional factors like low academic self-efficacy and self-management, which research suggests are the root causes of academic disengagement, are also related to rapid guessing behavior. Our results provide evidence that academic and test disengagement are related, including through a common association with poor self-management. The implications of this connection for measurement and practice are discussed.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"119 - 134"},"PeriodicalIF":1.5,"publicationDate":"2019-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2019.1575723","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45941028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Immediate and Delayed Effect of Dynamic Assessment Approaches on EFL Learners’ Oral Narrative Performance and Anxiety
Pub Date: 2019-02-13 | DOI: 10.1080/10627197.2019.1578169 | Educational Assessment, 24(1), 135–154
Masoomeh Estaji, Mahsa Farahanynia
ABSTRACT The present study aimed to investigate the effect of two major approaches to Dynamic Assessment, namely the interventionist and interactionist approaches, on learners’ oral narrative performance and anxiety. To this end, 34 Iranian EFL learners were assigned to an Interactionist Group (InA.G) and an Interventionist Group (InV.G). Initially, both groups were given the Foreign Language Classroom Anxiety Scale and a pretest of speaking. In the treatment phase, the InV.G was asked to narrate a video and received instruction on their errors. The InA.G narrated the video while receiving scaffolding during narration. Then both groups were given a posttest and, two weeks later, a delayed posttest. The results indicated that both groups’ oral performance significantly increased while their anxiety decreased. Finally, a semi-structured interview was conducted; its results revealed that the InA.G experienced more anxiety, mostly due to a sense of interruption and of losing face.
{"title":"The Immediate and Delayed Effect of Dynamic Assessment Approaches on EFL Learners’ Oral Narrative Performance and Anxiety","authors":"Masoomeh Estaji, Mahsa Farahanynia","doi":"10.1080/10627197.2019.1578169","DOIUrl":"https://doi.org/10.1080/10627197.2019.1578169","url":null,"abstract":"ABSTRACT The present study aimed to investigate the effect of two major approaches of Dynamic Assessment, namely, interventionist and interactionist approaches, on learners’ oral narrative performance and anxiety. To this end, 34 Iranian EFL learners were assigned to an Interactionist Group (InA.G) and Interventionist Group (InV.G). Initially, both groups were given the Foreign Language Classroom Anxiety Scale and a pretest of speaking. In the treatment phase, the InV.G was asked to narrate a video and received instructions on their errors. The InA.G narrated the video while being provided with scaffolding during narration. Then both groups were given a posttest and, two weeks later, a delayed posttest. The results indicated that both groups’ oral performance significantly increased, while their anxiety reduced. In the end, a semi-structured interview was conducted whose results revealed that the InA.G experienced more anxiety mostly due to feeling a sense of interruption and losing face.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"135 - 154"},"PeriodicalIF":1.5,"publicationDate":"2019-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2019.1578169","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44206486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Scenario-Based Assessments in Writing: An Experimental Study
Pub Date: 2019-01-16 | DOI: 10.1080/10627197.2018.1557515 | Educational Assessment, 24(1), 73–90
Mo Zhang, P. V. van Rijn, P. Deane, R. Bennett
ABSTRACT Writing from source text is critical for developing college-and-career readiness because it is required in advanced academic environments and many vocations. Scenario-based assessment (SBA) represents one approach to measuring this ability. In such an assessment, the scenario presents an issue that the student is to read and write about. Before writing, lead-in exercises are presented to encourage the examinee to engage with the source materials and to model the process used in a classroom writing project. This study experimentally manipulated a middle-school assessment design to understand whether (1) the lead-in/essay structure increased scores erroneously, with a concomitant decrease in the test’s technical quality, and (2) the presence of a single unifying scenario affected scores or score meaning. In general, the SBA design did not appear to artificially increase total-test or essay scores. As importantly, it functioned as well as, and sometimes better than, the alternative designs in terms of the measurement characteristics examined.
{"title":"Scenario-Based Assessments in Writing: An Experimental Study","authors":"Mo Zhang, P. V. van Rijn, P. Deane, R. Bennett","doi":"10.1080/10627197.2018.1557515","DOIUrl":"https://doi.org/10.1080/10627197.2018.1557515","url":null,"abstract":"ABSTRACT Writing from source text is critical for developing college-and-career readiness because it is required in advanced academic environments and many vocations. Scenario-based assessment (SBA) represents one approach to measuring this ability. In such assessment, the scenario presents an issue that the student is to read and write about. Before writing, lead-in exercises are presented to encourage the examinee to engage with the source materials and to model the process used in a classroom writing project. This study experimentally manipulated a middle-school assessment design to understand if (1) the lead-in/essay structure increased scores erroneously with a concomitant decrease in test technical quality, and (2) the presence of a single unifying scenario affected scores or score meaning. In general, the SBA design did not appear to artificially increase total-test or essay scores. As importantly, it functioned as well as, sometimes better than, the alternative designs in terms of the measurement characteristics examined.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"73 - 90"},"PeriodicalIF":1.5,"publicationDate":"2019-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2018.1557515","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45041997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Evaluating Teacher Effectiveness Using Classroom Observations: A Rasch Analysis of the Rater Effects of Principals
Pub Date: 2019-01-09 | DOI: 10.1080/10627197.2018.1564272 | Educational Assessment, 24(1), 91–118
E. Jones, C. Bergin
ABSTRACT In most U.S. schools, teachers are evaluated using observation of teaching practice (OTP). This study investigates rater effects on OTP ratings among 421 principals in an authentic teacher evaluation system. Many-facet Rasch analysis (MFR) using a block of shared ratings revealed that principals generally (a) differentiated between more and less effective teachers, (b) rated their teachers with leniency (i.e., overused higher rating categories), and (c) differentiated between teaching practices (e.g., Cognitive Engagement vs. Classroom Management) with minimal halo effect. Individual principals varied significantly in degree of leniency, and approximately 12% of principals exhibited severe rater bias. Implications for use of OTP ratings for evaluating teachers’ effectiveness are discussed. Strengths and limitations of MFR to analyze rater effects in OTP are also discussed.
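A full many-facet Rasch analysis is normally run in specialized software (e.g., FACETS). As a descriptive analogue of the leniency estimate, a block of shared ratings lets each principal's mean deviation from the consensus on each rated episode be computed directly; the data, scale, and column names below are invented for illustration and do not reproduce the study's model.

```python
# Descriptive analogue of rater leniency on a block of shared ratings:
# a principal's leniency is their mean deviation from the episode-level
# consensus. Invented data; a 1-7 rubric is assumed.
import pandas as pd

ratings = pd.DataFrame({
    "principal": ["p1", "p1", "p2", "p2", "p3", "p3"],
    "episode":   ["e1", "e2", "e1", "e2", "e1", "e2"],
    "rating":    [5, 4, 6, 6, 4, 3],
})

# Consensus (mean) rating for each shared teaching episode.
consensus = ratings.groupby("episode")["rating"].transform("mean")

# Positive leniency = rates above consensus; negative = severe.
ratings["deviation"] = ratings["rating"] - consensus
leniency = ratings.groupby("principal")["deviation"].mean()
print(leniency.sort_values(ascending=False))
```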
{"title":"Evaluating Teacher Effectiveness Using Classroom Observations: A Rasch Analysis of the Rater Effects of Principals","authors":"E. Jones, C. Bergin","doi":"10.1080/10627197.2018.1564272","DOIUrl":"https://doi.org/10.1080/10627197.2018.1564272","url":null,"abstract":"ABSTRACT In most U.S. schools, teachers are evaluated using observation of teaching practice (OTP). This study investigates rater effects on OTP ratings among 421 principals in an authentic teacher evaluation system. Many-facet Rasch analysis (MFR) using a block of shared ratings revealed that principals generally (a) differentiated between more and less effective teachers, (b) rated their teachers with leniency (i.e., overused higher rating categories), and (c) differentiated between teaching practices (e.g., Cognitive Engagement vs. Classroom Management) with minimal halo effect. Individual principals varied significantly in degree of leniency, and approximately 12% of principals exhibited severe rater bias. Implications for use of OTP ratings for evaluating teachers’ effectiveness are discussed. Strengths and limitations of MFR to analyze rater effects in OTP are also discussed.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"118 - 91"},"PeriodicalIF":1.5,"publicationDate":"2019-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2018.1564272","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42072623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Testing Accommodations and the Measurement of Student Academic Growth
Pub Date: 2019-01-02 | DOI: 10.1080/10627197.2018.1545571 | Educational Assessment, 24(1), 57–72
H. Buzick
ABSTRACT Using two states’ grades 3 through 8 state assessment databases, this study documents the extent to which students were assigned testing accommodations for English language arts (ELA) or mathematics in only one of two consecutive years. The percentage of students with disabilities who were assigned accommodations in the current year only, or in the prior year only, in a given grade statewide was not trivial, sometimes exceeding 25%. The relationship between inconsistent assignment to any accommodations and both students’ prior proficiency level and aggregate growth is also documented. Group differences were observed at the state level. No practical differences were observed when covariates for inconsistent assignment were included in school value-added models, but very few schools had a substantial proportion of students with disabilities assigned accommodations inconsistently in a given grade. Implications for research and practice are discussed.
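The core quantity here, the share of students assigned accommodations in only one of two consecutive years, reduces to an exclusive-or across year-level flags. The sketch below shows that computation on invented data; the column names and values are hypothetical.

```python
# Sketch of counting students assigned accommodations in only one of two
# consecutive years. Data and column names are hypothetical.
import pandas as pd

year1 = pd.DataFrame({"student": ["a", "b", "c", "d"],
                      "accommodated_y1": [True, True, False, False]})
year2 = pd.DataFrame({"student": ["a", "b", "c", "d"],
                      "accommodated_y2": [True, False, True, False]})

merged = year1.merge(year2, on="student")
# XOR: accommodated in exactly one of the two years.
merged["inconsistent"] = merged["accommodated_y1"] ^ merged["accommodated_y2"]

# Share of students whose accommodation assignment changed between years.
pct_inconsistent = merged["inconsistent"].mean() * 100
print(f"{pct_inconsistent:.0f}% assigned accommodations in only one year")
```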
{"title":"Testing Accommodations and the Measurement of Student Academic Growth","authors":"H. Buzick","doi":"10.1080/10627197.2018.1545571","DOIUrl":"https://doi.org/10.1080/10627197.2018.1545571","url":null,"abstract":"ABSTRACT Using two states’ grades 3 through 8 state assessment databases, this study documents the extent to which students were assigned testing accommodations for ELA or mathematics in only one of two consecutive years. The percentage of students with disabilities who were assigned accommodations in the current year only or in the prior year only in a given grade statewide was not trivial, sometimes exceeding 25%. The relationship between inconsistent assignment to any accommodations and both students’ prior proficiency level and aggregate growth is also documented. Group differences were observed at the state level. No practical differences were observed when covariates for inconsistent assignment were included in school value-added models, but very few schools had a substantial proportion of students with disabilities assigned accommodations inconsistently in a given grade. Implications for research and practice are discussed.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"57 - 72"},"PeriodicalIF":1.5,"publicationDate":"2019-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2018.1545571","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"59626278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Developing a Measure of Spanish Phonological Awareness for Preschool Age Children: Spanish Individual Growth and Development Indicators
Pub Date: 2018-11-23 | DOI: 10.1080/10627197.2018.1545570 | Educational Assessment, 24(1), 33–56
Alisha K. Wackerle-Hollman, Lillian Durán, S. Brunner, José Palma, Theresa L. Kohlmeier, Michael C. Rodriguez
ABSTRACT Spanish speakers in the United States are a steadily increasing population, up by 233% since 1980. Given the growing population of dual language learners (DLLs) and the large numbers of Spanish-speaking children enrolled in pre-kindergarten programs, addressing the educational needs of preschool-aged DLLs has become a national imperative. Specifically, the intersection of this growing population and the dearth of appropriate assessment tools to evaluate DLLs’ early language and literacy skills creates a need for assessments that accurately measure preschool performance. This manuscript reports on the iterative design process for a measure of Spanish phonological awareness for preschool-aged DLLs: the Spanish Individual Growth and Development Indicators (S-IGDI) Primeros Sonidos. We employed a measure design framework to develop the measure and tested item function in a study of 970 four- to five-year-old DLLs. Results, including item-level analyses and evidence regarding construct and criterion validity, are reported.
{"title":"Developing a Measure of Spanish Phonological Awareness for Preschool Age Children: Spanish Individual Growth and Development Indicators","authors":"Alisha K. Wackerle-Hollman, Lillian Durán, S. Brunner, José Palma, Theresa L. Kohlmeier, Michael C. Rodriguez","doi":"10.1080/10627197.2018.1545570","DOIUrl":"https://doi.org/10.1080/10627197.2018.1545570","url":null,"abstract":"ABSTRACT Spanish speakers in the United States are a steadily increasing population, up by 233% since 1980. Given the growing population of dual language learners (DLLs) and the large numbers of Spanish-speaking children enrolled in pre-kindergarten programs, addressing the educational needs of preschool-aged DLLs has become a national imperative. Specifically, the intersection of this growing population and the dearth of appropriate assessment tools to evaluate DLLs early language and literacy skills creates a need for assessments that accurately measure preschool performance. This manuscript reports on the iterative design process of a measure of Spanish phonological awareness for preschool-aged DLLs: Spanish Individual Growth and Development Indicators (S-IGDI) Primeros Sonidos. We employed measure design framework to develop the measure and tested item function within a study of 970, 4–5 year old DLLs. Results, including item level analyses and evidence regarding construct and criterion validity are reported.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"33 - 56"},"PeriodicalIF":1.5,"publicationDate":"2018-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2018.1545570","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43291229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Impact of Item Stem Format on the Dimensional Structure of Mathematics Assessments
Pub Date: 2018-11-13 | DOI: 10.1080/10627197.2018.1545569 | Educational Assessment, 24(1), 13–32
A. Kan, O. Bulut, D. Cormier
ABSTRACT Item stem formats can alter the cognitive complexity as well as the type of abilities required for solving mathematics items. Consequently, it is possible that item stem formats can affect the dimensional structure of mathematics assessments. This empirical study investigated the relationship between item stem format and the dimensionality of mathematics assessments. A sample of 671 sixth-grade students was given two forms of a mathematics assessment in which mathematical expression (ME) items and word problems (WP) were used to measure the same content. The effects of mathematical language and reading abilities in responding to ME and WP items were explored using unidimensional and multidimensional item response theory models. The results showed that WP and ME items appear to differ with regard to the underlying abilities required to answer these items. Hence, the multidimensional model fit the response data better than the unidimensional model. For the accurate assessment of mathematics achievement, students’ reading and mathematical language abilities should also be considered when implementing mathematics assessments with ME and WP items.
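The study's dimensionality question was answered by comparing unidimensional and multidimensional IRT models. A quick descriptive probe of the same question is an eigendecomposition of the inter-item correlation matrix, where more than one dominant eigenvalue is consistent with multidimensionality. The simulation below is a sketch under assumed loadings (WP items drawing on both math and reading, ME items on math only), not the authors' analysis.

```python
# Descriptive dimensionality probe on simulated 0/1 responses: if ME and
# WP items tap partly different abilities, the inter-item correlation
# matrix should show more than one dominant eigenvalue. Loadings invented.
import numpy as np

rng = np.random.default_rng(0)
n_students = 671  # matches the sample size reported above
math_ability = rng.normal(size=n_students)
reading_ability = rng.normal(size=n_students)

# Five ME items load on math only; five WP items load on math and reading.
me = (math_ability[:, None] + rng.normal(size=(n_students, 5))) > 0
wp = ((0.6 * math_ability + 0.6 * reading_ability)[:, None]
      + rng.normal(size=(n_students, 5))) > 0
responses = np.hstack([me, wp]).astype(float)

# Eigenvalues of the 10 x 10 inter-item correlation matrix, largest first.
eigvals = np.linalg.eigvalsh(np.corrcoef(responses, rowvar=False))[::-1]
print("Leading eigenvalues:", np.round(eigvals[:3], 2))
# Two clearly dominant eigenvalues would be consistent with the
# two-dimensional (math + reading) structure the study reports.
```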
{"title":"The Impact of Item Stem Format on the Dimensional Structure of Mathematics Assessments","authors":"A. Kan, O. Bulut, D. Cormier","doi":"10.1080/10627197.2018.1545569","DOIUrl":"https://doi.org/10.1080/10627197.2018.1545569","url":null,"abstract":"ABSTRACT Item stem formats can alter the cognitive complexity as well as the type of abilities required for solving mathematics items. Consequently, it is possible that item stem formats can affect the dimensional structure of mathematics assessments. This empirical study investigated the relationship between item stem format and the dimensionality of mathematics assessments. A sample of 671 sixth-grade students was given two forms of a mathematics assessment in which mathematical expression (ME) items and word problems (WP) were used to measure the same content. The effects of mathematical language and reading abilities in responding to ME and WP items were explored using unidimensional and multidimensional item response theory models. The results showed that WP and ME items appear to differ with regard to the underlying abilities required to answer these items. Hence, the multidimensional model fit the response data better than the unidimensional model. For the accurate assessment of mathematics achievement, students’ reading and mathematical language abilities should also be considered when implementing mathematics assessments with ME and WP items.","PeriodicalId":46209,"journal":{"name":"Educational Assessment","volume":"24 1","pages":"13 - 32"},"PeriodicalIF":1.5,"publicationDate":"2018-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10627197.2018.1545569","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42649952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}