{"title":"Keynote: Rethinking measurement for accountable assessment","authors":"Mark R. Wilson","doi":"10.37517/978-1-74286-638-3_13","DOIUrl":null,"url":null,"abstract":"The underlying model for most formal educational measurement (e.g. standardised tests) is based on a very simple model: the student takes a test (possibly alongside other students). The complications of there being an instructional plan, actual instruction, interpretation of the outcome, and formulation of next steps, are all bypassed in considering how to model the process of measurement. There are some standard exceptions, of course: a pre-test/post-test context will involve two measurements, and attention to gain score, or similar. However, if we wish to design measurement to hold to Lehrer’s (2021) definition of ‘accountable assessment’ – as ‘actionable information for improving classroom instruction’ – then this narrow conceptualisation must be extended. In this presentation, I will posit a simple model that reflects the simple one-test context described above, and then elaborate on it by adding in a) a framework for design of the assessments that is keyed to educational interpretation, b) further rounds of data collection that can indicate changes in a student’s underlying ability, and c) provision for varied assessment modes that will allow for i) classroom-independent tasks that operate at the summative and meso levels, and ii) classroom-dependent tasks that operate at the micro level. The former are designed to provide a basis for triangulating student responses across different contexts, and the latter are designed to closely track the variation of student performance over time in a classroom instructional context. This framing will be exemplified in a in a K–5 elementary school that is seeking to improve the quality of instruction and students’ understandings of measure and arithmetic. The different levels of data collection will be instantiated by two different pieces of software, which operate at the micro level and the meso/summative levels respectively.","PeriodicalId":413895,"journal":{"name":"Research Conference 2021: Excellent progress for every student: Proceedings and Program","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Conference 2021: Excellent progress for every student: Proceedings and Program","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37517/978-1-74286-638-3_13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The underlying model for most formal educational measurement (e.g. standardised tests) is based on a very simple model: the student takes a test (possibly alongside other students). The complications of there being an instructional plan, actual instruction, interpretation of the outcome, and formulation of next steps, are all bypassed in considering how to model the process of measurement. There are some standard exceptions, of course: a pre-test/post-test context will involve two measurements, and attention to gain score, or similar. However, if we wish to design measurement to hold to Lehrer’s (2021) definition of ‘accountable assessment’ – as ‘actionable information for improving classroom instruction’ – then this narrow conceptualisation must be extended. In this presentation, I will posit a simple model that reflects the simple one-test context described above, and then elaborate on it by adding in a) a framework for design of the assessments that is keyed to educational interpretation, b) further rounds of data collection that can indicate changes in a student’s underlying ability, and c) provision for varied assessment modes that will allow for i) classroom-independent tasks that operate at the summative and meso levels, and ii) classroom-dependent tasks that operate at the micro level. The former are designed to provide a basis for triangulating student responses across different contexts, and the latter are designed to closely track the variation of student performance over time in a classroom instructional context. This framing will be exemplified in a in a K–5 elementary school that is seeking to improve the quality of instruction and students’ understandings of measure and arithmetic. The different levels of data collection will be instantiated by two different pieces of software, which operate at the micro level and the meso/summative levels respectively.