计算机自适应测试中增长估计的技术充分性:对进度监测的影响。

School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association Pub Date : 2017-09-01 Epub Date: 2016-08-08 DOI:10.1037/spq0000175

Ethan R Van Norman, Peter M Nelson, David C Parker

{"title":"计算机自适应测试中增长估计的技术充分性:对进度监测的影响。","authors":"Ethan R Van Norman, Peter M Nelson, David C Parker","doi":"10.1037/spq0000175","DOIUrl":null,"url":null,"abstract":"Computer adaptive tests (CATs) hold promise to monitor student progress within multitiered systems of support. However, the relationship between how long and how often data are collected and the technical adequacy of growth estimates from CATs has not been explored. Given CAT administration times, it is important to identify optimal data collection schedules to minimize missed instructional time. We used simulation methodology to investigate how the duration and frequency of data collection influenced the reliability, validity, and precision of growth estimates from a math CAT. A progress monitoring dataset of 746 Grade 4, 664 Grade 5, and 400 Grade 6 students from 40 schools in the upper Midwest was used to generate model parameters. Across grades, 53% of students were female and 53% were White. Grade level was not as influential as the duration and frequency of data collection on the technical adequacy of growth estimates. Low-stakes decisions were possible after 14-18 weeks when data were collected weekly (420-540 min of assessment), 20-24 weeks when collected every other week (300-360 min of assessment), and 20-28 weeks (150-210 min of assessment) when data were collected once a month, depending on student grade level. The validity and precision of growth estimates improved when the duration and frequency of progress monitoring increased. Given the amount of time required to obtain technically adequate growth estimates in the present study, results highlight the importance of weighing the potential costs of missed instructional time relative to other types of assessments, such as curriculum-based measures. Implications for practice, research, as well as future directions are also discussed. (PsycINFO Database Record","PeriodicalId":88124,"journal":{"name":"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association","volume":"32 3","pages":"379-391"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Technical adequacy of growth estimates from a computer adaptive test: Implications for progress monitoring.\",\"authors\":\"Ethan R Van Norman, Peter M Nelson, David C Parker\",\"doi\":\"10.1037/spq0000175\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer adaptive tests (CATs) hold promise to monitor student progress within multitiered systems of support. However, the relationship between how long and how often data are collected and the technical adequacy of growth estimates from CATs has not been explored. Given CAT administration times, it is important to identify optimal data collection schedules to minimize missed instructional time. We used simulation methodology to investigate how the duration and frequency of data collection influenced the reliability, validity, and precision of growth estimates from a math CAT. A progress monitoring dataset of 746 Grade 4, 664 Grade 5, and 400 Grade 6 students from 40 schools in the upper Midwest was used to generate model parameters. Across grades, 53% of students were female and 53% were White. Grade level was not as influential as the duration and frequency of data collection on the technical adequacy of growth estimates. Low-stakes decisions were possible after 14-18 weeks when data were collected weekly (420-540 min of assessment), 20-24 weeks when collected every other week (300-360 min of assessment), and 20-28 weeks (150-210 min of assessment) when data were collected once a month, depending on student grade level. The validity and precision of growth estimates improved when the duration and frequency of progress monitoring increased. Given the amount of time required to obtain technically adequate growth estimates in the present study, results highlight the importance of weighing the potential costs of missed instructional time relative to other types of assessments, such as curriculum-based measures. Implications for practice, research, as well as future directions are also discussed. (PsycINFO Database Record\",\"PeriodicalId\":88124,\"journal\":{\"name\":\"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association\",\"volume\":\"32 3\",\"pages\":\"379-391\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1037/spq0000175\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2016/8/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1037/spq0000175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2016/8/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

计算机适应性测试(CATs)有望在多层支持系统中监测学生的进步。然而，收集数据的时间和频率与从CATs获得的增长估计的技术充分性之间的关系尚未得到探讨。给定CAT管理时间，确定最佳数据收集计划以最小化错过的教学时间是很重要的。我们使用模拟方法来研究数据收集的持续时间和频率如何影响数学CAT增长估计的可靠性、有效性和精度。来自中西部北部40所学校的746名4年级学生、664名5年级学生和400名6年级学生的进度监测数据用于生成模型参数。从年级来看，53%的学生是女生，53%是白人。职等对增长估计在技术上是否适当的影响不如收集数据的持续时间和频率大。根据学生的年级水平，在14-18周每周收集数据(420-540分钟评估)、20-24周每隔一周收集数据(300-360分钟评估)和20-28周(150-210分钟评估)后可以做出低风险决策。当进度监测的持续时间和频率增加时，增长估计的有效性和准确性就会提高。鉴于在本研究中获得技术上适当的增长估计所需的时间，结果突出了权衡错过教学时间相对于其他类型的评估(如基于课程的措施)的潜在成本的重要性。本文还讨论了对实践、研究以及未来发展方向的影响。(PsycINFO数据库记录

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Technical adequacy of growth estimates from a computer adaptive test: Implications for progress monitoring.

Computer adaptive tests (CATs) hold promise to monitor student progress within multitiered systems of support. However, the relationship between how long and how often data are collected and the technical adequacy of growth estimates from CATs has not been explored. Given CAT administration times, it is important to identify optimal data collection schedules to minimize missed instructional time. We used simulation methodology to investigate how the duration and frequency of data collection influenced the reliability, validity, and precision of growth estimates from a math CAT. A progress monitoring dataset of 746 Grade 4, 664 Grade 5, and 400 Grade 6 students from 40 schools in the upper Midwest was used to generate model parameters. Across grades, 53% of students were female and 53% were White. Grade level was not as influential as the duration and frequency of data collection on the technical adequacy of growth estimates. Low-stakes decisions were possible after 14-18 weeks when data were collected weekly (420-540 min of assessment), 20-24 weeks when collected every other week (300-360 min of assessment), and 20-28 weeks (150-210 min of assessment) when data were collected once a month, depending on student grade level. The validity and precision of growth estimates improved when the duration and frequency of progress monitoring increased. Given the amount of time required to obtain technically adequate growth estimates in the present study, results highlight the importance of weighing the potential costs of missed instructional time relative to other types of assessments, such as curriculum-based measures. Implications for practice, research, as well as future directions are also discussed. (PsycINFO Database Record

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association

自引率

0.00%

发文量