计算机自适应测试中增长估计的技术充分性:对进度监测的影响。

Ethan R Van Norman, Peter M Nelson, David C Parker
{"title":"计算机自适应测试中增长估计的技术充分性:对进度监测的影响。","authors":"Ethan R Van Norman,&nbsp;Peter M Nelson,&nbsp;David C Parker","doi":"10.1037/spq0000175","DOIUrl":null,"url":null,"abstract":"<p><p>Computer adaptive tests (CATs) hold promise to monitor student progress within multitiered systems of support. However, the relationship between how long and how often data are collected and the technical adequacy of growth estimates from CATs has not been explored. Given CAT administration times, it is important to identify optimal data collection schedules to minimize missed instructional time. We used simulation methodology to investigate how the duration and frequency of data collection influenced the reliability, validity, and precision of growth estimates from a math CAT. A progress monitoring dataset of 746 Grade 4, 664 Grade 5, and 400 Grade 6 students from 40 schools in the upper Midwest was used to generate model parameters. Across grades, 53% of students were female and 53% were White. Grade level was not as influential as the duration and frequency of data collection on the technical adequacy of growth estimates. Low-stakes decisions were possible after 14-18 weeks when data were collected weekly (420-540 min of assessment), 20-24 weeks when collected every other week (300-360 min of assessment), and 20-28 weeks (150-210 min of assessment) when data were collected once a month, depending on student grade level. The validity and precision of growth estimates improved when the duration and frequency of progress monitoring increased. Given the amount of time required to obtain technically adequate growth estimates in the present study, results highlight the importance of weighing the potential costs of missed instructional time relative to other types of assessments, such as curriculum-based measures. Implications for practice, research, as well as future directions are also discussed. (PsycINFO Database Record</p>","PeriodicalId":88124,"journal":{"name":"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association","volume":"32 3","pages":"379-391"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Technical adequacy of growth estimates from a computer adaptive test: Implications for progress monitoring.\",\"authors\":\"Ethan R Van Norman,&nbsp;Peter M Nelson,&nbsp;David C Parker\",\"doi\":\"10.1037/spq0000175\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Computer adaptive tests (CATs) hold promise to monitor student progress within multitiered systems of support. However, the relationship between how long and how often data are collected and the technical adequacy of growth estimates from CATs has not been explored. Given CAT administration times, it is important to identify optimal data collection schedules to minimize missed instructional time. We used simulation methodology to investigate how the duration and frequency of data collection influenced the reliability, validity, and precision of growth estimates from a math CAT. A progress monitoring dataset of 746 Grade 4, 664 Grade 5, and 400 Grade 6 students from 40 schools in the upper Midwest was used to generate model parameters. Across grades, 53% of students were female and 53% were White. Grade level was not as influential as the duration and frequency of data collection on the technical adequacy of growth estimates. Low-stakes decisions were possible after 14-18 weeks when data were collected weekly (420-540 min of assessment), 20-24 weeks when collected every other week (300-360 min of assessment), and 20-28 weeks (150-210 min of assessment) when data were collected once a month, depending on student grade level. The validity and precision of growth estimates improved when the duration and frequency of progress monitoring increased. Given the amount of time required to obtain technically adequate growth estimates in the present study, results highlight the importance of weighing the potential costs of missed instructional time relative to other types of assessments, such as curriculum-based measures. Implications for practice, research, as well as future directions are also discussed. (PsycINFO Database Record</p>\",\"PeriodicalId\":88124,\"journal\":{\"name\":\"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association\",\"volume\":\"32 3\",\"pages\":\"379-391\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1037/spq0000175\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2016/8/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"School psychology quarterly : the official journal of the Division of School Psychology, American Psychological Association","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1037/spq0000175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2016/8/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

计算机适应性测试(CATs)有望在多层支持系统中监测学生的进步。然而,收集数据的时间和频率与从CATs获得的增长估计的技术充分性之间的关系尚未得到探讨。给定CAT管理时间,确定最佳数据收集计划以最小化错过的教学时间是很重要的。我们使用模拟方法来研究数据收集的持续时间和频率如何影响数学CAT增长估计的可靠性、有效性和精度。来自中西部北部40所学校的746名4年级学生、664名5年级学生和400名6年级学生的进度监测数据用于生成模型参数。从年级来看,53%的学生是女生,53%是白人。职等对增长估计在技术上是否适当的影响不如收集数据的持续时间和频率大。根据学生的年级水平,在14-18周每周收集数据(420-540分钟评估)、20-24周每隔一周收集数据(300-360分钟评估)和20-28周(150-210分钟评估)后可以做出低风险决策。当进度监测的持续时间和频率增加时,增长估计的有效性和准确性就会提高。鉴于在本研究中获得技术上适当的增长估计所需的时间,结果突出了权衡错过教学时间相对于其他类型的评估(如基于课程的措施)的潜在成本的重要性。本文还讨论了对实践、研究以及未来发展方向的影响。(PsycINFO数据库记录
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Technical adequacy of growth estimates from a computer adaptive test: Implications for progress monitoring.

Computer adaptive tests (CATs) hold promise to monitor student progress within multitiered systems of support. However, the relationship between how long and how often data are collected and the technical adequacy of growth estimates from CATs has not been explored. Given CAT administration times, it is important to identify optimal data collection schedules to minimize missed instructional time. We used simulation methodology to investigate how the duration and frequency of data collection influenced the reliability, validity, and precision of growth estimates from a math CAT. A progress monitoring dataset of 746 Grade 4, 664 Grade 5, and 400 Grade 6 students from 40 schools in the upper Midwest was used to generate model parameters. Across grades, 53% of students were female and 53% were White. Grade level was not as influential as the duration and frequency of data collection on the technical adequacy of growth estimates. Low-stakes decisions were possible after 14-18 weeks when data were collected weekly (420-540 min of assessment), 20-24 weeks when collected every other week (300-360 min of assessment), and 20-28 weeks (150-210 min of assessment) when data were collected once a month, depending on student grade level. The validity and precision of growth estimates improved when the duration and frequency of progress monitoring increased. Given the amount of time required to obtain technically adequate growth estimates in the present study, results highlight the importance of weighing the potential costs of missed instructional time relative to other types of assessments, such as curriculum-based measures. Implications for practice, research, as well as future directions are also discussed. (PsycINFO Database Record

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Teacher and student race as a predictor for negative feedback during instruction. Fostering youth self-efficacy to address transgender and racial diversity issues: The role of gay-straight alliances. The big two personality traits and adolescents' complete mental health: The mediation role of perceived school stress. Examining the stability of social, emotional, and behavioral risk status: Implications for screening frequency. Developing a direct rating behavior scale for depression in middle school students.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1