首页 > 最新文献

Educational Measurement-Issues and Practice最新文献

英文 中文
On the Cover: Sequential Progression and Item Review in Timed Tests: Patterns in Process Data 封面:时间测试中的顺序进展和项目审查:过程数据中的模式
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-05-20 DOI: 10.1111/emip.12670
Yuan-Ling Liaw
<p>We are excited to announce the winners of the 12th <i>EM:IP</i> Cover Graphic/Data Visualization Competition. Each year, we invite our readers to submit visualizations that are not only accurate and insightful but also visually compelling and easy to understand. This year's submissions explored key topics in educational measurement, including process data, item characteristics, test design, and score interpretation. We extend our sincere thanks to everyone who submitted their work, and we are especially grateful to the <i>EM:IP</i> editorial board for their thoughtful review and feedback in the selection process.</p><p>Winning entries may be featured on the cover of a future <i>EM:IP</i> issue. Previous winners who have not yet appeared on a cover remain eligible for upcoming issues.</p><p>This issue's cover features Sequential Progression and Item Review in Timed Tests: Patterns in Process Data, a compelling visualization created by Christian Meyer from the Association of American Medical Colleges and the University of Maryland, along with Ying Jin and Marc Kroopnick, both from the Association of American Medical Colleges.</p><p>The visualization, developed using R, presents smoothed density plots derived from process data collected during a high-stakes admissions test. It illustrates how examinees navigated one section of the test within a 95-minute time limit. The <i>x</i>-axis represents elapsed time in minutes. The <i>y</i>-axis segments item positions into five groups: 1 to 15, 16 to 25, 26 to 35, 36 to 45, and 46 to 59. Meyer and his colleagues explain that, for each item group, the height of the plot indicates density. The supports of the estimated densities extend beyond the start and end of the test to allow the plots to approach zero smoothly at the extremes.</p><p>Color is used effectively to distinguish between initial engagement and item review. Blue areas indicate when items were first viewed, while red areas show when examinees revisited those same items. The authors describe, “The figure illustrates a common test-taking strategy: examinees initially progress sequentially through the test, as shown by the early blue density peaks for each group. Toward the end of the session, they frequently revisit earlier items, as evidenced by the red peaks clustering near the time limit.” This pattern reflects deliberate time management, with examinees dividing their approach into two distinct phases.</p><p>They continue, “In the first phase, they assess each item, either attempting a response or skipping it for later review. In the second phase, they revisit skipped or uncertain items, providing more considered answers when time permits or resorting to random guessing if necessary.”</p><p>According to Meyer and his colleagues, the visualization offers valuable insight into examinees’ time management and engagement strategies during timed tests. They conclude, “It captures temporal strategies, such as sequential progression and end-of-sessi
我们很高兴地宣布第十二届EM:IP封面图形/数据可视化比赛的获胜者。每年,我们都会邀请读者提交不仅准确且富有洞察力,而且在视觉上引人注目且易于理解的可视化图像。今年提交的作品探讨了教育测量的关键主题,包括过程数据、项目特征、测试设计和分数解释。我们衷心感谢所有提交作品的人,并特别感谢《新知识产权》编辑委员会在评选过程中所做的周到审查和反馈。获奖作品可能会出现在未来的《新兴市场:知识产权》杂志的封面上。以前没有出现在封面上的获奖者仍然有资格出现在即将出版的杂志上。这期的封面特色是时序测试中的顺序进展和项目审查:过程数据中的模式,这是由美国医学院协会和马里兰大学的Christian Meyer以及美国医学院协会的Ying Jin和Marc Kroopnick创建的引人注目的可视化。使用R开发的可视化显示了从高风险入学测试期间收集的过程数据得出的平滑密度图。它说明了考生如何在95分钟的时间内完成考试的一个部分。x轴表示以分钟为单位的经过时间。y轴分段项目位置分为五组:1至15、16至25、26至35、36至45和46至59。迈耶和他的同事解释说,对于每个项目组,图的高度表示密度。估计密度的支持超出了测试的开始和结束,以允许图在极端情况下平稳地接近零。颜色被有效地用于区分初始参与和项目回顾。蓝色区域表示考生第一次看这些题目的时间,而红色区域表示考生再次看这些题目的时间。作者描述说:“该图说明了一种常见的考试策略:考生最初是按顺序通过考试的,正如每组的早期蓝色密度峰值所示。在会议结束时,他们经常回顾以前的项目,在时间限制附近聚集的红色峰值证明了这一点。”这种模式反映了刻意的时间管理,考生将他们的方法分为两个不同的阶段。他们继续说,“在第一阶段,他们评估每个项目,要么尝试回答,要么跳过它供以后回顾。在第二阶段,他们重新审视跳过的或不确定的项目,在时间允许的情况下提供更深思熟虑的答案,或者在必要时诉诸随机猜测。”根据迈耶和他的同事的说法,可视化提供了宝贵的见解,了解考生在定时考试中的时间管理和参与策略。他们的结论是:“它捕捉了时间策略,比如顺序进展和期末复习,为了解考生如何与考试结构和限制进行互动提供了有价值的见解。”虽然该图不包括有关单个项目的信息,并且仅限于项目位置范围,但它展示了如何以可访问和可解释的格式表示时间行为数据。其清晰度和设计使其成为通过过程数据传达测试模式的有用工具。如果你有兴趣了解更多关于数据可视化的信息,请联系Christian Meyer,邮箱:[email protected]。我们还邀请您参加年度EM:IP封面图形/数据可视化竞赛。详细信息可以在NCME和期刊网站上找到,你的参赛作品可能会出现在未来一期的封面上。如有任何问题或反馈,请联系袁玲,邮箱:[email protected]。
{"title":"On the Cover: Sequential Progression and Item Review in Timed Tests: Patterns in Process Data","authors":"Yuan-Ling Liaw","doi":"10.1111/emip.12670","DOIUrl":"https://doi.org/10.1111/emip.12670","url":null,"abstract":"&lt;p&gt;We are excited to announce the winners of the 12th &lt;i&gt;EM:IP&lt;/i&gt; Cover Graphic/Data Visualization Competition. Each year, we invite our readers to submit visualizations that are not only accurate and insightful but also visually compelling and easy to understand. This year's submissions explored key topics in educational measurement, including process data, item characteristics, test design, and score interpretation. We extend our sincere thanks to everyone who submitted their work, and we are especially grateful to the &lt;i&gt;EM:IP&lt;/i&gt; editorial board for their thoughtful review and feedback in the selection process.&lt;/p&gt;&lt;p&gt;Winning entries may be featured on the cover of a future &lt;i&gt;EM:IP&lt;/i&gt; issue. Previous winners who have not yet appeared on a cover remain eligible for upcoming issues.&lt;/p&gt;&lt;p&gt;This issue's cover features Sequential Progression and Item Review in Timed Tests: Patterns in Process Data, a compelling visualization created by Christian Meyer from the Association of American Medical Colleges and the University of Maryland, along with Ying Jin and Marc Kroopnick, both from the Association of American Medical Colleges.&lt;/p&gt;&lt;p&gt;The visualization, developed using R, presents smoothed density plots derived from process data collected during a high-stakes admissions test. It illustrates how examinees navigated one section of the test within a 95-minute time limit. The &lt;i&gt;x&lt;/i&gt;-axis represents elapsed time in minutes. The &lt;i&gt;y&lt;/i&gt;-axis segments item positions into five groups: 1 to 15, 16 to 25, 26 to 35, 36 to 45, and 46 to 59. Meyer and his colleagues explain that, for each item group, the height of the plot indicates density. The supports of the estimated densities extend beyond the start and end of the test to allow the plots to approach zero smoothly at the extremes.&lt;/p&gt;&lt;p&gt;Color is used effectively to distinguish between initial engagement and item review. Blue areas indicate when items were first viewed, while red areas show when examinees revisited those same items. The authors describe, “The figure illustrates a common test-taking strategy: examinees initially progress sequentially through the test, as shown by the early blue density peaks for each group. Toward the end of the session, they frequently revisit earlier items, as evidenced by the red peaks clustering near the time limit.” This pattern reflects deliberate time management, with examinees dividing their approach into two distinct phases.&lt;/p&gt;&lt;p&gt;They continue, “In the first phase, they assess each item, either attempting a response or skipping it for later review. In the second phase, they revisit skipped or uncertain items, providing more considered answers when time permits or resorting to random guessing if necessary.”&lt;/p&gt;&lt;p&gt;According to Meyer and his colleagues, the visualization offers valuable insight into examinees’ time management and engagement strategies during timed tests. They conclude, “It captures temporal strategies, such as sequential progression and end-of-sessi","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/emip.12670","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144117901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Module 38: Differential Item Functioning by Multiple Variables Using Moderated Nonlinear Factor Analysis 数字模块38:微分项目功能的多变量使用有调节的非线性因素分析
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-05-20 DOI: 10.1111/emip.12669
Sanford R. Student, Ethan M. McCormick

Module Abstract

When investigating potential bias in educational test items via differential item functioning (DIF) analysis, researchers have historically been limited to comparing two groups of students at a time. The recent introduction of Moderated Nonlinear Factor Analysis (MNLFA) generalizes Item Response Theory models to extend the assessment of DIF to an arbitrary number of background variables. This facilitates more complex analyses such as DIF across more than two groups (e.g. low/middle/high socioeconomic status), across more than one background variable (e.g. DIF by race/ethnicity and gender), across non-categorical background variables (e.g. DIF by parental income), and more. Framing MNLFA as a generalization of the two-parameter logistic IRT model, we introduce the model with an emphasis on the parameters representing DIF versus impact; describe the current state of the art for estimating MNLFA models; and illustrate the application of MNLFA in a scenario where one wants to test for DIF across two background variables at once.

当通过差异项目功能(DIF)分析调查教育测试项目的潜在偏见时,研究人员历来仅限于一次比较两组学生。最近引入的有调节非线性因子分析(MNLFA)推广了项目反应理论模型,将DIF的评估扩展到任意数量的背景变量。这有助于进行更复杂的分析,例如跨两个以上群体(例如低/中/高社会经济地位)的DIF,跨一个以上背景变量(例如按种族/民族和性别划分的DIF),跨非分类背景变量(例如按父母收入划分的DIF)等等。将MNLFA框架为双参数logistic IRT模型的推广,我们介绍了该模型,重点介绍了表示DIF与影响的参数;描述估计MNLFA模型的最新技术;并说明MNLFA在一个场景中的应用,在这个场景中,人们想要同时测试两个背景变量的DIF。
{"title":"Digital Module 38: Differential Item Functioning by Multiple Variables Using Moderated Nonlinear Factor Analysis","authors":"Sanford R. Student,&nbsp;Ethan M. McCormick","doi":"10.1111/emip.12669","DOIUrl":"https://doi.org/10.1111/emip.12669","url":null,"abstract":"<div>\u0000 \u0000 <section>\u0000 \u0000 <h3> Module Abstract</h3>\u0000 \u0000 <p>When investigating potential bias in educational test items via differential item functioning (DIF) analysis, researchers have historically been limited to comparing two groups of students at a time. The recent introduction of Moderated Nonlinear Factor Analysis (MNLFA) generalizes Item Response Theory models to extend the assessment of DIF to an arbitrary number of background variables. This facilitates more complex analyses such as DIF across more than two groups (e.g. low/middle/high socioeconomic status), across more than one background variable (e.g. DIF by race/ethnicity and gender), across non-categorical background variables (e.g. DIF by parental income), and more. Framing MNLFA as a generalization of the two-parameter logistic IRT model, we introduce the model with an emphasis on the parameters representing DIF versus impact; describe the current state of the art for estimating MNLFA models; and illustrate the application of MNLFA in a scenario where one wants to test for DIF across two background variables at once.</p>\u0000 </section>\u0000 </div>","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 2","pages":"39-41"},"PeriodicalIF":2.7,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144117928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
2024 NCME Presidential Address: Challenging Traditional Views of Measurement 2024年NCME主席演讲:挑战传统的测量观
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-05-19 DOI: 10.1111/emip.12673
Michael E. Walker

This article is adapted from the 2024 NCME Presidential Address. It reflects a personal journey to challenge traditional views of measurement. Considering alternative viewpoints with an open mind led to several solutions to perplexing problems at the time. The article discusses the culture-boundedness of measurement and the need to take that into consideration when designing tests.

本文改编自2024年NCME总统演讲。它反映了一个挑战传统测量观点的个人旅程。以开放的心态考虑不同的观点,为当时令人困惑的问题提供了几种解决方案。本文讨论了测量的文化有界性以及在设计测试时考虑这一点的必要性。
{"title":"2024 NCME Presidential Address: Challenging Traditional Views of Measurement","authors":"Michael E. Walker","doi":"10.1111/emip.12673","DOIUrl":"https://doi.org/10.1111/emip.12673","url":null,"abstract":"<p>This article is adapted from the 2024 NCME Presidential Address. It reflects a personal journey to challenge traditional views of measurement. Considering alternative viewpoints with an open mind led to several solutions to perplexing problems at the time. The article discusses the culture-boundedness of measurement and the need to take that into consideration when designing tests.</p>","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 2","pages":"32-38"},"PeriodicalIF":2.7,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144117879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms 自动化项目生成测试表单分析的概括性理论方法
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-05-04 DOI: 10.1111/emip.12671
Stella Y. Kim, Sungyeun Kim

This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A total of 74 students participated in this study and responded to AIG-based test forms. Then, we analyzed the data using four distinct designs based on the data collection design, and conceptualization of true scores and measurement conditions over hypothetical replications. This study also examined the theoretical relationships among the four data collection designs and highlighted the potential impact of confounding between item templates and item clones.

本文提出了几种多变量概括性理论设计,用于分析基于自动项目生成(AIG)的测试表格。本研究使用真实数据来说明分析过程,并讨论实际注意事项。我们收集了两组学生的数据,每一组都收到了AIG生成的不同表格。共有74名学生参与了本研究,并填写了基于ai的测试表格。然后,我们在数据收集设计的基础上,使用四种不同的设计来分析数据,并概念化真实分数和假设重复的测量条件。本研究也检验了四种数据收集设计之间的理论关系,并强调了项目模板和项目克隆之间混淆的潜在影响。
{"title":"Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms","authors":"Stella Y. Kim,&nbsp;Sungyeun Kim","doi":"10.1111/emip.12671","DOIUrl":"https://doi.org/10.1111/emip.12671","url":null,"abstract":"<p>This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A total of 74 students participated in this study and responded to AIG-based test forms. Then, we analyzed the data using four distinct designs based on the data collection design, and conceptualization of true scores and measurement conditions over hypothetical replications. This study also examined the theoretical relationships among the four data collection designs and highlighted the potential impact of confounding between item templates and item clones.</p>","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 2","pages":"20-31"},"PeriodicalIF":2.7,"publicationDate":"2025-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144118008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applications and Modeling of Keystroke Logs in Writing Assessments 击键日志在写作评估中的应用和建模
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-04-21 DOI: 10.1111/emip.12668
Mo Zhang, Paul Deane, Andrew Hoang, Hongwen Guo, Chen Li

In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal characteristics. In the first study, we examined the effects of test environment on writing characteristics: at-home versus in-center, using features extracted from keystroke logs. In a second study, we explored ways to measure stable personal characteristics and traits. As opposed to feature engineering that can be difficult to scale, raw keystroke logs were used as input in the second study, and large language models were developed to infer latent relations in the data. Implications, limitations, and future research directions are also discussed.

在本文中,我们描述了两项实证研究,证明了击键日志在写作评估中的应用和建模。我们举例说明了两种不同的方法来模拟写作过程中的差异:分析手工制作的理论驱动特征的平均差异,以及使用大型语言模型来识别稳定的个人特征。在第一项研究中,我们使用从击键日志中提取的特征,检查了测试环境对书写特性的影响:在家还是在中心。在第二项研究中,我们探索了衡量稳定的个人特征和特质的方法。与难以扩展的特征工程相反,在第二项研究中,原始击键日志被用作输入,并开发了大型语言模型来推断数据中的潜在关系。讨论了研究的意义、局限性和未来的研究方向。
{"title":"Applications and Modeling of Keystroke Logs in Writing Assessments","authors":"Mo Zhang,&nbsp;Paul Deane,&nbsp;Andrew Hoang,&nbsp;Hongwen Guo,&nbsp;Chen Li","doi":"10.1111/emip.12668","DOIUrl":"https://doi.org/10.1111/emip.12668","url":null,"abstract":"<p>In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal characteristics. In the first study, we examined the effects of test environment on writing characteristics: at-home versus in-center, using features extracted from keystroke logs. In a second study, we explored ways to measure stable personal characteristics and traits. As opposed to feature engineering that can be difficult to scale, raw keystroke logs were used as input in the second study, and large language models were developed to infer latent relations in the data. Implications, limitations, and future research directions are also discussed.</p>","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 2","pages":"5-19"},"PeriodicalIF":2.7,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144117931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Cover 覆盖问题
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-02-17 DOI: 10.1111/emip.12611
{"title":"Issue Cover","authors":"","doi":"10.1111/emip.12611","DOIUrl":"https://doi.org/10.1111/emip.12611","url":null,"abstract":"","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/emip.12611","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Module 37: Introduction to Item Response Tree (IRTree) Models 数字模块37:项目响应树(IRTree)模型介绍
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-02-17 DOI: 10.1111/emip.12665
Nana Kim, Jiayi Deng, Yun Leng Wong

Module Abstract

Item response tree (IRTree) models, an item response modeling approach that incorporates a tree structure, have become a popular method for many applications in measurement. IRTree models characterize the underlying response processes using a decision tree structure, where the internal decision outcome at each node is parameterized with an item response theory (IRT) model. Such models provide a flexible way of investigating and modeling underlying response processes, which can be useful for examining sources of individual differences in measurement and addressing measurement issues that traditional IRT models cannot deal with. In this module, we discuss the conceptual framework of IRTree models and demonstrate examples of their applications in the context of both cognitive and noncognitive assessments. We also introduce some possible extensions of the model and provide a demonstration of an example data analysis in R.

模块 摘要 项目反应树(IRTree)模型是一种包含树形结构的项目反应建模方法,在测量领域的许多应用中已成为一种流行的方法。IRTree 模型采用决策树结构来描述基本的反应过程,其中每个节点的内部决策结果都用项目反应理论(IRT)模型来参数化。这类模型提供了一种灵活的方法来研究和模拟基本的反应过程,这对于研究测量中个体差异的来源和解决传统 IRT 模型无法解决的测量问题非常有用。在本模块中,我们将讨论 IRTree 模型的概念框架,并举例说明其在认知和非认知评估中的应用。我们还将介绍该模型的一些可能扩展,并提供一个用 R 进行数据分析的示例。
{"title":"Digital Module 37: Introduction to Item Response Tree (IRTree) Models","authors":"Nana Kim,&nbsp;Jiayi Deng,&nbsp;Yun Leng Wong","doi":"10.1111/emip.12665","DOIUrl":"https://doi.org/10.1111/emip.12665","url":null,"abstract":"<div>\u0000 \u0000 <section>\u0000 \u0000 <h3> Module Abstract</h3>\u0000 \u0000 <p>Item response tree (IRTree) models, an item response modeling approach that incorporates a tree structure, have become a popular method for many applications in measurement. IRTree models characterize the underlying response processes using a decision tree structure, where the internal decision outcome at each node is parameterized with an item response theory (IRT) model. Such models provide a flexible way of investigating and modeling underlying response processes, which can be useful for examining sources of individual differences in measurement and addressing measurement issues that traditional IRT models cannot deal with. In this module, we discuss the conceptual framework of IRTree models and demonstrate examples of their applications in the context of both cognitive and noncognitive assessments. We also introduce some possible extensions of the model and provide a demonstration of an example data analysis in R.</p>\u0000 </section>\u0000 </div>","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 1","pages":"109-110"},"PeriodicalIF":2.7,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Cover: Unraveling Reading Recognition Trajectories: Classifying Student Development through Growth Mixture Modeling 封面:解读阅读识别轨迹:通过混合成长模型对学生发展进行分类
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-02-17 DOI: 10.1111/emip.12667
Yuan-Ling Liaw

The cover of this issue features “Unraveling Reading Recognition Trajectories: Classifying Student Development through Growth Mixture Modeling” by Xingyao Xiao and Sophia Rabe-Hesketh from the University of California, Berkeley. Using advanced Bayesian growth mixture modeling, their research examines how reading recognition develops between ages 6 and 14, identifying three distinct patterns of growth. This study provides a detailed and nuanced understanding of how students’ reading abilities progress over time.

Xiao and Rabe-Hesketh illustrated their findings using a multiplot visualization. It combines model-implied class-specific mean trajectories, a shaded 50% mid-range, and box-plots of observed reading scores, effectively highlighting the variability in reading progress among different learner groups. By juxtaposing observed data with model predictions, the visualization clearly depicts diverse growth patterns. Additionally, it emphasizes the variance and covariance of random effects, offering valuable insights often overlooked in similar analyses.

The three-class model described by Xiao and Rabe-Hesketh effectively explains different patterns of student growth. The first group, termed the “Early Bloomers,” comprises about 14% of the population who start with strong reading abilities and steadily improve. By age six, they show high reading scores and greater variability in growth trajectories compared to other groups. Xiao and Rabe-Hesketh note, “These students exhibit greater variability in growth curves at age six, with an 88% likelihood for those deviating 2 standard deviations below or above the mean to stray from the average growth rate.” This highlights their potential for early reading success.

The “Rapid Catch-Up Learners” represent 35% of students, starting with lower scores but progressing rapidly to often surpass Early Bloomers by adolescence. Xiao and Rabe-Hesketh explain, “Though showing minimal heterogeneity in growth trajectories at age 6, these paths diverge due to a positive correlation between intercepts and slope. Those with trajectories 2 standard deviations above or below the mean at age 6 possess an 81% likelihood of deviating from the average growth rate.” This group highlights the potential of slower starters to excel with targeted support.

Lastly, the “Steady Progressors” start with the lowest average scores at age six but show steady, consistent growth over time. By age 14, their scores begin to overlap with those of other groups, despite maintaining an initial gap. “These students are projected to deviate 605% more from the mean at age 14 than at age 6, approximately seven times as much.” Representing a majority of students, this group highlights the importance of persistence and gradual progress.

Through their research, Xiao and Rabe-Hesketh define the diverse trajectories of reading development. Whether a student's growth is rapid, steady, or gradual, every trajectory deser

本期封面文章是加州大学伯克利分校的肖星耀和索菲亚-拉贝-赫斯基(Sophia Rabe-Hesketh)的 "解读阅读识别轨迹:加州大学伯克利分校的Xingyao Xiao和Sophia Rabe-Hesketh撰写的 "通过成长混合模型对学生的发展进行分类"。他们的研究采用先进的贝叶斯成长混合模型,考察了 6 至 14 岁学生的阅读识别能力发展情况,并确定了三种不同的成长模式。这项研究详细而细致地揭示了学生的阅读能力是如何随着时间的推移而进步的。它结合了模型推测的班级平均轨迹、阴影 50%的中间范围以及观察到的阅读分数的箱形图,有效地突出了不同学习者群体之间阅读进步的差异性。通过将观测数据与模型预测并列,该可视化图表清晰地描述了不同的增长模式。此外,它还强调了随机效应的方差和协方差,提供了在类似分析中经常被忽视的宝贵见解。肖和拉贝-赫斯基思所描述的三类模型有效地解释了学生的不同成长模式。第一类被称为 "早期绽放者",约占总人口的 14%,他们开始时阅读能力很强,并稳步提高。到六岁时,他们的阅读得分很高,与其他群体相比,他们的成长轨迹变化更大。Xiao和Rabe-Hesketh指出:"这些学生在六岁时表现出更大的成长曲线变异性,低于或高于平均值2个标准差的学生偏离平均成长率的可能性为88%"。"快速追赶学习者 "占学生总数的 35%,他们开始时分数较低,但进步很快,到青春期时往往超过早期绽放者。Xiao 和 Rabe-Hesketh 解释说:"虽然在 6 岁时成长轨迹的异质性很小,但由于截距和斜率之间的正相关性,这些轨迹出现了分化。那些在 6 岁时生长轨迹高于或低于平均值 2 个标准差的人,有 81% 的可能性偏离平均生长速度"。最后,"稳步前进者 "在 6 岁时的平均成绩最低,但随着时间的推移,他们的成绩会稳步、持续地增长。到 14 岁时,他们的分数开始与其他组别重叠,尽管最初仍有差距。"预计这些学生 14 岁时的平均分偏差将比 6 岁时高出 605%,大约是 6 岁时的 7 倍"。肖和拉贝-赫斯基思通过他们的研究,确定了阅读发展的不同轨迹。无论学生的成长是快速、稳定还是循序渐进,每一种轨迹都值得肯定和鼓励。通过满足每个学习者的独特需求,教育者可以更好地支持这些不同的学习路径,为所有学生的成功和茁壮成长创造公平的机会。有关此可视化的更多详情或咨询,请联系肖星瑶([email protected])。我们邀请您参加 EM:IP 封面图形/数据可视化竞赛,为未来的期刊做出贡献。请发送电子邮件至[email protected]与廖远玲分享您的想法或问题。我们期待您的来信!
{"title":"On the Cover: Unraveling Reading Recognition Trajectories: Classifying Student Development through Growth Mixture Modeling","authors":"Yuan-Ling Liaw","doi":"10.1111/emip.12667","DOIUrl":"https://doi.org/10.1111/emip.12667","url":null,"abstract":"<p>The cover of this issue features “<i>Unraveling Reading Recognition Trajectories: Classifying Student Development through Growth Mixture Modeling</i>” by Xingyao Xiao and Sophia Rabe-Hesketh from the University of California, Berkeley. Using advanced Bayesian growth mixture modeling, their research examines how reading recognition develops between ages 6 and 14, identifying three distinct patterns of growth. This study provides a detailed and nuanced understanding of how students’ reading abilities progress over time.</p><p>Xiao and Rabe-Hesketh illustrated their findings using a multiplot visualization. It combines model-implied class-specific mean trajectories, a shaded 50% mid-range, and box-plots of observed reading scores, effectively highlighting the variability in reading progress among different learner groups. By juxtaposing observed data with model predictions, the visualization clearly depicts diverse growth patterns. Additionally, it emphasizes the variance and covariance of random effects, offering valuable insights often overlooked in similar analyses.</p><p>The three-class model described by Xiao and Rabe-Hesketh effectively explains different patterns of student growth. The first group, termed the “Early Bloomers,” comprises about 14% of the population who start with strong reading abilities and steadily improve. By age six, they show high reading scores and greater variability in growth trajectories compared to other groups. Xiao and Rabe-Hesketh note, “These students exhibit greater variability in growth curves at age six, with an 88% likelihood for those deviating 2 standard deviations below or above the mean to stray from the average growth rate.” This highlights their potential for early reading success.</p><p>The “Rapid Catch-Up Learners” represent 35% of students, starting with lower scores but progressing rapidly to often surpass Early Bloomers by adolescence. Xiao and Rabe-Hesketh explain, “Though showing minimal heterogeneity in growth trajectories at age 6, these paths diverge due to a positive correlation between intercepts and slope. Those with trajectories 2 standard deviations above or below the mean at age 6 possess an 81% likelihood of deviating from the average growth rate.” This group highlights the potential of slower starters to excel with targeted support.</p><p>Lastly, the “Steady Progressors” start with the lowest average scores at age six but show steady, consistent growth over time. By age 14, their scores begin to overlap with those of other groups, despite maintaining an initial gap. “These students are projected to deviate 605% more from the mean at age 14 than at age 6, approximately seven times as much.” Representing a majority of students, this group highlights the importance of persistence and gradual progress.</p><p>Through their research, Xiao and Rabe-Hesketh define the diverse trajectories of reading development. Whether a student's growth is rapid, steady, or gradual, every trajectory deser","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 1","pages":"6"},"PeriodicalIF":2.7,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/emip.12667","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ITEMS Corner: Next Chapter of ITEMS 物品角:物品的下一章
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-02-17 DOI: 10.1111/emip.12666
Stella Y. Kim
{"title":"ITEMS Corner: Next Chapter of ITEMS","authors":"Stella Y. Kim","doi":"10.1111/emip.12666","DOIUrl":"https://doi.org/10.1111/emip.12666","url":null,"abstract":"","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"44 1","pages":"108"},"PeriodicalIF":2.7,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143423608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Cover 覆盖问题
IF 2.7 4区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH Pub Date : 2025-01-05 DOI: 10.1111/emip.12566
{"title":"Issue Cover","authors":"","doi":"10.1111/emip.12566","DOIUrl":"https://doi.org/10.1111/emip.12566","url":null,"abstract":"","PeriodicalId":47345,"journal":{"name":"Educational Measurement-Issues and Practice","volume":"43 4","pages":""},"PeriodicalIF":2.7,"publicationDate":"2025-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/emip.12566","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143248557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Educational Measurement-Issues and Practice
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1