首页 > 最新文献

Journal of outcome measurement最新文献

英文 中文
Rasch analysis of distractors in multiple-choice items. 多项选择题干扰因素的Rasch分析。
Pub Date : 1998-01-01
W C Wang

In order to apply the Rasch model to multiple-choice items, incorrect responses to distractors are usually aggregated to a single category. In doing so, information of individual distractors disappears. In this paper, a Rasch-type analysis is proposed where one parameter is assigned to each distractor. The information is thus preserved. The proposed distractor model can be applied to investigate the performance of distractors, which is useful for item revision. This model is a necessary condition of the Rasch model, that is, fitting the distractor model will fit the Rasch model, but not vice versa. The results of a small simulation study show that parameter recovery of the distractor model is very satisfactory. A real data set of twenty multiple-choice items was analyzed. Some items were found to fit the Rasch model rather than the distractor model. It is this diagnostic value that makes the distractor model suitable for multiple-choice items.

为了将Rasch模型应用于多项选择题,对干扰因素的错误反应通常被汇总到一个类别中。这样一来,单个干扰因素的信息就消失了。本文提出了一种rasch型分析方法,其中每个干扰物分配一个参数。信息就这样被保存了下来。本文提出的干扰因素模型可用于研究干扰因素的表现,为项目修正提供依据。该模型是Rasch模型的必要条件,即拟合分心物模型将拟合Rasch模型,而不是相反。小型仿真研究结果表明,该模型的参数恢复效果令人满意。对20个选择题的真实数据集进行了分析。一些项目被发现适合Rasch模型而不是分心物模型。正是这种诊断价值使得分心物模型适用于多项选择题。
{"title":"Rasch analysis of distractors in multiple-choice items.","authors":"W C Wang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In order to apply the Rasch model to multiple-choice items, incorrect responses to distractors are usually aggregated to a single category. In doing so, information of individual distractors disappears. In this paper, a Rasch-type analysis is proposed where one parameter is assigned to each distractor. The information is thus preserved. The proposed distractor model can be applied to investigate the performance of distractors, which is useful for item revision. This model is a necessary condition of the Rasch model, that is, fitting the distractor model will fit the Rasch model, but not vice versa. The results of a small simulation study show that parameter recovery of the distractor model is very satisfactory. A real data set of twenty multiple-choice items was analyzed. Some items were found to fit the Rasch model rather than the distractor model. It is this diagnostic value that makes the distractor model suitable for multiple-choice items.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 1","pages":"43-65"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Corrected Rasch asymptotic standard errors for person ability estimates. 校正了人的能力估计的Rasch渐近标准误差。
Pub Date : 1998-01-01
R M Smith

Most calibration programs designed for the family of Rasch psychometric models report the asymptotic standard errors for person and item measure estimates resulting from the calibration process. Although these estimates are theoretically correct, they may be influenced by any number of factors, e.g., restrictions due to the loss of degrees of freedom in estimation, targeting of the instrument, i.e., the degree of offset between mean item difficulty and mean person ability, and the presence of misfit in the data. The effect of these factors on the standard errors reported for the person has not been previously reported. The purpose of this study was to investigate the effects of these three factors on the asymptotic standard errors for person measures using simulated data. The results indicate that asymptotic errors systematically underestimate the observed standard deviation of ability in simulated data, though this underestimation is usually small for targeted instruments with reasonable sample size. However, the underestimation can easily be corrected with a simple linear function. These simulations use only dichotomous data and the results may not generalize to the rating scale and partial credit models.

大多数为Rasch心理测量模型家族设计的校准程序报告了由校准过程产生的个人和项目测量估计的渐近标准误差。虽然这些估计在理论上是正确的,但它们可能受到许多因素的影响,例如,由于估计中自由度的丧失而产生的限制,工具的目标,即平均项目难度与平均人能力之间的偏移程度,以及数据中存在不拟合。这些因素对人的标准误差的影响以前没有报道过。本研究的目的是探讨这三个因素对使用模拟数据的人的测量的渐近标准误差的影响。结果表明,渐近误差系统地低估了模拟数据中观测到的能力标准差,尽管对于具有合理样本量的目标仪器来说,这种低估通常很小。然而,低估可以很容易地用一个简单的线性函数来纠正。这些模拟只使用二分类数据,结果可能不能推广到评级量表和部分信用模型。
{"title":"Corrected Rasch asymptotic standard errors for person ability estimates.","authors":"R M Smith","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Most calibration programs designed for the family of Rasch psychometric models report the asymptotic standard errors for person and item measure estimates resulting from the calibration process. Although these estimates are theoretically correct, they may be influenced by any number of factors, e.g., restrictions due to the loss of degrees of freedom in estimation, targeting of the instrument, i.e., the degree of offset between mean item difficulty and mean person ability, and the presence of misfit in the data. The effect of these factors on the standard errors reported for the person has not been previously reported. The purpose of this study was to investigate the effects of these three factors on the asymptotic standard errors for person measures using simulated data. The results indicate that asymptotic errors systematically underestimate the observed standard deviation of ability in simulated data, though this underestimation is usually small for targeted instruments with reasonable sample size. However, the underestimation can easily be corrected with a simple linear function. These simulations use only dichotomous data and the results may not generalize to the rating scale and partial credit models.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 4","pages":"351-64"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20715781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The dimensionality and validity of the Older Americans Resources and Services (OARS) Activities of Daily Living (ADL) Scale. 美国老年人资源和服务(OARS)日常生活活动(ADL)量表的维度和效度。
Pub Date : 1998-01-01
S E Doble, A G Fisher

The psychometric properties of the OARS ADL scale, comprised of seven physical activities of daily living (PADL) and seven instrumental activities of daily living (IADL) items, were examined using a Rasch measurement approach. Two of the PADL items failed to demonstrate acceptable goodness-of-fit with the measurement model but the remaining 12 items could be combined into a single measure of ADL ability. Although the OARS ADL scale was designed to identify those community-dwelling elderly who need supports and services to continue to live in the community, the scale items were found to be poorly targeted to community-dwelling elderly since almost half of our sample received maximal scores. Rasch analysis identified how we might improve the sensitivity of the OARS ADL scale but its utility in outcome and longitudinal studies remains questionable.

采用Rasch测量法对由7个日常生活体力活动(PADL)和7个日常生活工具活动(IADL)项目组成的OARS ADL量表进行心理测量。其中两个项目未能与测量模型显示出可接受的拟合优度,但其余12个项目可以合并为一个单一的ADL能力测量。虽然OARS ADL量表旨在识别那些需要支持和服务以继续在社区生活的社区居住老年人,但我们发现量表项目对社区居住老年人的针对性较差,因为我们的样本中几乎有一半获得了最高分数。Rasch分析确定了我们如何提高OARS ADL量表的敏感性,但其在结果和纵向研究中的效用仍然值得怀疑。
{"title":"The dimensionality and validity of the Older Americans Resources and Services (OARS) Activities of Daily Living (ADL) Scale.","authors":"S E Doble,&nbsp;A G Fisher","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The psychometric properties of the OARS ADL scale, comprised of seven physical activities of daily living (PADL) and seven instrumental activities of daily living (IADL) items, were examined using a Rasch measurement approach. Two of the PADL items failed to demonstrate acceptable goodness-of-fit with the measurement model but the remaining 12 items could be combined into a single measure of ADL ability. Although the OARS ADL scale was designed to identify those community-dwelling elderly who need supports and services to continue to live in the community, the scale items were found to be poorly targeted to community-dwelling elderly since almost half of our sample received maximal scores. Rasch analysis identified how we might improve the sensitivity of the OARS ADL scale but its utility in outcome and longitudinal studies remains questionable.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 1","pages":"4-24"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using item mean squares to evaluate fit to the Rasch model. 使用项目均方来评估与Rasch模型的拟合。
Pub Date : 1998-01-01
R M Smith, R E Schumacker, M J Bush

Throughout the mid to late 1970's considerable research was conducted on the properties of Rasch fit mean squares. This work culminated in a variety of transformations to convert the mean squares into approximate t-statistics. This work was primarily motivated by the influence sample size has on the magnitude of the mean squares and the desire to have a single critical value that can generally be applied to most cases. In the late 1980's and the early 1990's the trend seems to have reversed, with numerous researchers using the untransformed fit mean squares as a means of testing fit to the Rasch measurement models. The principal motivation is cited as the influence sample size has on the sensitivity of the t-converted mean squares. The purpose of this paper is to present the historical development of these fit indices and the various transformations and to examine the impact of sample size on both the fit mean squares and the t-transformations of those mean squares. Because the sample size problem has little influence on the person mean square problem, due to the relatively short length (100 items or less), this paper focuses on the item fit mean squares, where it is common to find the statistics used with sample sizes ranging from 30 to 10,000.

在20世纪70年代中后期,人们对拉希拟合均方的性质进行了大量的研究。这项工作在将均方转换为近似t统计量的各种转换中达到高潮。这项工作的主要动机是样本量对均方大小的影响,以及希望有一个通常可应用于大多数情况的单一临界值。在20世纪80年代末和90年代初,随着许多研究人员使用未变换的拟合均方作为检验Rasch测量模型拟合的手段,趋势似乎已经逆转。主要动机被引用为样本量对t转换均方的灵敏度的影响。本文的目的是介绍这些拟合指数和各种变换的历史发展,并检查样本量对拟合均方和这些均方的t变换的影响。由于样本量问题对人均方问题的影响较小,由于样本量的长度相对较短(100个或更少),因此本文主要关注项目拟合均方,其中通常会发现样本量在30到10,000之间使用的统计量。
{"title":"Using item mean squares to evaluate fit to the Rasch model.","authors":"R M Smith,&nbsp;R E Schumacker,&nbsp;M J Bush","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Throughout the mid to late 1970's considerable research was conducted on the properties of Rasch fit mean squares. This work culminated in a variety of transformations to convert the mean squares into approximate t-statistics. This work was primarily motivated by the influence sample size has on the magnitude of the mean squares and the desire to have a single critical value that can generally be applied to most cases. In the late 1980's and the early 1990's the trend seems to have reversed, with numerous researchers using the untransformed fit mean squares as a means of testing fit to the Rasch measurement models. The principal motivation is cited as the influence sample size has on the sensitivity of the t-converted mean squares. The purpose of this paper is to present the historical development of these fit indices and the various transformations and to examine the impact of sample size on both the fit mean squares and the t-transformations of those mean squares. Because the sample size problem has little influence on the person mean square problem, due to the relatively short length (100 items or less), this paper focuses on the item fit mean squares, where it is common to find the statistics used with sample sizes ranging from 30 to 10,000.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 1","pages":"66-78"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting multidimensionality: which residual data-type works best? 检测多维度:哪种残差数据类型效果最好?
Pub Date : 1998-01-01
J M Linacre

Factor analysis is a powerful technique for investigating multidimensionality in observational data, but it fails to construct interval measures. Rasch analysis constructs interval measures, but only indirectly flags the presence of multidimensional structures. Simulation studies indicate that, for responses to complete tests, construction of Rasch measures from the observational data, followed by principal components factor analysis of Rasch residuals, provides an effective means of identifying multidimensionality. The most diagnostically useful residual form was found to be the standardized residual. The multidimensional structure of the Functional Independence Measure (FIMSM) is confirmed by means of Rasch analysis followed by factor analysis of standardized residuals.

因子分析是研究观测数据多维度的一种有效方法,但它无法构建区间测度。Rasch分析构建间隔度量,但只能间接标记多维结构的存在。仿真研究表明,对于完整测试的响应,从观测数据构建Rasch测度,然后对Rasch残差进行主成分因子分析,提供了一种识别多维度的有效手段。发现最具诊断价值的残差形式是标准化残差。通过Rasch分析和标准化残差因子分析,确定了功能独立性测度(FIMSM)的多维结构。
{"title":"Detecting multidimensionality: which residual data-type works best?","authors":"J M Linacre","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Factor analysis is a powerful technique for investigating multidimensionality in observational data, but it fails to construct interval measures. Rasch analysis constructs interval measures, but only indirectly flags the presence of multidimensional structures. Simulation studies indicate that, for responses to complete tests, construction of Rasch measures from the observational data, followed by principal components factor analysis of Rasch residuals, provides an effective means of identifying multidimensionality. The most diagnostically useful residual form was found to be the standardized residual. The multidimensional structure of the Functional Independence Measure (FIMSM) is confirmed by means of Rasch analysis followed by factor analysis of standardized residuals.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 3","pages":"266-83"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20625684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring individual differences in change with multidimensional Rasch models. 用多维拉希模型测量个体差异的变化。
Pub Date : 1998-01-01
W C Wang, M Wilson, R J Adams

Item response models have been developed to explore change measurement, including those proposed by Fischer and his colleagues (e.g., Fischer & Pazer, 1991; Fischer & Ponocny, 1994), Andersen (1985) and Embretson (1991). In this article, we propose another multidimensional Rasch model, the multidimensional random coefficient multinomial logit (MRCML) model (Adams, Wilson, & Wang, 1997). All these models are briefly reviewed and compared. The MRCML can be applied to not only polytomous items but also investigation of variations in item difficulties. Based on variations in difficulties across occasions and items, five kinds of models are proposed. Some simulation studies were conducted to examine parameter recovery of the MRCML model under various testing situations. All the parameters were recovered very well. A real data set was analyzed to show applications of the MRCML to measuring individual differences in change.

项目反应模型已被开发用于探索变化测量,包括Fischer及其同事提出的模型(例如,Fischer & Pazer, 1991;Fischer & Ponocny, 1994), Andersen(1985)和Embretson(1991)。在本文中,我们提出了另一种多维Rasch模型,即多维随机系数多项logit (MRCML)模型(Adams, Wilson, & Wang, 1997)。对这些模型进行了简要的回顾和比较。MRCML不仅可以应用于多同构题,还可以用于题目难度变化的调查。根据不同场合和项目的难度差异,提出了五种模型。进行了仿真研究,考察了MRCML模型在各种测试情况下的参数恢复情况。所有参数都恢复得很好。分析了一个真实的数据集,以显示MRCML在测量个体差异变化方面的应用。
{"title":"Measuring individual differences in change with multidimensional Rasch models.","authors":"W C Wang,&nbsp;M Wilson,&nbsp;R J Adams","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Item response models have been developed to explore change measurement, including those proposed by Fischer and his colleagues (e.g., Fischer & Pazer, 1991; Fischer & Ponocny, 1994), Andersen (1985) and Embretson (1991). In this article, we propose another multidimensional Rasch model, the multidimensional random coefficient multinomial logit (MRCML) model (Adams, Wilson, & Wang, 1997). All these models are briefly reviewed and compared. The MRCML can be applied to not only polytomous items but also investigation of variations in item difficulties. Based on variations in difficulties across occasions and items, five kinds of models are proposed. Some simulation studies were conducted to examine parameter recovery of the MRCML model under various testing situations. All the parameters were recovered very well. A real data set was analyzed to show applications of the MRCML to measuring individual differences in change.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 3","pages":"240-65"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20627038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physical disability construct convergence across instruments: towards a universal metric. 身体残疾构建了跨工具的趋同:走向一个通用的度量标准。
Pub Date : 1997-01-01
W P Fisher

Objectives: This study examines the stability of a physical disability construct across instruments and samples. The purpose is not to report a formal equating of instrument calibrations, but to indicate whether such an effort would be likely to succeed. Theory. The economics transforming health care from its orientation toward crisis-driven disease reactions to population- and evidence-based preventive health management and individualized disease management demand general scale-free measures of functional independence.

Methods: A new method, pseudo-common item equating, is demonstrated. Similar, but not identical items, from different instruments, calibrated on different samples, are compared.

Data: More than 30 articles presenting Rasch analyses of physical functioning scales were reviewed. Four instruments provided data from ten of these articles, for eleven different calibrations (two instruments are both included in one article).

Results: The final overall average correlation disattenuated for error is .93, with an average of 7 pseudo-common items, and an average p-value of .01, meaning that measures based on these calibrations should be linearly transformable versions of the same metric. Scientific importance. The quantitative stability of different areas of physical functional independence across instruments and samples suggests that the development and deployment of a universal metric is a realizable goal.

目的:本研究考察了跨仪器和样本的身体残疾结构的稳定性。其目的不是报告仪器校准的正式相等,而是指出这种努力是否可能成功。理论。将医疗保健从危机驱动的疾病反应导向转变为以人口和证据为基础的预防性健康管理和个性化疾病管理的经济学要求功能独立性的一般无标度措施。方法:提出一种新方法——伪公共项等值法。从不同的仪器,对不同的样品进行校准,比较相似但不相同的项目。资料:我们回顾了30多篇关于Rasch身体功能量表分析的文章。四种仪器提供了其中十种文章的数据,用于十一种不同的校准(两种仪器都包含在一篇文章中)。结果:最终的总体平均相关性为0.93,平均有7个伪共同项目,平均p值为0.01,这意味着基于这些校准的测量应该是同一度量的线性变换版本。科学的重要性。仪器和样品中不同物理功能独立区域的定量稳定性表明,开发和部署通用度量是一个可实现的目标。
{"title":"Physical disability construct convergence across instruments: towards a universal metric.","authors":"W P Fisher","doi":"","DOIUrl":"","url":null,"abstract":"<p><strong>Objectives: </strong>This study examines the stability of a physical disability construct across instruments and samples. The purpose is not to report a formal equating of instrument calibrations, but to indicate whether such an effort would be likely to succeed. Theory. The economics transforming health care from its orientation toward crisis-driven disease reactions to population- and evidence-based preventive health management and individualized disease management demand general scale-free measures of functional independence.</p><p><strong>Methods: </strong>A new method, pseudo-common item equating, is demonstrated. Similar, but not identical items, from different instruments, calibrated on different samples, are compared.</p><p><strong>Data: </strong>More than 30 articles presenting Rasch analyses of physical functioning scales were reviewed. Four instruments provided data from ten of these articles, for eleven different calibrations (two instruments are both included in one article).</p><p><strong>Results: </strong>The final overall average correlation disattenuated for error is .93, with an average of 7 pseudo-common items, and an average p-value of .01, meaning that measures based on these calibrations should be linearly transformable versions of the same metric. Scientific importance. The quantitative stability of different areas of physical functional independence across instruments and samples suggests that the development and deployment of a universal metric is a realizable goal.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 2","pages":"87-113"},"PeriodicalIF":0.0,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On-line performance assessment using rating scales. 使用评级量表进行在线绩效评估。
Pub Date : 1997-01-01
J Stahl, R Shumway, B Bergstrom, A Fisher

The purpose of this paper is to report on the development of the on-line performance assessment instrument--the Assessment of Motor and Process Skills (AMPS). Issues that will be addressed in the paper include: (a) the establishment of the scoring rubric and its implementation in an extended Rasch model, (b) training of raters, (c) validation of the scoring rubric and procedures for monitoring the internal consistency of raters, and (d) technological implementation of the assessment instrument in a computerized program.

本文的目的是报告在线性能评估工具——运动和过程技能评估(AMPS)的发展。本文将解决的问题包括:(a)评分标准的建立及其在扩展的Rasch模型中的实施,(b)评分员的培训,(c)评分标准的验证和监控评分员内部一致性的程序,以及(d)评估工具在计算机程序中的技术实施。
{"title":"On-line performance assessment using rating scales.","authors":"J Stahl,&nbsp;R Shumway,&nbsp;B Bergstrom,&nbsp;A Fisher","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The purpose of this paper is to report on the development of the on-line performance assessment instrument--the Assessment of Motor and Process Skills (AMPS). Issues that will be addressed in the paper include: (a) the establishment of the scoring rubric and its implementation in an extended Rasch model, (b) training of raters, (c) validation of the scoring rubric and procedures for monitoring the internal consistency of raters, and (d) technological implementation of the assessment instrument in a computerized program.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 3","pages":"173-91"},"PeriodicalIF":0.0,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interpreting the chi-square statistics reported in the many-faceted Rasch model. 解释多面Rasch模型中的卡方统计。
Pub Date : 1997-01-01
R E Schumacker, M E Lunz

The different chi-square statistics reported in the many-faceted Rasch model analysis are presented and interpreted. In addition, other chi-square summary values are computed and presented for interpretation of facets. The chi-square values are useful for determining: (1) the significance of a facet in the Rasch model; (2) the significant contribution of facet main and interaction effects; (3) differences among facet elements; and (4) identifying the specific facet interaction adjustments to the subjects' calibrated logit ability measure.

不同的卡方统计报告在多方面的Rasch模型分析提出和解释。此外,计算和呈现其他卡方汇总值以解释各方面。卡方值用于确定:(1)Rasch模型中一个面的重要性;(2)面主效应和交互效应的显著贡献;(3)面元之间的差异;(4)确定具体的面交互作用对被试校正后的logit能力测量的调整。
{"title":"Interpreting the chi-square statistics reported in the many-faceted Rasch model.","authors":"R E Schumacker,&nbsp;M E Lunz","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The different chi-square statistics reported in the many-faceted Rasch model analysis are presented and interpreted. In addition, other chi-square summary values are computed and presented for interpretation of facets. The chi-square values are useful for determining: (1) the significance of a facet in the Rasch model; (2) the significant contribution of facet main and interaction effects; (3) differences among facet elements; and (4) identifying the specific facet interaction adjustments to the subjects' calibrated logit ability measure.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 3","pages":"239-57"},"PeriodicalIF":0.0,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post-hoc Rasch analysis of optimal categorization of an ordered-response scale. 有序反应量表最优分类的事后Rasch分析。
Pub Date : 1997-01-01
W Zhu, W F Updyke, C Lewandowski

The purpose of this study was to determine the optimal categorization of a self-efficacy ordered-response scale using the Rasch analysis and compare the performance of the Rasch statistics and parameter estimates with conventional statistics. A 50-item scale to measure psychomotor self-efficacy was administered to a total of 2,022 children, including 1,009 boys and 1,013 girls. The data analysis started by collapsing the original five adjacent categories into two, three, and four categories, and a total of 14 data sets were derived. Each of these data sets, including the original one, was analyzed using the Rasch rating scale model, and a set of Rasch model-data fit, category, and separation statistics and parameter estimates, as well as three conventional statistics, were computed and compared. It was found that, instead of the five-category construct designed, the best order of category meanings of the scale in respondents' perceptions was a three-category construct. The Rasch threshold estimates were sensitive indexes in determining the order of the categorization, and that item separation statistics were useful in determining the optimal categorization after its order was confirmed. The commonly used coefficient alpha was found not helpful at all in determining the optimal categorization. The Rasch analysis was demonstrated to be a useful post-hoc analytic approach in determining the optimal categorization of an ordered-response scale.

本研究的目的是利用Rasch分析确定自我效能有序反应量表的最佳分类,并将Rasch统计量和参数估计与常规统计量的性能进行比较。研究人员对2,022名儿童(包括1,009名男孩和1,013名女孩)进行了一项50项的精神运动自我效能测试。数据分析首先将原来相邻的5个类别折叠为2、3、4个类别,共导出14个数据集。每个数据集,包括原始数据集,都使用Rasch评级量表模型进行分析,并计算和比较一组Rasch模型-数据拟合,类别和分离统计量和参数估计,以及三种常规统计量。研究发现,与设计的五类结构不同,量表中类别意义的最佳顺序是三类结构。Rasch阈值估计是确定分类顺序的敏感指标,项目分离统计量在确定分类顺序后确定最优分类是有用的。常用的alpha系数在确定最优分类时根本没有帮助。Rasch分析被证明是一种有用的事后分析方法,用于确定有序反应量表的最佳分类。
{"title":"Post-hoc Rasch analysis of optimal categorization of an ordered-response scale.","authors":"W Zhu,&nbsp;W F Updyke,&nbsp;C Lewandowski","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The purpose of this study was to determine the optimal categorization of a self-efficacy ordered-response scale using the Rasch analysis and compare the performance of the Rasch statistics and parameter estimates with conventional statistics. A 50-item scale to measure psychomotor self-efficacy was administered to a total of 2,022 children, including 1,009 boys and 1,013 girls. The data analysis started by collapsing the original five adjacent categories into two, three, and four categories, and a total of 14 data sets were derived. Each of these data sets, including the original one, was analyzed using the Rasch rating scale model, and a set of Rasch model-data fit, category, and separation statistics and parameter estimates, as well as three conventional statistics, were computed and compared. It was found that, instead of the five-category construct designed, the best order of category meanings of the scale in respondents' perceptions was a three-category construct. The Rasch threshold estimates were sensitive indexes in determining the order of the categorization, and that item separation statistics were useful in determining the optimal categorization after its order was confirmed. The commonly used coefficient alpha was found not helpful at all in determining the optimal categorization. The Rasch analysis was demonstrated to be a useful post-hoc analytic approach in determining the optimal categorization of an ordered-response scale.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 4","pages":"286-304"},"PeriodicalIF":0.0,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of outcome measurement
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1