首页 > 最新文献

Journal of outcome measurement最新文献

英文 中文
Comparison of seven different scales used to quantify severity of cervical spondylotic myelopathy and post-operative improvement. 用于量化脊髓型颈椎病严重程度和术后改善的七种不同量表的比较。
Pub Date : 2001-01-01
A Singh, H A Crockard

Considerable uncertainty exists over the benefit that patients receive from surgical decompressive treatment for cervical spondylotic myelopathy (CSM). Such difficulties might be addressed by accurate quantification of CSM severity as part of a trial determining the outcome of surgery in different patient groups. This study compares the applicability of various existing quantitative severity scales to measurement of CSM severity and the effects on severity of surgical decompression. Scores on the following scales were determined on 100 patients with CSM preoperatively and then again six months following surgical decompression: Odom's Criteria, Nurick grade, Ranawat grade, Myelopathy Disability Index (MDI), Japanese Orthopaedic Association (JOA) Score, European Myelopathy Score (EMS) and Short Form-36 Health Survey (SF36). All the scales showed significant improvement following surgery. However, each had differing qualities of reliability, validity and responsiveness that made them more or less suitable. The MDI showed the greatest sensitivity between different severity levels, sensitivity to operative change and reliability. However, analysis of all the questionnaire scales into components that looked at different aspects of function revealed potential problems with redundancy and a lack of consistency. This prospective observational study provides a rational basis for determining the advantages and disadvantages of different existing scales in measurement of CSM severity and for making adaptations to develop a scale more specifically suited to a comprehensive surgical trial.

对于脊髓型颈椎病(CSM)患者接受手术减压治疗的获益,存在相当大的不确定性。这些困难可以通过精确量化CSM严重程度来解决,作为确定不同患者组手术结果的试验的一部分。本研究比较了现有各种定量严重程度量表在CSM严重程度测量中的适用性以及对手术减压严重程度的影响。对100例CSM患者进行术前和术后6个月的评分:Odom标准、Nurick评分、Ranawat评分、脊髓病残疾指数(MDI)、日本骨科协会(JOA)评分、欧洲脊髓病评分(EMS)和sf -36健康调查(SF36)。术后所有量表均有明显改善。然而,每一种都有不同的可靠性、有效性和响应性,这使它们或多或少地适合。MDI在不同严重程度、手术改变敏感性和可靠性之间表现出最大的敏感性。然而,对所有问卷量表的分析显示了潜在的冗余和缺乏一致性的问题。这项前瞻性观察性研究为确定现有不同量表在测量脊髓型颈椎病严重程度方面的优缺点,并为制定更适合综合外科试验的量表提供了合理的依据。
{"title":"Comparison of seven different scales used to quantify severity of cervical spondylotic myelopathy and post-operative improvement.","authors":"A Singh,&nbsp;H A Crockard","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Considerable uncertainty exists over the benefit that patients receive from surgical decompressive treatment for cervical spondylotic myelopathy (CSM). Such difficulties might be addressed by accurate quantification of CSM severity as part of a trial determining the outcome of surgery in different patient groups. This study compares the applicability of various existing quantitative severity scales to measurement of CSM severity and the effects on severity of surgical decompression. Scores on the following scales were determined on 100 patients with CSM preoperatively and then again six months following surgical decompression: Odom's Criteria, Nurick grade, Ranawat grade, Myelopathy Disability Index (MDI), Japanese Orthopaedic Association (JOA) Score, European Myelopathy Score (EMS) and Short Form-36 Health Survey (SF36). All the scales showed significant improvement following surgery. However, each had differing qualities of reliability, validity and responsiveness that made them more or less suitable. The MDI showed the greatest sensitivity between different severity levels, sensitivity to operative change and reliability. However, analysis of all the questionnaire scales into components that looked at different aspects of function revealed potential problems with redundancy and a lack of consistency. This prospective observational study provides a rational basis for determining the advantages and disadvantages of different existing scales in measurement of CSM severity and for making adaptations to develop a scale more specifically suited to a comprehensive surgical trial.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"5 1","pages":"798-818"},"PeriodicalIF":0.0,"publicationDate":"2001-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25718599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring disability: application of the Rasch model to activities of daily living (ADL/IADL). 测量残疾:Rasch模型在日常生活活动(ADL/IADL)中的应用。
Pub Date : 2001-01-01
T J Sheehan, L M DeChello, R Garcia, J Fifield, N Rothfield, S Reisine

This paper describes a comparative analysis of (ADL) and (IADL) items administered to two samples, 4,430 persons representative of older Americans, and 605 persons representative of patients with rheumatoid arthrisit (RA). Responses are scored separately using both Likert and Rasch measurement models. While Likert scoring seems to provide information similar to Rasch, the descriptive statistics are often contrary if not contradictory, and estimates of reliability from Likert are inflated. The test characteristic curves derived from Rasch are similar despite differences between the levels of disability with the two samples. Correlations of Rasch item calibrations across three samples were .71, .76, and .80. The fit between the items and the samples, indicating the compatibility between the test and subjects, is seen much more clearly with Rasch with more than half of the general population measuring the extremes. Since research on disability depends on measures with known properties, the superiority of Rasch over Likert is evident.

本文描述了(ADL)和(IADL)项目对两个样本的比较分析,4430名美国老年人代表和605名类风湿性关节炎(RA)患者代表。分别使用Likert和Rasch测量模型对回答进行评分。虽然李克特评分似乎提供了与Rasch相似的信息,但描述性统计数据往往是相反的,如果不是矛盾的话,而且李克特的可靠性估计被夸大了。尽管两个样本的残疾水平存在差异,但从Rasch得出的测试特征曲线是相似的。三个样品的拉希项目校准的相关系数分别为0.71、0.76和0.80。项目和样本之间的契合度,表明了测试和受试者之间的兼容性,在Rasch中表现得更为明显,超过一半的普通人群测量了极端情况。由于对残疾的研究依赖于具有已知性质的测量,因此Rasch优于Likert是显而易见的。
{"title":"Measuring disability: application of the Rasch model to activities of daily living (ADL/IADL).","authors":"T J Sheehan,&nbsp;L M DeChello,&nbsp;R Garcia,&nbsp;J Fifield,&nbsp;N Rothfield,&nbsp;S Reisine","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This paper describes a comparative analysis of (ADL) and (IADL) items administered to two samples, 4,430 persons representative of older Americans, and 605 persons representative of patients with rheumatoid arthrisit (RA). Responses are scored separately using both Likert and Rasch measurement models. While Likert scoring seems to provide information similar to Rasch, the descriptive statistics are often contrary if not contradictory, and estimates of reliability from Likert are inflated. The test characteristic curves derived from Rasch are similar despite differences between the levels of disability with the two samples. Correlations of Rasch item calibrations across three samples were .71, .76, and .80. The fit between the items and the samples, indicating the compatibility between the test and subjects, is seen much more clearly with Rasch with more than half of the general population measuring the extremes. Since research on disability depends on measures with known properties, the superiority of Rasch over Likert is evident.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"5 1","pages":"839-63"},"PeriodicalIF":0.0,"publicationDate":"2001-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25718605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The impact of rater effects on weighted composite scores under nested and spiraled scoring designs, using the multifaceted Rasch model. 在嵌套和螺旋评分设计下,使用多面Rasch模型,评分者效应对加权综合分数的影响。
Pub Date : 2001-01-01
H M Taherbhai, M J Young

Constructed-response or open-ended tasks are increasingly used in recent years. Since these tasks cannot be machine-scored, variability among raters cannot be completely eliminated and their effects, when they are not modeled, can cast doubts on the reliability of the results. Besides rater effects, the estimation of student ability can also be impacted by differentially weighted tasks/items that formulate composite scores. This simulation study compares student ability estimates with their true abilities under different rater scoring designs and differentially weighted composite scores. Results indicate that the spiraled rater scoring design without modeling rater effects works as well as the nested design in which rater tendencies are modeled. As expected, differentially weighted composite scores have a confounding effect on student ability estimates. This is particularly true when open-ended tasks are weighted much more than the multiple-choice items and when rater effects interact with weighted composite scores.

建构式回应或开放式任务近年来越来越多地被使用。由于这些任务不能用机器评分,评分者之间的可变性不能完全消除,而且当他们没有建模时,他们的影响可能会对结果的可靠性产生怀疑。除了评分效应,对学生能力的估计也会受到构成综合分数的不同加权任务/项目的影响。本模拟研究比较了学生在不同评分设计和差异加权综合评分下的能力估计与真实能力。结果表明,不模拟评分者效应的螺旋评分设计与模拟评分者倾向的嵌套评分设计一样有效。正如预期的那样,差异加权综合分数对学生能力估计有混淆效应。当开放式任务的权重远高于多项选择题,以及评分效应与加权综合分数相互作用时,这种情况尤其明显。
{"title":"The impact of rater effects on weighted composite scores under nested and spiraled scoring designs, using the multifaceted Rasch model.","authors":"H M Taherbhai,&nbsp;M J Young","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Constructed-response or open-ended tasks are increasingly used in recent years. Since these tasks cannot be machine-scored, variability among raters cannot be completely eliminated and their effects, when they are not modeled, can cast doubts on the reliability of the results. Besides rater effects, the estimation of student ability can also be impacted by differentially weighted tasks/items that formulate composite scores. This simulation study compares student ability estimates with their true abilities under different rater scoring designs and differentially weighted composite scores. Results indicate that the spiraled rater scoring design without modeling rater effects works as well as the nested design in which rater tendencies are modeled. As expected, differentially weighted composite scores have a confounding effect on student ability estimates. This is particularly true when open-ended tasks are weighted much more than the multiple-choice items and when rater effects interact with weighted composite scores.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"5 1","pages":"819-38"},"PeriodicalIF":0.0,"publicationDate":"2001-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25718602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The impact of socio-cultural and clinical factors on health-related quality of life reports among Hispanic and African-American cancer patients. 社会文化和临床因素对西班牙裔和非裔癌症患者健康相关生活质量报告的影响
Pub Date : 1999-01-01
G J Wan, M A Counte, D F Cella, L Hernandez, D B McGuire, S Deasay, G Shiomoto, E A Hahn

A hierarchical multiple linear regression approach (N = 761) was used to identify pertinent factors which influence health-related quality of life (HRQL) reports among Hispanic and African-American cancer patients. The independent variables include: performance status, disease site, disease stage, mode of administration, socio-economic status (SES), gender, age, living arrangement, race/ethnicity, religious affiliation, insurance status, and spiritual beliefs. The outcome measures, five subscales of HRQL (physical well-being, social well-being, satisfaction with treatment, emotional well-being, functional well-being) and overall HRQL (sum of the five subscales), were estimated using the Functional Assessment of Cancer Therapy (FACT) Scales. This study identified performance status and spiritual beliefs as consistent predictors of overall HRQL. This study also found no significant effects of SES, mode of administration, gender age, living arrangement and insurance status on the reporting of overall HRQL. Spiritual beliefs and performance status are important determinants of HRQL across a diverse group of cancer patients.

采用分层多元线性回归方法(N = 761)确定影响西班牙裔和非裔癌症患者健康相关生活质量(HRQL)报告的相关因素。自变量包括:表现状况、疾病部位、疾病分期、管理方式、社会经济地位(SES)、性别、年龄、居住安排、种族/民族、宗教信仰、保险状况和精神信仰。结果测量,HRQL的五个子量表(身体健康、社会健康、治疗满意度、情绪健康、功能健康)和总体HRQL(五个子量表的总和),使用癌症治疗功能评估(FACT)量表进行估计。本研究确定了表现状态和精神信仰是整体HRQL的一致预测因子。本研究还发现社会经济地位、管理方式、性别、年龄、生活安排和保险状况对总体HRQL报告没有显著影响。在不同的癌症患者群体中,精神信仰和表现状态是HRQL的重要决定因素。
{"title":"The impact of socio-cultural and clinical factors on health-related quality of life reports among Hispanic and African-American cancer patients.","authors":"G J Wan,&nbsp;M A Counte,&nbsp;D F Cella,&nbsp;L Hernandez,&nbsp;D B McGuire,&nbsp;S Deasay,&nbsp;G Shiomoto,&nbsp;E A Hahn","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A hierarchical multiple linear regression approach (N = 761) was used to identify pertinent factors which influence health-related quality of life (HRQL) reports among Hispanic and African-American cancer patients. The independent variables include: performance status, disease site, disease stage, mode of administration, socio-economic status (SES), gender, age, living arrangement, race/ethnicity, religious affiliation, insurance status, and spiritual beliefs. The outcome measures, five subscales of HRQL (physical well-being, social well-being, satisfaction with treatment, emotional well-being, functional well-being) and overall HRQL (sum of the five subscales), were estimated using the Functional Assessment of Cancer Therapy (FACT) Scales. This study identified performance status and spiritual beliefs as consistent predictors of overall HRQL. This study also found no significant effects of SES, mode of administration, gender age, living arrangement and insurance status on the reporting of overall HRQL. Spiritual beliefs and performance status are important determinants of HRQL across a diverse group of cancer patients.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 3","pages":"200-15"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"21296439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring change across multiple occasions using the Rasch Rating Scale Model. 使用Rasch评定量表模型测量多个场合的变化。
Pub Date : 1999-01-01
E W Wolfe, C W Chiu

When latent trait models are used to measure change across time, it is difficult to disentangle changes in one facet of the measurement context from changes in other facets. Hence, it is difficult to diagnose change. Wright (1999b) proposed an algorithm for disentangling change, and previously the authors applied this algorithm to measuring change across two occasions (Wolfe and Chiu, 1999). In this article we extend Wright's algorithm to disentangle changes in measures across three occasions. We describe a standard Rasch rating scale analysis of a multi-occasion evaluation that produces confusing results when subjected to a series of "separate" calibrations. Then, we apply Wright's correction to the same data to show that the algorithm reveals changes that are more similar to ones that would be expected. Our demonstration shows that Wright's procedure can reduce misfit to the Rasch Rating Scale Model as well as changing the interpretation of change within the measurement context.

当使用潜在特质模型来测量跨时间的变化时,很难将测量上下文的一个方面的变化与其他方面的变化区分开来。因此,很难诊断变化。Wright (1999b)提出了一种解纠缠变化的算法,之前作者将该算法应用于两种情况下的变化测量(Wolfe和Chiu, 1999)。在本文中,我们将Wright的算法扩展到三种情况下的度量变化。我们描述了一个多场合评估的标准拉西评定量表分析,当受到一系列“单独”校准时,会产生令人困惑的结果。然后,我们将Wright的修正应用于相同的数据,以表明该算法揭示的变化更类似于预期的变化。我们的论证表明,赖特的程序可以减少对拉什评定量表模型的不拟合,并改变测量环境中对变化的解释。
{"title":"Measuring change across multiple occasions using the Rasch Rating Scale Model.","authors":"E W Wolfe,&nbsp;C W Chiu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>When latent trait models are used to measure change across time, it is difficult to disentangle changes in one facet of the measurement context from changes in other facets. Hence, it is difficult to diagnose change. Wright (1999b) proposed an algorithm for disentangling change, and previously the authors applied this algorithm to measuring change across two occasions (Wolfe and Chiu, 1999). In this article we extend Wright's algorithm to disentangle changes in measures across three occasions. We describe a standard Rasch rating scale analysis of a multi-occasion evaluation that produces confusing results when subjected to a series of \"separate\" calibrations. Then, we apply Wright's correction to the same data to show that the algorithm reveals changes that are more similar to ones that would be expected. Our demonstration shows that Wright's procedure can reduce misfit to the Rasch Rating Scale Model as well as changing the interpretation of change within the measurement context.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 4","pages":"360-81"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"21430088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alternate forms reliability of the assessment of motor and process skills. 替代形式的可靠性评估的电机和工艺技能。
Pub Date : 1999-01-01
K N Kirkley, A G Fisher

The purpose of this study is to examine the alternate forms reliability of the AMPS (Assessment of Motor and Process Skills) (Fisher, 1997a) where alternate forms means different pairs of AMPS tasks. The participants for this study were persons selected from the AMPS database who had performed four AMPS tasks. The participants varied in age, gender, diagnosis, and level of assistance needed to live in the community. The AMPS was administered by trained and calibrated occupational therapists according to standardized procedures. The data for the 91 participants were subjected to 12 many-faceted Rasch analyses to generate ADL motor and ADL process ability measures for each task and each set of paired tasks. Repeated measures ANOVAs revealed no time effect across the four AMPS tasks. Pearson product moment correlations between Tasks 1 and 2 combined and Tasks 3 and 4 combined were r = .91 and r = .86 for the ADL motor and ADL process scales, respectively. Calculation of the standardized difference (z) revealed that no more than 8% of the participants had ADL motor or ADL process ability measures that differed significantly between observations once we accounted for real differences in a persons performance; 80% of the paired ADL motor and ADL process ability measures remained stable within +/- 0.5 logits when the participants performed two tasks. The AMPS ADL motor and ADL process scales can be used reliably in clinical practice and for research purposes.

本研究的目的是检验AMPS(运动和过程技能评估)的替代形式的可靠性(Fisher, 1997a),其中替代形式意味着不同的AMPS任务对。本研究的参与者是从AMPS数据库中选出的,他们执行了四项AMPS任务。参与者的年龄、性别、诊断和在社区生活所需的援助水平各不相同。AMPS由经过培训和校准的职业治疗师根据标准化程序进行管理。91名参与者的数据进行了12次多方面的Rasch分析,以生成每个任务和每组配对任务的ADL运动和ADL处理能力测量。重复测量方差分析显示四个AMPS任务之间没有时间效应。任务1和任务2组合和任务3和任务4组合在ADL运动和ADL过程量表上的Pearson积矩相关性分别为r = 0.91和r = 0.86。标准化差异(z)的计算表明,一旦我们考虑到个人表现的实际差异,不超过8%的参与者的ADL运动或ADL处理能力在观察结果之间存在显著差异;当参与者同时执行两项任务时,80%的ADL运动和ADL处理能力配对测量保持稳定在+/- 0.5 logits范围内。AMPS ADL运动和ADL过程量表可以可靠地用于临床实践和研究目的。
{"title":"Alternate forms reliability of the assessment of motor and process skills.","authors":"K N Kirkley,&nbsp;A G Fisher","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The purpose of this study is to examine the alternate forms reliability of the AMPS (Assessment of Motor and Process Skills) (Fisher, 1997a) where alternate forms means different pairs of AMPS tasks. The participants for this study were persons selected from the AMPS database who had performed four AMPS tasks. The participants varied in age, gender, diagnosis, and level of assistance needed to live in the community. The AMPS was administered by trained and calibrated occupational therapists according to standardized procedures. The data for the 91 participants were subjected to 12 many-faceted Rasch analyses to generate ADL motor and ADL process ability measures for each task and each set of paired tasks. Repeated measures ANOVAs revealed no time effect across the four AMPS tasks. Pearson product moment correlations between Tasks 1 and 2 combined and Tasks 3 and 4 combined were r = .91 and r = .86 for the ADL motor and ADL process scales, respectively. Calculation of the standardized difference (z) revealed that no more than 8% of the participants had ADL motor or ADL process ability measures that differed significantly between observations once we accounted for real differences in a persons performance; 80% of the paired ADL motor and ADL process ability measures remained stable within +/- 0.5 logits when the participants performed two tasks. The AMPS ADL motor and ADL process scales can be used reliably in clinical practice and for research purposes.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 1","pages":"53-70"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20936401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting differential item functioning with five standardized item-fit indices in the Rasch model. 利用Rasch模型中的五个标准化项目拟合指标检测差异项目功能。
Pub Date : 1999-01-01
H Seol

This study examined five Rasch-model-based item-fit indices: unweighted and weighted standardized indices (denoted UWz and Wz), standardized likelihood index (denoted Lz), and Extended Caution Indices (denoted ECI2z and ECI4z), in terms of their distributional properties and the power of detecting item bias or Differential Item Functioning (DIF). The results indicated that although these five standardized item-fit indices did not depart significantly from a normal distribution, it appeared that the Type I error rates were not reasonable. For the power of five standardized item-fit indices to detect DIF, the results showed that all indices did perform poorly across various conditions. These findings lead to the conclusion that all indices used in this study are inadequate fit measures for detecting DIF.

本研究检验了五个基于rasch模型的项目拟合指数:未加权和加权标准化指数(分别表示UWz和Wz),标准化似然指数(分别表示Lz)和扩展谨慎指数(分别表示ECI2z和ECI4z),根据它们的分布特性和检测项目偏差或差异项目功能(DIF)的能力。结果表明,虽然这5个标准化项目拟合指标没有明显偏离正态分布,但I型错误率似乎不合理。对于五个标准化项目拟合指标检测DIF的能力,结果表明,所有指标在各种条件下都表现不佳。这些发现导致本研究中使用的所有指标都不适合检测DIF。
{"title":"Detecting differential item functioning with five standardized item-fit indices in the Rasch model.","authors":"H Seol","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study examined five Rasch-model-based item-fit indices: unweighted and weighted standardized indices (denoted UWz and Wz), standardized likelihood index (denoted Lz), and Extended Caution Indices (denoted ECI2z and ECI4z), in terms of their distributional properties and the power of detecting item bias or Differential Item Functioning (DIF). The results indicated that although these five standardized item-fit indices did not depart significantly from a normal distribution, it appeared that the Type I error rates were not reasonable. For the power of five standardized item-fit indices to detect DIF, the results showed that all indices did perform poorly across various conditions. These findings lead to the conclusion that all indices used in this study are inadequate fit measures for detecting DIF.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 3","pages":"233-47"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"21296441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The flow experience: a Rasch analysis of Jackson's Flow State Scale. 心流体验:Jackson心流状态量表的Rasch分析。
Pub Date : 1999-01-01
G Tenenbaum, G J Fogarty, S A Jackson

Jackson and Marsh (1996) reported the development of a Flow State Scale (FSS) for use in sport and physical activity. The FSS contains 36 items measuring the nine dimensions of flow described by Csikszentmihalyi (1990, 1993). Jackson and Marsh reported high internal consistency estimates for the subscales and evidence for nine first-order factors and one second-order factor when confirmatory factor analytic techniques were used. The present study extended this validation work by subjecting the data from the original sample (N = 394) of elite younger athletes and a subsequent sample (N = 398) of older athletes to Rasch analysis. These Rasch analyses showed quite clearly that the flow dimensions may be conceptualised as a continuum with "autotelic experience" being experienced more readily than dimensions such as "transformation of time", a state that may only be encountered at the height of a deep flow experience. The Rasch analyses provide useful additional information about the areas of the flow continuum tapped by the items and scales of the FSS and, in so doing, help to confirm the construct validity and generalisability of the scale itself.

Jackson和Marsh(1996)报道了用于运动和身体活动的心流状态量表(FSS)的发展。FSS包含36个项目,测量Csikszentmihalyi(1999,1993)描述的9个维度的流动。Jackson和Marsh报告了使用验证性因子分析技术时,对子量表和九个一阶因子和一个二阶因子的高内部一致性估计和证据。本研究通过对原始样本(N = 394)的优秀年轻运动员和随后样本(N = 398)的老年运动员的数据进行Rasch分析,扩展了这一验证工作。这些Rasch分析非常清楚地表明,流维度可以被概念化为一个连续体,与“时间转换”这样的维度相比,“自成一体的体验”更容易被体验,这种状态可能只会在深度流体验的高度遇到。Rasch分析为FSS的项目和量表所挖掘的流动连续区域提供了有用的额外信息,这样做有助于确认量表本身的结构有效性和普遍性。
{"title":"The flow experience: a Rasch analysis of Jackson's Flow State Scale.","authors":"G Tenenbaum,&nbsp;G J Fogarty,&nbsp;S A Jackson","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Jackson and Marsh (1996) reported the development of a Flow State Scale (FSS) for use in sport and physical activity. The FSS contains 36 items measuring the nine dimensions of flow described by Csikszentmihalyi (1990, 1993). Jackson and Marsh reported high internal consistency estimates for the subscales and evidence for nine first-order factors and one second-order factor when confirmatory factor analytic techniques were used. The present study extended this validation work by subjecting the data from the original sample (N = 394) of elite younger athletes and a subsequent sample (N = 398) of older athletes to Rasch analysis. These Rasch analyses showed quite clearly that the flow dimensions may be conceptualised as a continuum with \"autotelic experience\" being experienced more readily than dimensions such as \"transformation of time\", a state that may only be encountered at the height of a deep flow experience. The Rasch analyses provide useful additional information about the areas of the flow continuum tapped by the items and scales of the FSS and, in so doing, help to confirm the construct validity and generalisability of the scale itself.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 3","pages":"278-94"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"21296444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using IRT variable maps to enrich understanding of rehabilitation data. 利用IRT变量图丰富对康复数据的理解。
Pub Date : 1999-01-01
W Coster, L Ludlow, M Mancini

One of the benefits of item response theory (IRT) applications in scale development is the greater transparency of resulting scores. This feature allows translation of a total score on a particular scale into a profile of probable item responses with relative ease, for example by using the variable map that often is part of IRT analysis output. Although there have been a few examples in the literature using variable maps to enrich clinical interpretation of individual functional assessment scores, this feature of IRT output has received very limited application in rehabilitation research. The present paper illustrates the application of variable maps to support more in-depth interpretation of functional assessment scores in research and clinical contexts. Two examples are presented from an outcome prediction study conducted during the standardization of a new functional assessment for elementary school students with disabilities, the School Function Assessment. Two different applications are described: creating a dichotomous outcome variable using scores from a continuous scale, and interpreting the meaning of a classification cut-off score identified through Classification and Regression Tree (CART) analysis.

项目反应理论(IRT)在量表开发中的应用的好处之一是结果分数的更大的透明度。该功能允许将特定尺度上的总分相对容易地转换为可能项目反应的概况,例如通过使用通常是IRT分析输出的一部分的变量图。虽然文献中有一些使用变量图来丰富个人功能评估分数的临床解释的例子,但IRT输出的这一特征在康复研究中的应用非常有限。本文阐述了变量图的应用,以支持在研究和临床环境中更深入地解释功能评估分数。本文以一项针对小学残障学生的新功能评估“学校功能评估”的标准化研究为例,提出两项结果预测研究。描述了两种不同的应用:使用连续量表的分数创建二分类结果变量,以及解释通过分类和回归树(CART)分析确定的分类截止分数的含义。
{"title":"Using IRT variable maps to enrich understanding of rehabilitation data.","authors":"W Coster,&nbsp;L Ludlow,&nbsp;M Mancini","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>One of the benefits of item response theory (IRT) applications in scale development is the greater transparency of resulting scores. This feature allows translation of a total score on a particular scale into a profile of probable item responses with relative ease, for example by using the variable map that often is part of IRT analysis output. Although there have been a few examples in the literature using variable maps to enrich clinical interpretation of individual functional assessment scores, this feature of IRT output has received very limited application in rehabilitation research. The present paper illustrates the application of variable maps to support more in-depth interpretation of functional assessment scores in research and clinical contexts. Two examples are presented from an outcome prediction study conducted during the standardization of a new functional assessment for elementary school students with disabilities, the School Function Assessment. Two different applications are described: creating a dichotomous outcome variable using scores from a continuous scale, and interpreting the meaning of a classification cut-off score identified through Classification and Regression Tree (CART) analysis.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 2","pages":"123-33"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"21075425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring pretest-posttest change with a Rasch Rating Scale Model. 用Rasch评定量表模型测量前测后测变化。
Pub Date : 1999-01-01
E W Wolfe, C W Chiu

When measures are taken on the same individual over time, it is difficult to determine whether observed differences are the result of changes in the person or changes in other facets of the measurement situation (e.g., interpretation of items or use of rating scale). This paper describes a method for disentangling changes in persons from changes in the interpretation of Likert-type questionnaire items and the use of rating scales (Wright, 1996a). The procedure relies on anchoring strategies to create a common frame of reference for interpreting measures that are taken at different times and provides a detailed illustration of how to implement these procedures using FACETS.

当对同一个人长期采取测量时,很难确定观察到的差异是由于个人的变化还是由于测量情况的其他方面的变化(例如,对项目的解释或评定量表的使用)造成的。本文描述了一种将人的变化与李克特型问卷项目的解释变化和评定量表的使用区分开来的方法(Wright, 1996a)。该程序依赖于锚定策略来创建一个通用的参考框架,用于解释在不同时间采取的措施,并提供了如何使用facet实施这些程序的详细说明。
{"title":"Measuring pretest-posttest change with a Rasch Rating Scale Model.","authors":"E W Wolfe,&nbsp;C W Chiu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>When measures are taken on the same individual over time, it is difficult to determine whether observed differences are the result of changes in the person or changes in other facets of the measurement situation (e.g., interpretation of items or use of rating scale). This paper describes a method for disentangling changes in persons from changes in the interpretation of Likert-type questionnaire items and the use of rating scales (Wright, 1996a). The procedure relies on anchoring strategies to create a common frame of reference for interpreting measures that are taken at different times and provides a detailed illustration of how to implement these procedures using FACETS.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"3 2","pages":"134-61"},"PeriodicalIF":0.0,"publicationDate":"1999-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"21075427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of outcome measurement
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1