首页 > 最新文献

Journal of outcome measurement最新文献

英文 中文
Generalizability theory: a unified approach to assessing the dependability (reliability) of measurements in the health sciences. 概括性理论:评估健康科学测量的可靠性(可靠性)的统一方法。
Pub Date : 1998-01-01
D M VanLeeuwen, M D Barnes, M Pase

The reliability of health promotion program evaluation measures, behavioral and attitudinal measures, and clinical measures is a concern to many health educators. Classical reliability coefficients, such as Cronbach's alpha, apply to narrowly defined, prespecified measurement situations. Classical theory does not provide adequate reliability assessments for criterion-referenced measures, for measurement situations having multiple sources of error, or for aggregate-level variables. Generalizability theory can be used to assess the reliability of measures in these situations that are not adequately modeled by Classical theory. Additionally, Generalizability theory affords a broader view and a deeper understanding of the dependability of measurements and the role of different sources of error in the variability of measures.

健康促进计划评估措施、行为和态度措施以及临床措施的可靠性是许多健康教育者关注的问题。经典的信度系数,如Cronbach’s alpha,适用于狭义的、预先规定的测量情况。经典理论没有提供足够的可靠性评估的标准参考措施,测量情况有多个误差来源,或为总水平的变量。泛化理论可以用来评估这些情况下的措施的可靠性,没有充分建模的经典理论。此外,概括性理论对测量的可靠性和不同误差来源在测量可变性中的作用提供了更广泛的观点和更深入的理解。
{"title":"Generalizability theory: a unified approach to assessing the dependability (reliability) of measurements in the health sciences.","authors":"D M VanLeeuwen,&nbsp;M D Barnes,&nbsp;M Pase","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The reliability of health promotion program evaluation measures, behavioral and attitudinal measures, and clinical measures is a concern to many health educators. Classical reliability coefficients, such as Cronbach's alpha, apply to narrowly defined, prespecified measurement situations. Classical theory does not provide adequate reliability assessments for criterion-referenced measures, for measurement situations having multiple sources of error, or for aggregate-level variables. Generalizability theory can be used to assess the reliability of measures in these situations that are not adequately modeled by Classical theory. Additionally, Generalizability theory affords a broader view and a deeper understanding of the dependability of measurements and the role of different sources of error in the variability of measures.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 4","pages":"302-25"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20715777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Team assessment utilizing a many-facet Rasch model. 利用多面Rasch模型的团队评估。
Pub Date : 1998-01-01
J M Allen, R E Schumacker

As organizations begin to implement work teams, their assessment will ultimately reflect compensation strategies that move away from individual assessment. This will involve not only using multiple raters, but also the use of multiple criteria. Team assessment using multiple raters and multiple criteria is therefore necessitated; however, this can produce differences in ratings due to the leniency or severity of the individual team raters. This study analyzed the ratings of individual members on 31 different teams across 12 different criteria of team performance. Utilizing the many-facet Rasch model, statistical differences between the teams and 12 criteria were calculated.

当组织开始实现工作团队时,他们的评估将最终反映出远离个人评估的补偿策略。这不仅涉及使用多个评级机构,还涉及使用多个标准。因此,需要使用多个评分者和多个标准进行团队评估;然而,这可能会由于个别团队评分者的宽容或严厉而产生评分差异。这项研究分析了31个不同团队的成员在12种不同的团队绩效标准下的评分。利用多面Rasch模型,计算了团队和12个标准之间的统计差异。
{"title":"Team assessment utilizing a many-facet Rasch model.","authors":"J M Allen,&nbsp;R E Schumacker","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>As organizations begin to implement work teams, their assessment will ultimately reflect compensation strategies that move away from individual assessment. This will involve not only using multiple raters, but also the use of multiple criteria. Team assessment using multiple raters and multiple criteria is therefore necessitated; however, this can produce differences in ratings due to the leniency or severity of the individual team raters. This study analyzed the ratings of individual members on 31 different teams across 12 different criteria of team performance. Utilizing the many-facet Rasch model, statistical differences between the teams and 12 criteria were calculated.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 2","pages":"142-58"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20580455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rasch measurement for reducing the items of the Nottingham Health Profile. 减少诺丁汉健康概况项目的Rasch测量。
Pub Date : 1998-01-01
L Prieto, J Alonso, R Lamarca, B D Wright

The present study aimed to develop a short form of the Spanish version of the Nottingham Health Profile (NHP) by means of Rasch analysis. Data from several Spanish studies that included the NHP since 1987 were collected in a common database. Forty-five different studies were included, covering a total of 9,419 subjects both from the general population and with different clinical pathologies. The overall questionnaire (38 items) was simultaneously analyzed using the dichotomous response model. Parameter estimates, model-data fit and separation statistics were computed. The items of the NHP were additionally regrouped into two different scales: Physical (19 items) and Psychological (19 items). Separated Physical and Psychological parameter estimates were produced using the simultaneous item calibrations as anchor values. Misfitting items were deleted, resulting in a 22 item final short form (NHP22)-11 Physical and 11 Psychological-. The evaluation of the item hierarchies confirmed the construct validity of the new questionnaire. To demonstrate the invariance of the NHP22 item calibrations, Rasch analyses were performed separately for each study included in the sample and for several sociodemographic and health status variables. Results confirmed the validity of using the NHP22 item calibrations to measure different groups of people categorized by gender, clinical and health status.

本研究旨在开发一种简短形式的西班牙语版的诺丁汉健康概况(NHP)通过拉赫分析的手段。自1987年以来,包括NHP在内的几项西班牙研究的数据被收集在一个共同的数据库中。纳入了45项不同的研究,涵盖了9419名来自普通人群和具有不同临床病理的受试者。问卷共38个题项,采用二分法同时分析。计算参数估计、模型数据拟合和分离统计。此外,NHP的项目被重新组合成两个不同的量表:生理(19项)和心理(19项)。分离的生理和心理参数估计是使用同时项目校准作为锚值产生的。不匹配的项目被删除,最终的简短表格(NHP22)包含22个项目-11 Physical和11 Psychological。项目层次的评估证实了新问卷的结构效度。为了证明NHP22项目校准的不变性,对样本中的每个研究以及几个社会人口和健康状况变量分别进行了Rasch分析。结果证实了使用NHP22项目校准来测量按性别、临床和健康状况分类的不同人群的有效性。
{"title":"Rasch measurement for reducing the items of the Nottingham Health Profile.","authors":"L Prieto,&nbsp;J Alonso,&nbsp;R Lamarca,&nbsp;B D Wright","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The present study aimed to develop a short form of the Spanish version of the Nottingham Health Profile (NHP) by means of Rasch analysis. Data from several Spanish studies that included the NHP since 1987 were collected in a common database. Forty-five different studies were included, covering a total of 9,419 subjects both from the general population and with different clinical pathologies. The overall questionnaire (38 items) was simultaneously analyzed using the dichotomous response model. Parameter estimates, model-data fit and separation statistics were computed. The items of the NHP were additionally regrouped into two different scales: Physical (19 items) and Psychological (19 items). Separated Physical and Psychological parameter estimates were produced using the simultaneous item calibrations as anchor values. Misfitting items were deleted, resulting in a 22 item final short form (NHP22)-11 Physical and 11 Psychological-. The evaluation of the item hierarchies confirmed the construct validity of the new questionnaire. To demonstrate the invariance of the NHP22 item calibrations, Rasch analyses were performed separately for each study included in the sample and for several sociodemographic and health status variables. Results confirmed the validity of using the NHP22 item calibrations to measure different groups of people categorized by gender, clinical and health status.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 4","pages":"285-301"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20715774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Job Responsibilities Scale: invariance in a longitudinal prospective study. 工作责任量表:纵向前瞻性研究中的不变性。
Pub Date : 1998-01-01
L H Ludlow, M E Lunz

The purpose of the present analysis was to determine the degree of invariance of the Job Responsibilities Scale from 1993 to 1995. Factor analyses were conducted on both year's data and nearly identical factor patterns were found. Rasch rating scale analyses were conducted and nearly identical pairs of item estimates were found. These results suggest that even though the overall frequency of performance on some medical technology laboratory tasks increased from 1993 to 1995, the relationships among the tasks themselves remained the same (invariant). This conclusion allows for a description of what it means to increase in level of personal job responsibility from year-to-year. In addition, these results suggest that at the conclusion of this prospective study it may be possible to objectively define the typical career mobility pattern of entry level medical technologists.

本分析的目的是确定1993年至1995年工作职责表的不变程度。对这两年的数据进行了因子分析,发现了几乎相同的因子模式。对Rasch量表进行了分析,发现了几乎相同的项目估计对。这些结果表明,尽管从1993年到1995年,一些医疗技术实验室任务的总体表现频率有所增加,但任务本身之间的关系保持不变(不变)。这一结论允许对个人工作责任水平逐年增加的含义进行描述。此外,这些结果表明,在本前瞻性研究的结论中,有可能客观地界定初级医疗技术人员的典型职业流动模式。
{"title":"The Job Responsibilities Scale: invariance in a longitudinal prospective study.","authors":"L H Ludlow,&nbsp;M E Lunz","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The purpose of the present analysis was to determine the degree of invariance of the Job Responsibilities Scale from 1993 to 1995. Factor analyses were conducted on both year's data and nearly identical factor patterns were found. Rasch rating scale analyses were conducted and nearly identical pairs of item estimates were found. These results suggest that even though the overall frequency of performance on some medical technology laboratory tasks increased from 1993 to 1995, the relationships among the tasks themselves remained the same (invariant). This conclusion allows for a description of what it means to increase in level of personal job responsibility from year-to-year. In addition, these results suggest that at the conclusion of this prospective study it may be possible to objectively define the typical career mobility pattern of entry level medical technologists.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 4","pages":"326-37"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20715778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying measurement disturbance effects using Rasch item fit statistics and the Logit Residual Index. 利用拉希项目拟合统计和罗吉特残差指数识别测量扰动效应。
Pub Date : 1998-01-01
R E Mount, R E Schumacker

A Monte Carlo study was conducted using simulated dichotomous data to determine the effects of guessing on Rasch item fit statistics (weighted total, unweighted total, and unweighted between fit statistics) and the Logit Residual Index (LRI). The data were simulated using 100 items, 100 persons, three levels of guessing (0%, 25%, and 50%), and two item difficulty distributions (normal and uniform). The results of the study indicated that no significant differences were found between the mean Rasch item fit statistics for each distribution type as the probability of guessing the correct answer increased. The mean item scores differed significantly with uniformly distributed item difficulties, but not normally distributed item difficulties. The LRI was more sensitive to large positive item misfit values associated with the unweighted total fit statistic than to similar values associated with the weighted total fit or unweighted between fit statistics. The greatest magnitude of change in LRI values (negative) was observed when the unweighted total fit statistic had large positive values greater than 2.4. The LRI statistic was most useful in identifying the linear trend in the residuals for each item, thereby indicating differences in ability groups, i.e. differential item functioning.

采用蒙特卡罗方法,利用模拟二分类数据确定猜测对Rasch项目拟合统计量(加权总数、未加权总数和拟合统计量之间的未加权总数)和Logit残差指数(LRI)的影响。数据模拟使用100个项目,100人,三个猜测水平(0%,25%和50%),以及两个项目难度分布(正态和均匀)。研究结果表明,随着猜测正确答案的概率增加,每种分布类型的平均Rasch项目拟合统计量之间没有显着差异。平均项目得分与均匀分布的项目难度差异显著,而与正态分布的项目难度差异不显著。LRI对与未加权总拟合统计相关的较大的正项目不拟合值比与加权总拟合或未加权间拟合统计相关的相似值更敏感。当未加权总拟合统计量大于2.4时,LRI值的变化幅度最大(负)。LRI统计在确定每个项目的残差的线性趋势方面最有用,从而表明能力组的差异,即差异项目功能。
{"title":"Identifying measurement disturbance effects using Rasch item fit statistics and the Logit Residual Index.","authors":"R E Mount,&nbsp;R E Schumacker","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A Monte Carlo study was conducted using simulated dichotomous data to determine the effects of guessing on Rasch item fit statistics (weighted total, unweighted total, and unweighted between fit statistics) and the Logit Residual Index (LRI). The data were simulated using 100 items, 100 persons, three levels of guessing (0%, 25%, and 50%), and two item difficulty distributions (normal and uniform). The results of the study indicated that no significant differences were found between the mean Rasch item fit statistics for each distribution type as the probability of guessing the correct answer increased. The mean item scores differed significantly with uniformly distributed item difficulties, but not normally distributed item difficulties. The LRI was more sensitive to large positive item misfit values associated with the unweighted total fit statistic than to similar values associated with the weighted total fit or unweighted between fit statistics. The greatest magnitude of change in LRI values (negative) was observed when the unweighted total fit statistic had large positive values greater than 2.4. The LRI statistic was most useful in identifying the linear trend in the residuals for each item, thereby indicating differences in ability groups, i.e. differential item functioning.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 4","pages":"338-50"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20715779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The functional assessment measure (FAM) in closed traumatic brain injury outpatients: a Rasch-based psychometric study. 闭合性外伤性脑损伤门诊患者功能评估量表(FAM):一项基于rasch的心理测量学研究。
Pub Date : 1998-01-01
L Tesio, A Cantagallo

The Functional Assessment Measure (FAM) has been proposed as a measure of disability in post-acute Traumatic Brain Injury (TBI) outpatients. It is comprised of the 18 items of The Functional Independence Measure (FIMSM), scored in terms of dependence, and of 12 newly designed items, scored in terms of dependence (7 items) or performance (5 items). The FIMSM covers the domains of self-care, sphincter management, mobility, locomotion, communication and social cognition. The 12 new items explore the domains of community integration, emotional status, orientation, attention, reading/writing skills, swallowing and speech intelligibility. By addressing a set of problems quite specific for TBI outpatients the FAM was intended to raise the ceiling of the FIMSM and to allow a more precise estimate of their disability. These claims, however, were never supported in previous studies. We administered the FAM to 60 TBI outpatient, 2-88 months (median 16) from trauma. Rasch analysis (rating scale model) was adopted to test the psychometric properties of the scale. The FAM was reliable (Rasch item and person reliability 0.91 and 0.93, respectively). Two of the 12 FAM-specific items were severely misfitting with the general construct, and were deleted. Within the 28-item refined FAM scale, 4 new items and 2 FIMSM items still retained signs of misfit. The FAM was on average too easy. The most difficult item (a new one, Employability) did not attain the average ability of the subjects. Also, it was only slightly more difficult than than the most difficult FIMSM item (Memory). The FAM does not seem to improve the FIMSM as a far as TBI outpatients are to be assessed.

功能评估量表(FAM)已被提议作为急性创伤性脑损伤(TBI)后门诊患者残疾的衡量标准。它由18个功能独立性量表(FIMSM)的依赖得分项目和12个新设计的依赖得分项目(7个项目)或性能得分项目(5个项目)组成。FIMSM涵盖了自我护理、括约肌管理、移动、运动、沟通和社会认知等领域。这12个新项目探讨了社区整合、情感状态、取向、注意力、阅读/写作技能、吞咽和言语可理解性等领域。通过解决TBI门诊患者的一系列非常具体的问题,FAM旨在提高FIMSM的上限,并允许对其残疾进行更精确的估计。然而,这些说法在以前的研究中从未得到支持。我们对60名创伤后2-88个月(中位16个月)的TBI门诊患者进行FAM治疗。采用Rasch分析(评定量表模型)对量表的心理测量特性进行检验。FAM是可靠的(Rasch项目和个人信度分别为0.91和0.93)。12个fam特异性项目中有2个与一般结构严重不匹配,被删除。在改良的28个FAM量表中,4个新条目和2个FIMSM条目仍然保留了不匹配的迹象。FAM平均来说太简单了。最难的项目(新设的“就业能力”)没有达到受试者的平均水平。而且,它只比最难的FIMSM项目(记忆)稍微难一点。FAM似乎并没有改善FIMSM,就TBI门诊患者进行评估而言。
{"title":"The functional assessment measure (FAM) in closed traumatic brain injury outpatients: a Rasch-based psychometric study.","authors":"L Tesio,&nbsp;A Cantagallo","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The Functional Assessment Measure (FAM) has been proposed as a measure of disability in post-acute Traumatic Brain Injury (TBI) outpatients. It is comprised of the 18 items of The Functional Independence Measure (FIMSM), scored in terms of dependence, and of 12 newly designed items, scored in terms of dependence (7 items) or performance (5 items). The FIMSM covers the domains of self-care, sphincter management, mobility, locomotion, communication and social cognition. The 12 new items explore the domains of community integration, emotional status, orientation, attention, reading/writing skills, swallowing and speech intelligibility. By addressing a set of problems quite specific for TBI outpatients the FAM was intended to raise the ceiling of the FIMSM and to allow a more precise estimate of their disability. These claims, however, were never supported in previous studies. We administered the FAM to 60 TBI outpatient, 2-88 months (median 16) from trauma. Rasch analysis (rating scale model) was adopted to test the psychometric properties of the scale. The FAM was reliable (Rasch item and person reliability 0.91 and 0.93, respectively). Two of the 12 FAM-specific items were severely misfitting with the general construct, and were deleted. Within the 28-item refined FAM scale, 4 new items and 2 FIMSM items still retained signs of misfit. The FAM was on average too easy. The most difficult item (a new one, Employability) did not attain the average ability of the subjects. Also, it was only slightly more difficult than than the most difficult FIMSM item (Memory). The FAM does not seem to improve the FIMSM as a far as TBI outpatients are to be assessed.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 2","pages":"79-96"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20580452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A research program for accountable and patient-centered health outcome measures. 负责任和以患者为中心的健康结果测量的研究项目。
Pub Date : 1998-01-01
W P Fisher

This article addresses the relevance of probabilistic conjoint (Rasch) measurement to five issues of accountability and patient-centeredness in health care. Goals for research, data quality standards, and standard metrics are proposed. The article is intended to begin to address concerns voiced by health care researchers, policy analysts, and the public about ways in which health care outcome measures can be improved.

本文讨论了概率联合(Rasch)测量与医疗保健中问责制和以患者为中心的五个问题的相关性。提出了研究目标、数据质量标准和标准度量。这篇文章旨在开始解决卫生保健研究人员、政策分析人员和公众对卫生保健结果措施可以改进的方式所表达的关切。
{"title":"A research program for accountable and patient-centered health outcome measures.","authors":"W P Fisher","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This article addresses the relevance of probabilistic conjoint (Rasch) measurement to five issues of accountability and patient-centeredness in health care. Goals for research, data quality standards, and standard metrics are proposed. The article is intended to begin to address concerns voiced by health care researchers, policy analysts, and the public about ways in which health care outcome measures can be improved.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 3","pages":"222-39"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20627035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effect of item pool restriction on the precision of ability measurement for a Rasch-based CAT: comparisons to traditional fixed length examinations. 题目池限制对基于rasch的CAT能力测量精度的影响:与传统固定长度考试的比较。
Pub Date : 1998-01-01
P N Halkitis

This paper describes a method for examining the precision of a computerized adaptive test with a limited item pool. Standard errors of measurement ascertained in the testing of simulees with a CAT using a restricted pool were compared to the results obtained in a live paper-and-pencil achievement testing of 4494 nursing students on four versions of an examination of calculations of drug administration. CAT measures of precision were considered when the simulated examine pools were uniform and normal. Precision indices were also considered in terms of the number of CAT items required to reach the precision of the traditional tests. Results suggest that regardless of the size of the item pool, CAT provides greater precision in measurement with a smaller number of items administered even when the choice of items is limited but fails to achieve equiprecision along the entire ability continuum.

本文描述了一种用有限题库检验计算机化自适应测试精度的方法。用有限池的CAT对模拟人进行测试,确定了测量的标准误差,并将其与4494名护理学生在四种版本的药物给药计算考试中纸笔成绩测试的结果进行了比较。当模拟检验池均匀且正态时,考虑CAT的精度度量。根据达到传统测试精度所需的CAT项目数量,还考虑了精度指标。结果表明,无论项目池的大小,即使在项目选择有限的情况下,CAT也能以较少的项目数量提供更高的测量精度,但不能在整个能力连续体上实现等精度。
{"title":"The effect of item pool restriction on the precision of ability measurement for a Rasch-based CAT: comparisons to traditional fixed length examinations.","authors":"P N Halkitis","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This paper describes a method for examining the precision of a computerized adaptive test with a limited item pool. Standard errors of measurement ascertained in the testing of simulees with a CAT using a restricted pool were compared to the results obtained in a live paper-and-pencil achievement testing of 4494 nursing students on four versions of an examination of calculations of drug administration. CAT measures of precision were considered when the simulated examine pools were uniform and normal. Precision indices were also considered in terms of the number of CAT items required to reach the precision of the traditional tests. Results suggest that regardless of the size of the item pool, CAT provides greater precision in measurement with a smaller number of items administered even when the choice of items is limited but fails to achieve equiprecision along the entire ability continuum.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 2","pages":"97-122"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20580453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing nonadditive conjoint structures: compounding events by Rasch model probabilities. 分析非加性联合结构:用Rasch模型概率复合事件。
Pub Date : 1998-01-01
G Karabatsos

The following study proposes a Rasch method to measure variables of nonadditive conjoint structures, where dichotomous response combinations are evaluated. In this framework, both the number of endorsed items and their latent positions are considered. This is different from the cumulative response process (measurable by the Rasch model), where the probability of a positive response to an item with measure delta iota is considered a monotonic increasing function of the person's measure beta nu. This is also unlike the unfolding framework, where the probability of a positive response is maximum when beta nu = delta iota, and monotonically decreases as magnitude of beta nu-delta iota approaches infinity. The method involves four steps. In Step 1, items are scaled by the Rasch model for paired comparisons to produce a variable definition. These scale values serve as a basis for Steps 2 and 4. In Step 2, the nonadditive conjoint system is restructured to additive. The quantitative hypothesis of the restructured data is tested by the axioms of conjoint measurement theory in Step 3. This data is then analyzed by the Rasch rating scale model in Step 4 to evaluate individual response combinations, using the Step 1 item calibrations as anchors. The method was applied to simulated person responses of the Schedule of Recent Events (Holmes and Rahe, 1967). The results suggest that the method is useful and effective. It scales items with a robust method of paired comparisons, ensures additivity and quantification of the conjoint person-item matrix, produces a reasonable ordering of person measures from the perspective of individual response combinations, and provides satisfactory person and item separation (i.e., reliability). Furthermore, the restructured data reproduces SRE item scale values obtained by paired comparisons in Step 1.

下面的研究提出了一种Rasch方法来测量非可加性连接结构的变量,其中二元响应组合进行了评估。在这一框架内,核可的项目数目及其潜在的立场都要加以考虑。这与累积反应过程(通过Rasch模型可测量)不同,在累积反应过程中,对测量值为delta iota的项目做出积极反应的概率被认为是一个单调的递增函数。这也不同于展开的框架,在展开的框架中,当β nu = δ iota时,正响应的概率是最大的,并且随着β nu- δ iota的大小接近无穷大而单调减少。该方法包括四个步骤。在步骤1中,通过Rasch模型对项目进行缩放,以进行配对比较,从而产生变量定义。这些比例值作为步骤2和步骤4的基础。第二步,将非加性联结系统重构为加性联结系统。在步骤3中,利用联合测量理论的公理对重构数据的定量假设进行检验。然后使用步骤1的项目校准作为锚点,通过步骤4中的Rasch评分量表模型分析这些数据以评估个人反应组合。将该方法应用于模拟人对近期事件表的反应(Holmes和Rahe, 1967)。结果表明,该方法是实用、有效的。它用一种稳健的配对比较方法来衡量项目,确保联合人-项目矩阵的可加性和量化性,从个体反应组合的角度产生合理的人测量顺序,并提供令人满意的人与项目分离(即可靠性)。此外,重组后的数据再现了步骤1中通过配对比较获得的SRE项目量表值。
{"title":"Analyzing nonadditive conjoint structures: compounding events by Rasch model probabilities.","authors":"G Karabatsos","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The following study proposes a Rasch method to measure variables of nonadditive conjoint structures, where dichotomous response combinations are evaluated. In this framework, both the number of endorsed items and their latent positions are considered. This is different from the cumulative response process (measurable by the Rasch model), where the probability of a positive response to an item with measure delta iota is considered a monotonic increasing function of the person's measure beta nu. This is also unlike the unfolding framework, where the probability of a positive response is maximum when beta nu = delta iota, and monotonically decreases as magnitude of beta nu-delta iota approaches infinity. The method involves four steps. In Step 1, items are scaled by the Rasch model for paired comparisons to produce a variable definition. These scale values serve as a basis for Steps 2 and 4. In Step 2, the nonadditive conjoint system is restructured to additive. The quantitative hypothesis of the restructured data is tested by the axioms of conjoint measurement theory in Step 3. This data is then analyzed by the Rasch rating scale model in Step 4 to evaluate individual response combinations, using the Step 1 item calibrations as anchors. The method was applied to simulated person responses of the Schedule of Recent Events (Holmes and Rahe, 1967). The results suggest that the method is useful and effective. It scales items with a robust method of paired comparisons, ensures additivity and quantification of the conjoint person-item matrix, produces a reasonable ordering of person measures from the perspective of individual response combinations, and provides satisfactory person and item separation (i.e., reliability). Furthermore, the restructured data reproduces SRE item scale values obtained by paired comparisons in Step 1.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 3","pages":"191-221"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20627034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Factor structure and dimensionality of the multidimensional health locus of control scales in measuring adults with epilepsy. 成人癫痫患者多维健康控制位点量表的因素结构与维度。
Pub Date : 1998-01-01
S Gehlert, C H Chang

External locus of control has been implicated in the development of psychosocial problems in epilepsy, and adults with epilepsy exhibit scores that are more external than those of the normative sample of the Multidimensional Health Locus of Control (MHLC) scales. Although the MHLC scales has the potential to be quite useful in the assessment and treatment of adults with epilepsy, it has not been assessed psychometrically using data from persons with epilepsy. The present study examined the internal consistency, factor structure, and construct validity of the scales using data from a survey of 143 adults with epilepsy. Results from reliability analysis, confirmatory factor analysis, and Rasch analysis supported the hypothesized three-factor structure of the measure, which was internally reliable and factorially valid.

外部控制点与癫痫患者心理社会问题的发展有关,成人癫痫患者在多维健康控制点(MHLC)量表中表现出比标准样本更高的外部得分。尽管MHLC量表在成人癫痫患者的评估和治疗中具有相当有用的潜力,但尚未使用癫痫患者的数据对其进行心理测量学评估。本研究利用143例成人癫痫患者的调查数据,检验了量表的内部一致性、因素结构和结构效度。信度分析、验证性因子分析和Rasch分析的结果支持假设的三因素结构,该测量具有内部信度和因子效度。
{"title":"Factor structure and dimensionality of the multidimensional health locus of control scales in measuring adults with epilepsy.","authors":"S Gehlert,&nbsp;C H Chang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>External locus of control has been implicated in the development of psychosocial problems in epilepsy, and adults with epilepsy exhibit scores that are more external than those of the normative sample of the Multidimensional Health Locus of Control (MHLC) scales. Although the MHLC scales has the potential to be quite useful in the assessment and treatment of adults with epilepsy, it has not been assessed psychometrically using data from persons with epilepsy. The present study examined the internal consistency, factor structure, and construct validity of the scales using data from a survey of 143 adults with epilepsy. Results from reliability analysis, confirmatory factor analysis, and Rasch analysis supported the hypothesized three-factor structure of the measure, which was internally reliable and factorially valid.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"2 3","pages":"173-90"},"PeriodicalIF":0.0,"publicationDate":"1998-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20627033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of outcome measurement
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1