Journal of outcome measurement: latest articles

Distractors--can they be biased too?
Pub Date : 1999-01-01
S Alagumalai, J P Keeves

Much work has been done on item bias and differential item functioning. Although there is some research on distractor analysis, no detailed study has attempted to examine the way the distractors in an item function, with regard to comparing distractor performance. This paper examines how distractors function differentially and compares several methods for identifying such differences, including the Pearson chi-square, likelihood ratio chi-square, and Neyman weighted least squares chi-square tests. Possible causes of distractor bias are discussed with illustrations from a physics problem-solving scale.
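
The three statistics named in the abstract can be illustrated on a group-by-distractor table of counts. The sketch below is not the authors' procedure: the counts are invented, the function names are hypothetical, and it assumes the "Neyman" statistic takes the usual modified chi-square form, with observed rather than expected counts in the denominator. Every cell must be nonzero for the likelihood ratio and Neyman terms to be defined.

```python
import math


def expected_counts(table):
    """Expected cell counts under independence of group and distractor choice."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_statistics(table):
    """Return Pearson, likelihood ratio, and Neyman statistics for a table.

    Rows are groups, columns are distractors; all observed counts must be > 0.
    """
    expected = expected_counts(table)
    pearson = likelihood_ratio = neyman = 0.0
    for obs_row, exp_row in zip(table, expected):
        for o, e in zip(obs_row, exp_row):
            pearson += (o - e) ** 2 / e                    # (O - E)^2 / E
            likelihood_ratio += 2.0 * o * math.log(o / e)  # 2 * O * ln(O / E)
            neyman += (o - e) ** 2 / o                     # (O - E)^2 / O
    df = (len(table) - 1) * (len(table[0]) - 1)
    return {
        "pearson": pearson,
        "likelihood_ratio": likelihood_ratio,
        "neyman": neyman,
        "df": df,
    }


# Hypothetical counts: two groups choosing among three distractors.
distractor_counts = [
    [30, 10, 20],
    [10, 30, 20],
]
stats = chi_square_statistics(distractor_counts)
```

Each statistic would be referred to a chi-square distribution with `df` degrees of freedom; when the groups choose distractors in the same proportions, all three are zero.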

Source: Journal of outcome measurement, vol. 3, no. 1, pp. 89-102. Journal Article.
Teacher receptivity to a system-wide change in a centralized education system: a Rasch measurement model analysis.
Pub Date : 1999-01-01
R F Waugh

The Education Department of Western Australia has implemented a new system called Student Outcome Statements, by trial in 1995/1996, then on a voluntary basis from 1997, with the intention of making it mandatory after 2001. The system describes, in order, the outcomes that students are expected to achieve in eight broad learning areas. The study has three aims. One, to create a scale for teacher receptivity to the use of Student Outcome Statements, based on eight orientations to receptivity: evaluative attitudes, behavior intentions, feelings towards Student Outcome Statements compared to the previous system, the benefits of the new system, support from significant others, alleviation of concerns, collaboration with other teachers, and involvement in decision-making. Two, to analyze the psychometric properties of the scale using the Extended Logistic Model of Rasch (Andrich, 1988; Rasch, 1960/1980) with the computer program RUMM (Andrich, Sheridan & Luo, 1997). Three, to provide advice to decision-makers about how better to implement the system of Student Outcome Statements.

Source: Journal of outcome measurement, vol. 3, no. 1, pp. 71-88. Journal Article.
Parameter recovery for the rating scale model using PARSCALE.
Pub Date : 1999-01-01
G A French, B G Dodd

The purpose of the present study was to investigate item and trait parameter recovery for Andrich's rating scale model using the PARSCALE computer program. The four factors upon which the simulated data matrices varied were (a) the distribution of the scale values for the items (skewed or uniform), (b) the number of category response options (4 or 5), (c) the distribution of known trait levels (normal or skewed), and (d) the sample size (60, 125, 250, 500, or 1,000). Each condition was replicated 10 times resulting in 400 data matrices. Accurate item and trait parameter estimates were obtained for all sample sizes examined. As expected, sample size seemed to have little influence on the recovery of trait parameters but did influence item parameter recovery. The distribution of known trait levels did not seriously impact the item parameter recovery. It was concluded that Andrich's rating scale model allows for the use of considerably smaller calibration samples than are typically recommended for other polytomous IRT models.
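
As a rough illustration of the design described above, the sketch below enumerates the four crossed factors and confirms the count of 400 data matrices (2 x 2 x 2 x 5 conditions, 10 replications each). It also gives the category probabilities of Andrich's rating scale model in their standard form. The variable and function names are hypothetical, the example parameter values are invented, and nothing here reproduces PARSCALE or the authors' simulation code.

```python
import itertools
import math

# Factor levels as listed in the abstract.
scale_value_distributions = ["skewed", "uniform"]
category_counts = [4, 5]
trait_distributions = ["normal", "skewed"]
sample_sizes = [60, 125, 250, 500, 1000]
REPLICATIONS = 10

conditions = list(
    itertools.product(
        scale_value_distributions,
        category_counts,
        trait_distributions,
        sample_sizes,
    )
)
n_matrices = len(conditions) * REPLICATIONS  # 40 conditions x 10 replications


def rsm_category_probabilities(theta, location, thresholds):
    """Rating scale model probabilities for categories 0..len(thresholds).

    theta: person trait level; location: item scale value;
    thresholds: category thresholds shared by all items.
    """
    # Category k has logit sum_{j<=k} (theta - location - tau_j); category 0 has 0.
    logits = [0.0]
    for tau in thresholds:
        logits.append(logits[-1] + (theta - location - tau))
    largest = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(value - largest) for value in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Invented example: a 4-category item with symmetric thresholds.
probabilities = rsm_category_probabilities(0.0, 0.0, [-1.0, 0.0, 1.0])
```

With the trait level equal to the item location and symmetric thresholds, the category probabilities are symmetric around the middle of the scale.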

Source: Journal of outcome measurement, vol. 3, no. 2, pp. 176-99. Journal Article.
Mapping variables.
Pub Date : 1999-01-01
M H Stone, B D Wright, A J Stenner

This paper describes Mapping Variables, the principal technique for planning and constructing a test or rating instrument. A variable map is also useful for interpreting results. Modest reference is made to the history of mapping leading to its importance in psychometrics. Several maps are given to show the importance and value of mapping a variable by person and item data. The need for a critical appraisal of maps is also stressed.

Source: Journal of outcome measurement, vol. 3, no. 4, pp. 308-22. Journal Article.
Many-facet Rasch analysis with crossed, nested, and mixed designs.
Pub Date : 1999-01-01
R E Schumacker

Many-facet Rasch analysis provides the bases for making fair and meaningful decisions from individual ratings by judges on tasks. The typical measurement design employed in a many-facet Rasch analysis has judges crossed with other facets or conditions of measurement. A nested design does not permit facets to be compared. However, a mixed design can be used to achieve a common vertical ruler when the frame of reference permits commensurate measures to be linked. Examples of crossed, nested, and mixed designs are presented to illustrate how a many-facet Rasch analysis can be modified to meet the connectivity requirement for comparing facet measures.
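
The connectivity requirement can be pictured as a graph problem: judges and examinees are nodes, each rating is a link, and measures are comparable only within a linked subset. The sketch below is a simplified two-facet proxy for that idea, not the subset detection used by any Rasch program. The function name and the small designs are hypothetical.

```python
def count_connected_subsets(ratings):
    """Count disjoint subsets in a list of (judge, examinee) rating pairs."""
    parent = {}

    def find(node):
        # Union-find lookup with path halving.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for judge, examinee in ratings:
        root_judge = find(("judge", judge))
        root_examinee = find(("examinee", examinee))
        if root_judge != root_examinee:
            parent[root_judge] = root_examinee  # merge the two subsets

    return len({find(node) for node in list(parent)})


examinees = ("E1", "E2", "E3", "E4")

# Crossed: every judge rates every examinee.
crossed = [(judge, e) for judge in ("J1", "J2") for e in examinees]

# Nested: each judge rates a separate group of examinees.
nested = [("J1", "E1"), ("J1", "E2"), ("J2", "E3"), ("J2", "E4")]

# Mixed: the nested design plus one overlapping rating that links the groups.
mixed = nested + [("J2", "E2")]

subset_counts = {
    "crossed": count_connected_subsets(crossed),
    "nested": count_connected_subsets(nested),
    "mixed": count_connected_subsets(mixed),
}
```

In this toy example the nested design splits into two disjoint subsets, while a single overlapping rating in the mixed design is enough to join them into one.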

Source: Journal of outcome measurement, vol. 3, no. 4, pp. 323-38. Journal Article.
Grades of severity and the validation of an atopic dermatitis assessment measure (ADAM).
Pub Date : 1999-01-01
D P Charman, G A Varigos

There has generally been a dearth of good clinical descriptions of grades of disease severity. The aim of this study was to produce reliable and valid descriptions of grades of severity of Atopic Dermatitis (AD). The AD Assessment Measure (ADAM) was used to assess AD severity in 171 male and female paediatric patients (mean age = 54 months) at the Royal Children's Hospital in Melbourne, Australia. The assessments were subjected to Partial Credit analyses to produce clinically relevant "word pictures" of grades of severity of AD. Patterns of AD were shown to vary according to age, sex and severity. These descriptions will be useful for clinical training and research. Moreover, the approach to validation adopted here has important implications for the future of measurement in medicine.

Source: Journal of outcome measurement, vol. 3, no. 2, pp. 162-75. Journal Article.
Round-off error, blind faith, and the powers that be: a caution on numerical error in coefficients for polynomial curves fit to psychophysical data.
Pub Date : 1998-01-01
V J Samar, C L De Filippo

Graphing and statistics software often permits users to fit polynomial curves, like a parabola or sigmoid, to scatter plots of psychophysical data points. These programs typically calculate the curve using double- or extended-precision numerical algorithms and display the resulting curve overlaid graphically on the scatter plot, but they may simultaneously display the equation that generates that curve with numerical coefficients that have been rounded off to only a few decimal places. If this equation is used for experimental or clinical applications, the round-off error, especially on coefficients for the higher powers, can produce anomalous findings due to systematic and extreme distortions of the fitted curve, even artifactually reversing the algebraic sign of the true slope of the fitted curve at particular data points. Care must be exercised in setting round-off criteria for coefficients of polynomial terms in curve-fit equations to avoid nonsensical measurement and prediction.
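
The effect described above is easy to reproduce. In the sketch below, a cubic is evaluated once with its full coefficients and once with the coefficients rounded to two decimal places, as a program might display them. The coefficients and the evaluation point are invented for illustration and do not come from the paper; the function names are hypothetical.

```python
def poly_eval(coeffs, x):
    """Evaluate a polynomial by Horner's rule; coeffs run from highest power down."""
    result = 0.0
    for c in coeffs:
        result = result * x + c
    return result


def poly_derivative(coeffs):
    """Coefficients of the derivative, highest power first."""
    degree = len(coeffs) - 1
    return [c * (degree - i) for i, c in enumerate(coeffs[:-1])]


# Hypothetical fitted cubic: 0.0004*x^3 - 0.04*x^2 + 1.5*x + 10
full = [0.0004, -0.04, 1.5, 10.0]

# The same equation as displayed with two decimal places:
# the cubic coefficient rounds to zero.
displayed = [round(c, 2) for c in full]

x = 80.0
value_full = poly_eval(full, x)
value_displayed = poly_eval(displayed, x)
slope_full = poly_eval(poly_derivative(full), x)
slope_displayed = poly_eval(poly_derivative(displayed), x)
```

At `x = 80` the full-precision curve gives about 78.8 with a positive slope, while the displayed equation gives about -126 with a negative slope. The rounded coefficient on the highest power looks negligible in print but is multiplied by `x` cubed, which is why the distortion grows so quickly with the size of the predictor.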

Source: Journal of outcome measurement, vol. 2, no. 2, pp. 159-67. Journal Article.
Man is the measure ... the measurer.
Pub Date : 1998-01-01
M H Stone

Measures originated from human anatomy. Metrology has moved from man the measure to man the measurer. This transformation is documented using examples taken from the history of metrology. The outcome measures are units constructed and maintained for their utility, constancy and generality.

Source: Journal of outcome measurement, vol. 2, no. 1, pp. 25-32. Journal Article.
Evidence for the validity of a Rasch model technique for identifying differential item functioning.
Pub Date : 1998-01-01
J D Scheuneman, R G Subhiyah

This paper presents an analysis of differential item functioning (DIF) in a certification examination for a medical specialty. The groups analyzed were (1) physicians from different subspecialties within this area and (2) physicians who qualified for the examination through two different experiential pathways. The DIF analyses were performed using a simple Rasch model procedure. The results were shown to be readily interpretable in terms of the known differences between the groups being compared. These results serve as validity evidence for the Rasch model procedure as a means for evaluating DIF in examinations. The conclusion is drawn that complex procedures are not required to generate interpretable results if relevant differences between the groups being compared are known. This suggests that the inability of many researchers to interpret results for racial/ethnic or gender groups is not due to inadequacies of the methods, but more likely to a lack of pertinent knowledge about group differences.
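
The abstract does not say which Rasch procedure was used. One common simple approach, shown here only as an assumption, calibrates the items separately in each group, places the difficulties on a common scale, and divides the difference between an item's two difficulty estimates by its joint standard error. The function names, the flagging threshold of 2.0, and the item values are all hypothetical.

```python
import math


def dif_contrast(difficulty_a, se_a, difficulty_b, se_b):
    """Return the difficulty contrast between two groups and its standardized value.

    Both difficulties are assumed to be on a common logit scale already.
    """
    contrast = difficulty_a - difficulty_b
    joint_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return contrast, contrast / joint_se


def flag_dif(items, threshold=2.0):
    """Names of items whose standardized contrast reaches the threshold."""
    flagged = []
    for name, (d_a, se_a, d_b, se_b) in items.items():
        _, standardized = dif_contrast(d_a, se_a, d_b, se_b)
        if abs(standardized) >= threshold:
            flagged.append(name)
    return flagged


# Invented estimates: (difficulty in group A, SE, difficulty in group B, SE)
example_items = {
    "item_01": (0.10, 0.15, 0.05, 0.16),
    "item_02": (1.00, 0.20, 0.40, 0.20),
}
flagged_items = flag_dif(example_items)
```

A flag produced this way is only a starting point; as the abstract argues, interpreting it depends on knowing how the compared groups actually differ.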

Source: Journal of outcome measurement, vol. 2, no. 1, pp. 33-42. Journal Article.
Controlling the judge variable in grading essay-type items: an application of Rasch analyses to the recruitment exam for Korean public school teachers.
Pub Date : 1998-01-01
S Chae

The purpose of this paper is to show how the Rasch measurement model can be used to control the effects of the judge variable on the grading of essay-type items in the recruitment test for Korean teachers. Special attention is given to two aspects of judges' involvement in the grading. One is to identify a way to minimize the variation in grading due to judge severity. The other is to find a way to reduce the number of judges without threatening the objectivity of ability estimates. Results from the FACETS analyses not only show how much grading standards vary among judges and how to adjust for them, but also yield comparably reliable ability estimates with fewer judges.

Source: Journal of outcome measurement, vol. 2, no. 2, pp. 123-41. Journal Article.