The "motor" (activities of daily living) component of the FONE FIM, the telephone version of the Functional Independence Measure (FIM) was evaluated in a cohort of 132 patients who had been discharged to home from a geriatric inpatient assessment and rehabilitation program. In the current study, Rasch person ability measures were derived from telephone assessments 5 weeks after discharge and in-home assessments 1 week later. Concordance between the modes was shown to be satisfactory for the Rasch measures based on intraclass correlation coefficients. However, the telephone mode consistently generated lower estimates than did the observational mode. This was due to the fact that the telephone mode underestimated motor function for the majority of patients who were at higher levels of cognition and motor function, but overestimated for patients who were at lower levels of cognition and motor function. At the item level, concordance, as determined by Kappa statistics, was better when the FONE FIM responses came from the patient rather than proxy respondents, and when the assessments were done by more experienced rather than less experienced raters. Based on these findings, a mixed strategy, the telephone mode for patients capable of responding to the FONE FIM and in-home assessments for those who are incapable, is recommended.
{"title":"Evaluating the FONE FIM: Part II. Concurrent validity & influencing factors.","authors":"W C Chang, C Chan, S E Slaughter, D Cartwright","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The \"motor\" (activities of daily living) component of the FONE FIM, the telephone version of the Functional Independence Measure (FIM) was evaluated in a cohort of 132 patients who had been discharged to home from a geriatric inpatient assessment and rehabilitation program. In the current study, Rasch person ability measures were derived from telephone assessments 5 weeks after discharge and in-home assessments 1 week later. Concordance between the modes was shown to be satisfactory for the Rasch measures based on intraclass correlation coefficients. However, the telephone mode consistently generated lower estimates than did the observational mode. This was due to the fact that the telephone mode underestimated motor function for the majority of patients who were at higher levels of cognition and motor function, but overestimated for patients who were at lower levels of cognition and motor function. At the item level, concordance, as determined by Kappa statistics, was better when the FONE FIM responses came from the patient rather than proxy respondents, and when the assessments were done by more experienced rather than less experienced raters. Based on these findings, a mixed strategy, the telephone mode for patients capable of responding to the FONE FIM and in-home assessments for those who are incapable, is recommended.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 4","pages":"259-85"},"PeriodicalIF":0.0,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
It is often impossible to validate cut scores set using judged item review methods due to the fact that many high stakes testing programs attempt to limit the number of common items across consecutively administered forms. However, over time, with a stable item pool, secondary links through other test administrations allow the use of common item equating to test the stability of the judged cut scores. In this study five forms of a basic science examination administered over a three year period in a national board testing program were analyzed to determine the stability of judged cut scores. The stability was determined by comparison of the judged cut scores with the equated cut scores derived by the Rasch common item equating technique. The results indicate cut scores derived from the modified Nedelsky procedure were within equating error of the Rasch equated cut scores over five administrations.
{"title":"Validating standard setting with a modified nedelsky procedure through common item test equating.","authors":"R M Smith, L J Gross","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>It is often impossible to validate cut scores set using judged item review methods due to the fact that many high stakes testing programs attempt to limit the number of common items across consecutively administered forms. However, over time, with a stable item pool, secondary links through other test administrations allow the use of common item equating to test the stability of the judged cut scores. In this study five forms of a basic science examination administered over a three year period in a national board testing program were analyzed to determine the stability of judged cut scores. The stability was determined by comparison of the judged cut scores with the equated cut scores derived by the Rasch common item equating technique. The results indicate cut scores derived from the modified Nedelsky procedure were within equating error of the Rasch equated cut scores over five administrations.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 2","pages":"164-72"},"PeriodicalIF":0.0,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"20579307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}