Isobel Landray, James Carpenter, Kaveh Vahdani, Katherine Miszkiel, Lakshmi A Ratnam, Geoffrey E Rose
Reproducibility of the Unaided Subjective Assessment of Orbital Computed X-Ray Tomographic Features in Thyroid Eye Disease
Ophthalmic Plastic and Reconstructive Surgery (impact factor 1.2; JCR Q3, Ophthalmology). Journal Article, published 2025-03-13. DOI: 10.1097/IOP.0000000000002929 (https://doi.org/10.1097/IOP.0000000000002929)
Citations: 0
Abstract
Purpose: To assess the reproducibility of subjective interpretation of computed x-ray tomography for 8 features associated with thyroid eye disease.
Methods: Patients with confirmed thyroid eye disease had 3 distinct orbital computed x-ray tomography sections presented as anonymized montages to 3 masked observers (#1 orbital radiologist, #2 general radiologist, and #3 orbital surgeon). Eight features were graded: superior orbital fissure clarity, degree of orbital fat prolapse through the superior orbital fissure, loss of fat space at the apex, muscle enlargement, increase in orbital fat volume, vascular congestion, superior ophthalmic vein size, and lamina papyracea bowing. Thirty montages were randomly triplicated within the completed image-testing-file.
Results: Each observer provided 3296 assessments of montages from 146 patients (68% female). Observer #2 had the highest rate of "indeterminate" gradings (13.3%), while #1 had the lowest (6.7%). For intraobserver agreement, the kappa statistics were "substantial" to "almost perfect" for apical crowding, muscular enlargement, and medial bowing, whereas orbital fat expansion and vascular congestion showed only "slight" to "moderate" agreement. Excluding superior ophthalmic vein size (where indeterminacy was too great for statistical analysis), there was a wide and statistically significant interobserver variation for the other 7 features, with no consistent ranking of observer scores.
Conclusions: Subjective interpretation of computed x-ray tomography images for patients with thyroid eye disease has high variability, particularly for interobserver comparisons. Only the assessment of apical crowding, muscular enlargement, and bowing of the lamina papyracea showed fairly consistent intraobserver gradings. The results suggest that variability in the interpretation of such images might only be improved with the use of objective measures applied to the computed x-ray tomography images.
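The agreement categories quoted in the Results ("slight", "moderate", "substantial", "almost perfect") match the conventional Landis and Koch labels for the kappa statistic. The abstract does not say which kappa variant or weighting the authors used, so the following is only a minimal sketch. It assumes an unweighted Cohen's kappa between two readings of the same montages by one observer. The function names and the example grades are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Unweighted Cohen's kappa for two equal-length lists of categorical grades."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty readings of equal length")
    n = len(first)
    # Observed agreement: fraction of items given the same grade in both readings.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement: product of the marginal grade frequencies, summed over grades.
    count_first = Counter(first)
    count_second = Counter(second)
    expected = sum(count_first[g] * count_second[g] for g in count_first) / (n * n)
    if expected == 1.0:
        return 1.0  # both readings used one identical grade throughout
    return (observed - expected) / (1.0 - expected)


def landis_koch_label(kappa):
    """Map a kappa value to the conventional Landis and Koch descriptor."""
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Hypothetical example: one observer grades the same 20 montages twice
# for a single feature, disagreeing with the first reading on one montage.
reading_1 = ["present"] * 10 + ["absent"] * 10
reading_2 = ["present"] * 11 + ["absent"] * 9
kappa = cohen_kappa(reading_1, reading_2)
print(round(kappa, 2), landis_koch_label(kappa))
```

Because three readings exist for the triplicated montages, a multi-rater statistic such as Fleiss' kappa could equally have been used; the sketch only illustrates how a kappa value relates to the descriptive labels.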
About the journal:
Ophthalmic Plastic and Reconstructive Surgery features original articles and reviews on topics such as ptosis, eyelid reconstruction, orbital diagnosis and surgery, lacrimal problems, and eyelid malposition. Update reports on diagnostic techniques, surgical equipment and instrumentation, and medical therapies are included, as well as detailed analyses of recent research findings and their clinical applications.