{"title":"[18F]氟替美托 PET 图像视觉阅读的互译一致性和可变性。","authors":"Akinori Takenaka, Takashi Nihashi, Keita Sakurai, Keiji Notomi, Hokuto Ono, Yoshitaka Inui, Shinji Ito, Yutaka Arahata, Akinori Takeda, Kazunari Ishii, Kenji Ishii, Kengo Ito, Hiroshi Toyama, Akinori Nakamura, Takashi Kato","doi":"10.1007/s12149-024-01977-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The purpose of this study was to validate the concordance of visual ratings of [18F] flutemetamol amyloid positron emission tomography (PET) images and to investigate the correlation between the agreement of each rater and the Centiloid (CL) scale.</p><p><strong>Methods: </strong>A total of 192 participants, clinically classified as cognitively normal (CN) (n = 59), mild cognitive impairment (MCI) (n = 65), Alzheimer's disease (AD) (n = 55), or non-AD dementia (n = 13), participated in this study. Three experts conducted visual ratings of the amyloid PET images for all 192 patients, assigning a confidence level to each rating on a three-point scale (certain, probable, or neither). The positive or negative determination of amyloid PET results was made by majority vote. The CL value was calculated using the CapAIBL pipeline.</p><p><strong>Results: </strong>Overall, 101 images were determined to be positive, and 91 images were negative. Of the 101 positive images, the three raters were in complete agreement for 92 images and in disagreement for 9 images. Of the 91 negative images, the three raters were in complete agreement for 75 images and in disagreement for 16 images. Interrater reliability among the three experts was particularly high, with both Fleiss' kappa and Conger's kappa measuring 0.83 (0.76-0.89). The CL values of the unanimous positive group were significantly greater than those of the other groups, whereas the CL values of the unanimous negative group were significantly lower than those of the other groups. Images with rater disagreement had intermediate CLs. In cases with a high confidence level, the positive or negative visual ratings were in almost complete agreement. However, as confidence levels decreased, experts' visual ratings became more variable. The lower the confidence level was, the greater the number of cases with disagreement in the visual ratings.</p><p><strong>Conclusion: </strong>Three experts independently rated 192 amyloid PET images, achieving a high level of interrater agreement. However, in patients with intermediate amyloid accumulation, visual ratings varied. Therefore, determining positive and negative decisions in these patients should be performed with caution.</p>","PeriodicalId":8007,"journal":{"name":"Annals of Nuclear Medicine","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interrater agreement and variability in visual reading of [18F] flutemetamol PET images.\",\"authors\":\"Akinori Takenaka, Takashi Nihashi, Keita Sakurai, Keiji Notomi, Hokuto Ono, Yoshitaka Inui, Shinji Ito, Yutaka Arahata, Akinori Takeda, Kazunari Ishii, Kenji Ishii, Kengo Ito, Hiroshi Toyama, Akinori Nakamura, Takashi Kato\",\"doi\":\"10.1007/s12149-024-01977-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>The purpose of this study was to validate the concordance of visual ratings of [18F] flutemetamol amyloid positron emission tomography (PET) images and to investigate the correlation between the agreement of each rater and the Centiloid (CL) scale.</p><p><strong>Methods: </strong>A total of 192 participants, clinically classified as cognitively normal (CN) (n = 59), mild cognitive impairment (MCI) (n = 65), Alzheimer's disease (AD) (n = 55), or non-AD dementia (n = 13), participated in this study. Three experts conducted visual ratings of the amyloid PET images for all 192 patients, assigning a confidence level to each rating on a three-point scale (certain, probable, or neither). The positive or negative determination of amyloid PET results was made by majority vote. The CL value was calculated using the CapAIBL pipeline.</p><p><strong>Results: </strong>Overall, 101 images were determined to be positive, and 91 images were negative. Of the 101 positive images, the three raters were in complete agreement for 92 images and in disagreement for 9 images. Of the 91 negative images, the three raters were in complete agreement for 75 images and in disagreement for 16 images. Interrater reliability among the three experts was particularly high, with both Fleiss' kappa and Conger's kappa measuring 0.83 (0.76-0.89). The CL values of the unanimous positive group were significantly greater than those of the other groups, whereas the CL values of the unanimous negative group were significantly lower than those of the other groups. Images with rater disagreement had intermediate CLs. In cases with a high confidence level, the positive or negative visual ratings were in almost complete agreement. However, as confidence levels decreased, experts' visual ratings became more variable. The lower the confidence level was, the greater the number of cases with disagreement in the visual ratings.</p><p><strong>Conclusion: </strong>Three experts independently rated 192 amyloid PET images, achieving a high level of interrater agreement. However, in patients with intermediate amyloid accumulation, visual ratings varied. Therefore, determining positive and negative decisions in these patients should be performed with caution.</p>\",\"PeriodicalId\":8007,\"journal\":{\"name\":\"Annals of Nuclear Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Nuclear Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s12149-024-01977-7\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Nuclear Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12149-024-01977-7","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Interrater agreement and variability in visual reading of [18F] flutemetamol PET images.
Objective: The purpose of this study was to validate the concordance of visual ratings of [18F] flutemetamol amyloid positron emission tomography (PET) images and to investigate the correlation between the agreement of each rater and the Centiloid (CL) scale.
Methods: A total of 192 participants, clinically classified as cognitively normal (CN) (n = 59), mild cognitive impairment (MCI) (n = 65), Alzheimer's disease (AD) (n = 55), or non-AD dementia (n = 13), participated in this study. Three experts conducted visual ratings of the amyloid PET images for all 192 patients, assigning a confidence level to each rating on a three-point scale (certain, probable, or neither). The positive or negative determination of amyloid PET results was made by majority vote. The CL value was calculated using the CapAIBL pipeline.
Results: Overall, 101 images were determined to be positive, and 91 images were negative. Of the 101 positive images, the three raters were in complete agreement for 92 images and in disagreement for 9 images. Of the 91 negative images, the three raters were in complete agreement for 75 images and in disagreement for 16 images. Interrater reliability among the three experts was particularly high, with both Fleiss' kappa and Conger's kappa measuring 0.83 (0.76-0.89). The CL values of the unanimous positive group were significantly greater than those of the other groups, whereas the CL values of the unanimous negative group were significantly lower than those of the other groups. Images with rater disagreement had intermediate CLs. In cases with a high confidence level, the positive or negative visual ratings were in almost complete agreement. However, as confidence levels decreased, experts' visual ratings became more variable. The lower the confidence level was, the greater the number of cases with disagreement in the visual ratings.
Conclusion: Three experts independently rated 192 amyloid PET images, achieving a high level of interrater agreement. However, in patients with intermediate amyloid accumulation, visual ratings varied. Therefore, determining positive and negative decisions in these patients should be performed with caution.
期刊介绍:
Annals of Nuclear Medicine is an official journal of the Japanese Society of Nuclear Medicine. It develops the appropriate application of radioactive substances and stable nuclides in the field of medicine.
The journal promotes the exchange of ideas and information and research in nuclear medicine and includes the medical application of radionuclides and related subjects. It presents original articles, short communications, reviews and letters to the editor.