Kris Lami, Andrey Bychkov, Keitaro Matsumoto, Richard Attanoos, Sabina Berezowska, Luka Brcic, Alberto Cavazza, John C English, Alexandre Todorovic Fabro, Kaori Ishida, Yukio Kashima, Brandon T Larsen, Alberto M Marchevsky, Takuro Miyazaki, Shimpei Morimoto, Anja C Roden, Frank Schneider, Mano Soshi, Maxwell L Smith, Kazuhiro Tabata, Angela M Takano, Kei Tanaka, Tomonori Tanaka, Tomoshi Tsuchiya, Takeshi Nagayasu, Junya Fukuoka
{"title":"克服肺腺癌亚型中观察者之间的差异:一种为下游应用建立基础事实的聚类方法。","authors":"Kris Lami, Andrey Bychkov, Keitaro Matsumoto, Richard Attanoos, Sabina Berezowska, Luka Brcic, Alberto Cavazza, John C English, Alexandre Todorovic Fabro, Kaori Ishida, Yukio Kashima, Brandon T Larsen, Alberto M Marchevsky, Takuro Miyazaki, Shimpei Morimoto, Anja C Roden, Frank Schneider, Mano Soshi, Maxwell L Smith, Kazuhiro Tabata, Angela M Takano, Kei Tanaka, Tomonori Tanaka, Tomoshi Tsuchiya, Takeshi Nagayasu, Junya Fukuoka","doi":"10.5858/arpa.2022-0051-OA","DOIUrl":null,"url":null,"abstract":"<p><strong>Context.—: </strong>The accurate identification of different lung adenocarcinoma histologic subtypes is important for determining prognosis but can be challenging because of overlaps in the diagnostic features, leading to considerable interobserver variability.</p><p><strong>Objective.—: </strong>To provide an overview of the diagnostic agreement for lung adenocarcinoma subtypes among pathologists and to create a ground truth using the clustering approach for downstream computational applications.</p><p><strong>Design.—: </strong>Three sets of lung adenocarcinoma histologic images with different evaluation levels (small patches, areas with relatively uniform histology, and whole slide images) were reviewed by 17 international expert lung pathologists and 1 pathologist in training. Each image was classified into one or several lung adenocarcinoma subtypes.</p><p><strong>Results.—: </strong>Among the 4702 patches of the first set, 1742 (37%) had an overall consensus among all pathologists. The overall Fleiss κ score for the agreement of all subtypes was 0.58. Using cluster analysis, pathologists were hierarchically grouped into 2 clusters, with κ scores of 0.588 and 0.563 in clusters 1 and 2, respectively. Similar results were obtained for the second and third sets, with fair-to-moderate agreements. Patches from the first 2 sets that obtained the consensus of the 18 pathologists were retrieved to form consensus patches and were regarded as the ground truth of lung adenocarcinoma subtypes.</p><p><strong>Conclusions.—: </strong>Our observations highlight discrepancies among experts when assessing lung adenocarcinoma subtypes. However, a subsequent number of consensus patches could be retrieved from each cluster, which can be used as ground truth for the downstream computational pathology applications, with minimal influence from interobserver variability.</p>","PeriodicalId":8305,"journal":{"name":"Archives of pathology & laboratory medicine","volume":"147 8","pages":"885-895"},"PeriodicalIF":3.7000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Overcoming the Interobserver Variability in Lung Adenocarcinoma Subtyping: A Clustering Approach to Establish a Ground Truth for Downstream Applications.\",\"authors\":\"Kris Lami, Andrey Bychkov, Keitaro Matsumoto, Richard Attanoos, Sabina Berezowska, Luka Brcic, Alberto Cavazza, John C English, Alexandre Todorovic Fabro, Kaori Ishida, Yukio Kashima, Brandon T Larsen, Alberto M Marchevsky, Takuro Miyazaki, Shimpei Morimoto, Anja C Roden, Frank Schneider, Mano Soshi, Maxwell L Smith, Kazuhiro Tabata, Angela M Takano, Kei Tanaka, Tomonori Tanaka, Tomoshi Tsuchiya, Takeshi Nagayasu, Junya Fukuoka\",\"doi\":\"10.5858/arpa.2022-0051-OA\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Context.—: </strong>The accurate identification of different lung adenocarcinoma histologic subtypes is important for determining prognosis but can be challenging because of overlaps in the diagnostic features, leading to considerable interobserver variability.</p><p><strong>Objective.—: </strong>To provide an overview of the diagnostic agreement for lung adenocarcinoma subtypes among pathologists and to create a ground truth using the clustering approach for downstream computational applications.</p><p><strong>Design.—: </strong>Three sets of lung adenocarcinoma histologic images with different evaluation levels (small patches, areas with relatively uniform histology, and whole slide images) were reviewed by 17 international expert lung pathologists and 1 pathologist in training. Each image was classified into one or several lung adenocarcinoma subtypes.</p><p><strong>Results.—: </strong>Among the 4702 patches of the first set, 1742 (37%) had an overall consensus among all pathologists. The overall Fleiss κ score for the agreement of all subtypes was 0.58. Using cluster analysis, pathologists were hierarchically grouped into 2 clusters, with κ scores of 0.588 and 0.563 in clusters 1 and 2, respectively. Similar results were obtained for the second and third sets, with fair-to-moderate agreements. Patches from the first 2 sets that obtained the consensus of the 18 pathologists were retrieved to form consensus patches and were regarded as the ground truth of lung adenocarcinoma subtypes.</p><p><strong>Conclusions.—: </strong>Our observations highlight discrepancies among experts when assessing lung adenocarcinoma subtypes. However, a subsequent number of consensus patches could be retrieved from each cluster, which can be used as ground truth for the downstream computational pathology applications, with minimal influence from interobserver variability.</p>\",\"PeriodicalId\":8305,\"journal\":{\"name\":\"Archives of pathology & laboratory medicine\",\"volume\":\"147 8\",\"pages\":\"885-895\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Archives of pathology & laboratory medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.5858/arpa.2022-0051-OA\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of pathology & laboratory medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.5858/arpa.2022-0051-OA","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
Overcoming the Interobserver Variability in Lung Adenocarcinoma Subtyping: A Clustering Approach to Establish a Ground Truth for Downstream Applications.
Context.—: The accurate identification of different lung adenocarcinoma histologic subtypes is important for determining prognosis but can be challenging because of overlaps in the diagnostic features, leading to considerable interobserver variability.
Objective.—: To provide an overview of the diagnostic agreement for lung adenocarcinoma subtypes among pathologists and to create a ground truth using the clustering approach for downstream computational applications.
Design.—: Three sets of lung adenocarcinoma histologic images with different evaluation levels (small patches, areas with relatively uniform histology, and whole slide images) were reviewed by 17 international expert lung pathologists and 1 pathologist in training. Each image was classified into one or several lung adenocarcinoma subtypes.
Results.—: Among the 4702 patches of the first set, 1742 (37%) had an overall consensus among all pathologists. The overall Fleiss κ score for the agreement of all subtypes was 0.58. Using cluster analysis, pathologists were hierarchically grouped into 2 clusters, with κ scores of 0.588 and 0.563 in clusters 1 and 2, respectively. Similar results were obtained for the second and third sets, with fair-to-moderate agreements. Patches from the first 2 sets that obtained the consensus of the 18 pathologists were retrieved to form consensus patches and were regarded as the ground truth of lung adenocarcinoma subtypes.
Conclusions.—: Our observations highlight discrepancies among experts when assessing lung adenocarcinoma subtypes. However, a subsequent number of consensus patches could be retrieved from each cluster, which can be used as ground truth for the downstream computational pathology applications, with minimal influence from interobserver variability.
期刊介绍:
Welcome to the website of the Archives of Pathology & Laboratory Medicine (APLM). This monthly, peer-reviewed journal of the College of American Pathologists offers global reach and highest measured readership among pathology journals.
Published since 1926, ARCHIVES was voted in 2009 the only pathology journal among the top 100 most influential journals of the past 100 years by the BioMedical and Life Sciences Division of the Special Libraries Association. Online access to the full-text and PDF files of APLM articles is free.