{"title":"日本医生在乳房 X 射线照相术中对乳房构成进行定性视觉评估的观察者之间和观察者内部的差异:日本首次多机构观察者绩效研究","authors":"Yoichi Koyama, Kazuaki Nakashima, Shunichiro Orihara, Hiroko Tsunoda, Fuyo Kimura, Natsuki Uenaka, Kanako Ban, Yukiko Michishita, Yoshihide Kanemaki, Arisa Kurihara, Kanae Tawaraya, Masataka Taguri, Takashi Ishikawa, Takayoshi Uematsu","doi":"10.1007/s12282-024-01580-8","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Background</h3><p>Visual assessment of mammographic breast composition remains the most common worldwide, although subjective variability limits its reproducibility. This study aimed to investigate the inter- and intra-observer variability in qualitative visual assessment of mammographic breast composition through a multi-institutional observer performance study for the first time in Japan.</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>This study enrolled 10 Japanese physicians from five different institutions. They used the new Japanese breast-composition classification system 4th edition to subjectively evaluate the breast composition in 200 pairs of right and left normal mediolateral oblique mammograms (number determined using precise sample size calculations) twice, with a 1-month interval (median patient age: 59 years [range 40–69 years]). The primary endpoint of this study was the inter-observer variability using kappa (<i>κ</i>) value.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>Inter-observer variability for the four and two classes of breast-composition assessment revealed moderate agreement (Fleiss’ <i>κ</i>: first and second reading = 0.553 and 0.587, respectively) and substantial agreement (Fleiss’ κ: first and second reading = 0.689 and 0.70, respectively). Intra-observer variability for the four and two classes of breast-composition assessment demonstrated substantial agreement (Cohen’s <i>κ</i>, median = 0.758) and almost perfect agreement (Cohen’s <i>κ</i>, median = 0.813). Assessments of consensus between the 10 physicians and the automated software Volpara® revealed slight agreement (Cohen’s <i>κ</i>; first and second reading: 0.104 and 0.075, respectively).</p><h3 data-test=\"abstract-sub-heading\">Conclusions</h3><p>Qualitative visual assessment of mammographic breast composition using the new Japanese classification revealed excellent intra-observer reproducibility. However, persistent inter-observer variability, presenting a challenge in establishing it as the gold standard in Japan.</p>","PeriodicalId":56083,"journal":{"name":"Breast Cancer","volume":null,"pages":null},"PeriodicalIF":4.0000,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Inter- and intra-observer variability of qualitative visual breast-composition assessment in mammography among Japanese physicians: a first multi-institutional observer performance study in Japan\",\"authors\":\"Yoichi Koyama, Kazuaki Nakashima, Shunichiro Orihara, Hiroko Tsunoda, Fuyo Kimura, Natsuki Uenaka, Kanako Ban, Yukiko Michishita, Yoshihide Kanemaki, Arisa Kurihara, Kanae Tawaraya, Masataka Taguri, Takashi Ishikawa, Takayoshi Uematsu\",\"doi\":\"10.1007/s12282-024-01580-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Background</h3><p>Visual assessment of mammographic breast composition remains the most common worldwide, although subjective variability limits its reproducibility. This study aimed to investigate the inter- and intra-observer variability in qualitative visual assessment of mammographic breast composition through a multi-institutional observer performance study for the first time in Japan.</p><h3 data-test=\\\"abstract-sub-heading\\\">Methods</h3><p>This study enrolled 10 Japanese physicians from five different institutions. They used the new Japanese breast-composition classification system 4th edition to subjectively evaluate the breast composition in 200 pairs of right and left normal mediolateral oblique mammograms (number determined using precise sample size calculations) twice, with a 1-month interval (median patient age: 59 years [range 40–69 years]). The primary endpoint of this study was the inter-observer variability using kappa (<i>κ</i>) value.</p><h3 data-test=\\\"abstract-sub-heading\\\">Results</h3><p>Inter-observer variability for the four and two classes of breast-composition assessment revealed moderate agreement (Fleiss’ <i>κ</i>: first and second reading = 0.553 and 0.587, respectively) and substantial agreement (Fleiss’ κ: first and second reading = 0.689 and 0.70, respectively). Intra-observer variability for the four and two classes of breast-composition assessment demonstrated substantial agreement (Cohen’s <i>κ</i>, median = 0.758) and almost perfect agreement (Cohen’s <i>κ</i>, median = 0.813). Assessments of consensus between the 10 physicians and the automated software Volpara® revealed slight agreement (Cohen’s <i>κ</i>; first and second reading: 0.104 and 0.075, respectively).</p><h3 data-test=\\\"abstract-sub-heading\\\">Conclusions</h3><p>Qualitative visual assessment of mammographic breast composition using the new Japanese classification revealed excellent intra-observer reproducibility. However, persistent inter-observer variability, presenting a challenge in establishing it as the gold standard in Japan.</p>\",\"PeriodicalId\":56083,\"journal\":{\"name\":\"Breast Cancer\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Breast Cancer\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s12282-024-01580-8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Breast Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12282-024-01580-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
Inter- and intra-observer variability of qualitative visual breast-composition assessment in mammography among Japanese physicians: a first multi-institutional observer performance study in Japan
Background
Visual assessment of mammographic breast composition remains the most common worldwide, although subjective variability limits its reproducibility. This study aimed to investigate the inter- and intra-observer variability in qualitative visual assessment of mammographic breast composition through a multi-institutional observer performance study for the first time in Japan.
Methods
This study enrolled 10 Japanese physicians from five different institutions. They used the new Japanese breast-composition classification system 4th edition to subjectively evaluate the breast composition in 200 pairs of right and left normal mediolateral oblique mammograms (number determined using precise sample size calculations) twice, with a 1-month interval (median patient age: 59 years [range 40–69 years]). The primary endpoint of this study was the inter-observer variability using kappa (κ) value.
Results
Inter-observer variability for the four and two classes of breast-composition assessment revealed moderate agreement (Fleiss’ κ: first and second reading = 0.553 and 0.587, respectively) and substantial agreement (Fleiss’ κ: first and second reading = 0.689 and 0.70, respectively). Intra-observer variability for the four and two classes of breast-composition assessment demonstrated substantial agreement (Cohen’s κ, median = 0.758) and almost perfect agreement (Cohen’s κ, median = 0.813). Assessments of consensus between the 10 physicians and the automated software Volpara® revealed slight agreement (Cohen’s κ; first and second reading: 0.104 and 0.075, respectively).
Conclusions
Qualitative visual assessment of mammographic breast composition using the new Japanese classification revealed excellent intra-observer reproducibility. However, persistent inter-observer variability, presenting a challenge in establishing it as the gold standard in Japan.
期刊介绍:
Breast Cancer, the official journal of the Japanese Breast Cancer Society, publishes articles that contribute to progress in the field, in basic or translational research and also in clinical research, seeking to develop a new focus and new perspectives for all who are concerned with breast cancer. The journal welcomes all original articles describing clinical and epidemiological studies and laboratory investigations regarding breast cancer and related diseases. The journal will consider five types of articles: editorials, review articles, original articles, case reports, and rapid communications. Although editorials and review articles will principally be solicited by the editors, they can also be submitted for peer review, as in the case of original articles. The journal provides the best of up-to-date information on breast cancer, presenting readers with high-impact, original work focusing on pivotal issues.