Jonathan Shapiro, Emily Avitan-Hersh, Binyamin Greenfield, Ziad Khamaysi, Roni P Dodiuk-Gad, Yuliya Valdman-Grinshpoun, Tamar Freud, Anna Lyakhovitsky
{"title":"The use of a ChatGPT-4-based chatbot in teledermatology: A retrospective exploratory study.","authors":"Jonathan Shapiro, Emily Avitan-Hersh, Binyamin Greenfield, Ziad Khamaysi, Roni P Dodiuk-Gad, Yuliya Valdman-Grinshpoun, Tamar Freud, Anna Lyakhovitsky","doi":"10.1111/ddg.15609","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and objectives: </strong>Integration of artificial intelligence in healthcare, particularly ChatGPT, is transforming medical diagnostics and may benefit teledermatology. This exploratory study compared image description and differential diagnosis generation by a ChatGPT-4 based chatbot with human teledermatologists.</p><p><strong>Patients and methods: </strong>This retrospective study compared 154 teledermatology consultations (December 2023-February 2024) with ChatGPT-4's performance in image descriptions and diagnoses. Diagnostic concordance was classified as \"Top1\" (exact match with the teledermatologist's diagnoses), \"Top3\" (correct diagnosis within one the top three diagnoses), and \"Partial\" (similar but not identical diagnoses). Image descriptions were rated and compared for quality parameters (location, color, size, morphology, and surrounding area), and accuracy (Yes, No, and Partial).</p><p><strong>Results: </strong>Out of 154 cases, ChatGPT-4 achieved a Top1 diagnostic concordance in 108 (70.8%), Top3 concordance in 137 (87.7%), partial concordance in four (2.6%), and was discordant in 15 (9.7%) cases. The quality of ChatGPT-4's image descriptions significantly surpassed teledermatologists in all five parameters. ChatGPT-4's descriptions were accurate in 130 (84.4%), partially accurate in 22 (14.3%), and inaccurate in two (1.3%) cases.</p><p><strong>Conclusions: </strong>The preliminary findings of this study indicate that ChatGPT-4 demonstrates potential in generating accurate image descriptions and differential diagnoses. These results highlight the promise of integrating artificial intelligence into asynchronous teledermatology workflows.</p>","PeriodicalId":14758,"journal":{"name":"Journal Der Deutschen Dermatologischen Gesellschaft","volume":" ","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2025-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal Der Deutschen Dermatologischen Gesellschaft","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/ddg.15609","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DERMATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and objectives: Integration of artificial intelligence in healthcare, particularly ChatGPT, is transforming medical diagnostics and may benefit teledermatology. This exploratory study compared image description and differential diagnosis generation by a ChatGPT-4 based chatbot with human teledermatologists.
Patients and methods: This retrospective study compared 154 teledermatology consultations (December 2023-February 2024) with ChatGPT-4's performance in image descriptions and diagnoses. Diagnostic concordance was classified as "Top1" (exact match with the teledermatologist's diagnoses), "Top3" (correct diagnosis within one the top three diagnoses), and "Partial" (similar but not identical diagnoses). Image descriptions were rated and compared for quality parameters (location, color, size, morphology, and surrounding area), and accuracy (Yes, No, and Partial).
Results: Out of 154 cases, ChatGPT-4 achieved a Top1 diagnostic concordance in 108 (70.8%), Top3 concordance in 137 (87.7%), partial concordance in four (2.6%), and was discordant in 15 (9.7%) cases. The quality of ChatGPT-4's image descriptions significantly surpassed teledermatologists in all five parameters. ChatGPT-4's descriptions were accurate in 130 (84.4%), partially accurate in 22 (14.3%), and inaccurate in two (1.3%) cases.
Conclusions: The preliminary findings of this study indicate that ChatGPT-4 demonstrates potential in generating accurate image descriptions and differential diagnoses. These results highlight the promise of integrating artificial intelligence into asynchronous teledermatology workflows.
期刊介绍:
The JDDG publishes scientific papers from a wide range of disciplines, such as dermatovenereology, allergology, phlebology, dermatosurgery, dermatooncology, and dermatohistopathology. Also in JDDG: information on medical training, continuing education, a calendar of events, book reviews and society announcements.
Papers can be submitted in German or English language. In the print version, all articles are published in German. In the online version, all key articles are published in English.