James Behrmann, Ellen M. Hong, Shannon Meledathu, Aliza Leiter, Michael Povelaitis, Mariela Mitre
{"title":"Chat generative pre-trained transformer’s performance on dermatology-specific questions and its implications in medical education","authors":"James Behrmann, Ellen M. Hong, Shannon Meledathu, Aliza Leiter, Michael Povelaitis, Mariela Mitre","doi":"10.21037/jmai-23-47","DOIUrl":null,"url":null,"abstract":"Background: Large language models (LLMs) like chat generative pre-trained transformer (ChatGPT) have gained popularity in healthcare by performing at or near the passing threshold for the United States Medical Licensing Exam (USMLE), but some limitations should be considered. Dermatology is a specialized medical field that relies heavily on visual recognition and images for diagnosis. This paper aimed to measure ChatGPT’s abilities to answer dermatology questions and compare this sub-specialty accuracy to its overall scores on USMLE Step exams.","PeriodicalId":73815,"journal":{"name":"Journal of medical artificial intelligence","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of medical artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21037/jmai-23-47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Large language models (LLMs) like chat generative pre-trained transformer (ChatGPT) have gained popularity in healthcare by performing at or near the passing threshold for the United States Medical Licensing Exam (USMLE), but some limitations should be considered. Dermatology is a specialized medical field that relies heavily on visual recognition and images for diagnosis. This paper aimed to measure ChatGPT’s abilities to answer dermatology questions and compare this sub-specialty accuracy to its overall scores on USMLE Step exams.