Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis
{"title":"Gender bias in generative artificial intelligence text-to-image depiction of medical students","authors":"Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis","doi":"10.1177/00178969241274621","DOIUrl":null,"url":null,"abstract":"Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.","PeriodicalId":47346,"journal":{"name":"Health Education Journal","volume":"1 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Education Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/00178969241274621","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.
期刊介绍:
Health Education Journal is a leading peer reviewed journal established in 1943. It carries original papers on health promotion and education research, policy development and good practice.