生成式人工智能文本到图像描述医学生中的性别偏见

IF 1.2 4区医学 Q3 EDUCATION & EDUCATIONAL RESEARCH Health Education Journal Pub Date : 2024-08-28 DOI:10.1177/00178969241274621

Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis

{"title":"生成式人工智能文本到图像描述医学生中的性别偏见","authors":"Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis","doi":"10.1177/00178969241274621","DOIUrl":null,"url":null,"abstract":"Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.","PeriodicalId":47346,"journal":{"name":"Health Education Journal","volume":"1 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Gender bias in generative artificial intelligence text-to-image depiction of medical students\",\"authors\":\"Geoffrey Currie, Josie Currie, Sam Anderson, Johnathan Hewis\",\"doi\":\"10.1177/00178969241274621\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.\",\"PeriodicalId\":47346,\"journal\":{\"name\":\"Health Education Journal\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Education Journal\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/00178969241274621\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Education Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/00178969241274621","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}

引用次数: 0

摘要

导言：在澳大利亚，54.3% 的医科学生是女性，但她们在医学刻板印象中的代表性仍然不足。生成式人工智能（genAI）虽然具有潜在的变革性，但也有可能出现错误、歪曲和偏见。GenAI 文本到图像的生成可能会强化性别偏见，因此评估 DALL-E 3（通过 ChatGPT 支持的文本到图像 GenAI）对澳大利亚医科学生的表现非常重要。方法：2024 年 3 月，DALL-E 3 通过 GPT-4 生成了一系列医科学生的个人和群体图像，特别是澳大利亚本科医科学生，以消除潜在的混杂因素。使用各种提示多次重复生成图像。总共生成了 47 幅用于评估的图像，其中 33 幅为单个人物，其余 14 幅由多个（5 至 67 个）人物组成。所有图像均由三名审查员独立分析，以确定明显的性别和肤色。因此，有 33 个特征个体接受了评估，另有 417 个人物群体接受了评估（N = 448）。结果：总体而言（个体和群体图像），58.8%（N = 258）的医学生为男性，39.9%（N = 175）为女性，92.0%（N = 404）为浅肤色，7.7%（N = 34）为中肤色，0%为深肤色。在统计上，单个图像、群体图像和集体图像的性别分布与实际澳大利亚医学生的性别分布有显著差异。在医学生个人图像（25 张）中，DALL-E 3 生成的图像中有 92% （23 张）为男性，100% 为浅肤色（25 张）。生成的图像中白人男性医学生的比例过高，这并不代表澳大利亚医学生的多样性。在使用 DALL-E 3 生成用于教育或宣传目的的医学生图像时应谨慎。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Gender bias in generative artificial intelligence text-to-image depiction of medical students

Introduction:In Australia, 54.3% of medical students are women yet they remain under-represented in stereotypical perspectives of medicine. While potentially transformative, generative artificial intelligence (genAI) has the potential for errors, misrepresentations and bias. GenAI text-to-image production could reinforce gender biases making it important to evaluate DALL-E 3 (the text-to-image genAI supported through ChatGPT) representations of Australian medical students.Method:In March 2024, DALL-E 3 was utilised via GPT-4 to generate a series of individual and group images of medical students, specifically Australian undergraduate medical students to eliminate potential confounders. Multiple iterations of images were generated using a variety of prompts. Collectively, 47 images were produced for evaluation of which 33 were individual characters and the remaining 14 images were comprised of multiple (5 to 67) characters. All images were independently analysed by three reviewers for apparent gender and skin tone. Consequently, 33 feature individuals were evaluated and a further 417 characters in groups were evaluated ( N = 448). Discrepancies in responses were resolved by consensus.Results:Collectively (individual and group images), 58.8% ( N = 258) of medical students were depicted as men, 39.9% ( N = 175) as women, 92.0% ( N = 404) with a light skin tone, 7.7% ( N = 34) with mid skin tone and 0% with dark skin tone. The gender distribution was a statistically significant variation from that of actual Australian medical students for individual images, for group images and for collective images. Among the images of individual medical students ( N = 25), DALL-E 3 generated 92% ( N = 23) as men and 100% were of light skin tone ( N = 25).Conclusion:This evaluation reveals the gender associated with genAI text-to-image generation using DALL-E 3 among Australian undergraduate medical students. Generated images included a disproportionately high proportion of white male medical students which is not representative of the diversity of medical students in Australia. The use of DALL-E 3 to produce depictions of medical students for education or promotion purposes should be done with caution.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Health Education Journal Multiple-

CiteScore

2.30

自引率

0.00%

发文量

期刊介绍： Health Education Journal is a leading peer reviewed journal established in 1943. It carries original papers on health promotion and education research, policy development and good practice.