Three versions of an atopic dermatitis case report written by humans, artificial intelligence, or both: Identification of authorship and preferences

Mara Giavina Bianchi MD, PhD; Andrew D’adario MSc; Pedro Giavina Bianchi MD, PhD; Birajara Soares Machado PhD

The Journal of Allergy and Clinical Immunology: Global, Volume 4, Issue 1, Article 100373, February 2025
DOI: 10.1016/j.jacig.2024.100373
Citations: 0
Abstract
Background
The use of artificial intelligence (AI) in scientific writing is rapidly increasing, raising concerns about authorship identification, content quality, and writing efficiency.
Objectives
This study investigates the real-world impact of ChatGPT, a large language model, on these aspects in a simulated publication scenario.
Methods
Forty-eight individuals representing 3 levels of medical expertise (medical students, residents, and experts in allergy or dermatology) evaluated 3 blinded versions of an atopic dermatitis case report: one human written (HUM), one AI generated (AI), and one written by human and AI combined (COM). The survey asked participants to identify the authorship of each text, rank the versions by preference, and grade each on 13 quality criteria. The time taken to generate each manuscript was also recorded.
Results
Overall authorship identification accuracy was no better than chance (33%). Expert participants (50.9%) demonstrated significantly higher accuracy than residents (27.7%) and students (19.6%; P < .001). Participants preferred the AI-assisted versions (AI and COM) over HUM (P < .001), with COM receiving the highest quality scores. Compared with HUM, COM and AI reduced writing time by 83.8% and 84.3%, respectively, while improving quality by 13.9% (P < .001) and 11.1% (P < .001), respectively. However, experts assigned the lowest score to the references of the AI manuscript, potentially hindering its publication.
Conclusion
AI can deceptively mimic human writing, particularly for less experienced readers. Although AI-assisted writing is appealing and offers significant time savings, human oversight remains crucial to ensure accuracy, ethical considerations, and optimal quality. These findings underscore the need for transparency in AI use and highlight the potential of human-AI collaboration in the future of scientific writing.