Three versions of an atopic dermatitis case report written by humans, artificial intelligence, or both: Identification of authorship and preferences

Mara Giavina Bianchi MD, PhD , Andrew D’adario MSc , Pedro Giavina Bianchi MD, PhD , Birajara Soares Machado PhD
{"title":"Three versions of an atopic dermatitis case report written by humans, artificial intelligence, or both: Identification of authorship and preferences","authors":"Mara Giavina Bianchi MD, PhD ,&nbsp;Andrew D’adario MSc ,&nbsp;Pedro Giavina Bianchi MD, PhD ,&nbsp;Birajara Soares Machado PhD","doi":"10.1016/j.jacig.2024.100373","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The use of artificial intelligence (AI) in scientific writing is rapidly increasing, raising concerns about authorship identification, content quality, and writing efficiency.</div></div><div><h3>Objectives</h3><div>This study investigates the real-world impact of ChatGPT, a large language model, on those aspects in a simulated publication scenario.</div></div><div><h3>Methods</h3><div>Forty-eight individuals representing 3 medical expertise levels (medical students, residents, and experts in allergy or dermatology) evaluated 3 blinded versions of an atopic dermatitis case report: one each human written (HUM), AI generated (AI), and combined written (COM). The survey assessed authorship, ranked their preference, and graded 13 quality criteria for each text. Time taken to generate each manuscript was also recorded.</div></div><div><h3>Results</h3><div>Authorship identification accuracy mirrored the odds at 33%. Expert participants (50.9%) demonstrated significantly higher accuracy compared to residents (27.7%) and students (19.6%, <em>P</em> &lt; .001). Participants favored AI-assisted versions (AI and COM) over HUM (<em>P</em> &lt; .001), with COM receiving the highest quality scores. COM and AI achieved 83.8% and 84.3% reduction in writing time, respectively, compared to HUM, while showing 13.9% (<em>P</em> &lt; .001) and 11.1% improvement in quality (<em>P</em> &lt; .001), respectively. However, experts assigned the lowest score for the references of the AI manuscript, potentially hindering its publication.</div></div><div><h3>Conclusion</h3><div>AI can deceptively mimic human writing, particularly for less experienced readers. Although AI-assisted writing is appealing and offers significant time savings, human oversight remains crucial to ensure accuracy, ethical considerations, and optimal quality. These findings underscore the need for transparency in AI use and highlight the potential of human-AI collaboration in the future of scientific writing.</div></div>","PeriodicalId":75041,"journal":{"name":"The journal of allergy and clinical immunology. Global","volume":"4 1","pages":"Article 100373"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11699452/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of allergy and clinical immunology. Global","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772829324001693","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background

The use of artificial intelligence (AI) in scientific writing is rapidly increasing, raising concerns about authorship identification, content quality, and writing efficiency.

Objectives

This study investigates the real-world impact of ChatGPT, a large language model, on those aspects in a simulated publication scenario.

Methods

Forty-eight individuals representing 3 medical expertise levels (medical students, residents, and experts in allergy or dermatology) evaluated 3 blinded versions of an atopic dermatitis case report: one each human written (HUM), AI generated (AI), and combined written (COM). The survey assessed authorship, ranked their preference, and graded 13 quality criteria for each text. Time taken to generate each manuscript was also recorded.

Results

Authorship identification accuracy mirrored the odds at 33%. Expert participants (50.9%) demonstrated significantly higher accuracy compared to residents (27.7%) and students (19.6%, P < .001). Participants favored AI-assisted versions (AI and COM) over HUM (P < .001), with COM receiving the highest quality scores. COM and AI achieved 83.8% and 84.3% reduction in writing time, respectively, compared to HUM, while showing 13.9% (P < .001) and 11.1% improvement in quality (P < .001), respectively. However, experts assigned the lowest score for the references of the AI manuscript, potentially hindering its publication.

Conclusion

AI can deceptively mimic human writing, particularly for less experienced readers. Although AI-assisted writing is appealing and offers significant time savings, human oversight remains crucial to ensure accuracy, ethical considerations, and optimal quality. These findings underscore the need for transparency in AI use and highlight the potential of human-AI collaboration in the future of scientific writing.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
由人类、人工智能或两者共同撰写的特应性皮炎病例报告的三个版本:作者和偏好的识别。
背景:人工智能(AI)在科学写作中的应用正在迅速增加,这引起了人们对作者身份识别、内容质量和写作效率的担忧。目的:本研究调查了ChatGPT(一个大型语言模型)在模拟出版场景中对这些方面的现实影响。方法:48名代表3种医学专业水平(医学生、住院医师和过敏或皮肤病学专家)的个体评估了一份特应性皮炎病例报告的3种盲法版本:人类书面(HUM)、人工智能生成(AI)和联合书面(COM)。该调查评估了作者身份,对他们的偏好进行了排序,并对每篇文章的质量标准进行了13次评分。生成每份手稿所花费的时间也被记录下来。结果:作者身份识别的准确率为33%。专家参与者(50.9%)比居民(27.7%)和学生(19.6%,P < .001)表现出更高的准确性。参与者更喜欢AI辅助版本(AI和COM)而不是HUM (P < 0.001), COM获得最高的质量分数。与HUM相比,COM和AI的写入时间分别减少了83.8%和84.3%,而质量分别提高了13.9% (P < 0.001)和11.1% (P < 0.001)。然而,专家们给人工智能手稿的参考文献打分最低,这可能会阻碍其发表。结论:人工智能可以欺骗性地模仿人类写作,特别是对于经验不足的读者。尽管人工智能辅助写作很有吸引力,而且可以节省大量时间,但人类的监督仍然是确保准确性、道德考虑和最佳质量的关键。这些发现强调了人工智能使用透明度的必要性,并强调了人类与人工智能合作在未来科学写作中的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
The journal of allergy and clinical immunology. Global
The journal of allergy and clinical immunology. Global Immunology, Allergology and Rheumatology
CiteScore
0.70
自引率
0.00%
发文量
0
审稿时长
92 days
期刊最新文献
Reply Editorial board A different lens to explore health literacy skills in atopic conditions Anti–IL-4Ra therapy is superior to other biologic classes in treating allergic bronchopulmonary aspergillosis Investigating health care pathways for anaphylaxis: Seeing the bigger picture through linked data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1