OpenAI's Sora and Google's Veo 2 in Action: A Narrative Review of Artificial Intelligence-driven Video Generation Models Transforming Healthcare.

IF 1 Q3 MEDICINE, GENERAL & INTERNAL Cureus Pub Date : 2025-01-17 eCollection Date: 2025-01-01 DOI:10.7759/cureus.77593
Mohamad-Hani Temsah, Rakan Nazer, Ibraheem Altamimi, Raniah Aldekhyyel, Amr Jamal, Mohammad Almansour, Fadi Aljamaan, Khalid Alhasan, Abdulkarim A Temsah, Ayman Al-Eyadhy, Bandar N Aljafen, Khalid H Malki
{"title":"OpenAI's Sora and Google's Veo 2 in Action: A Narrative Review of Artificial Intelligence-driven Video Generation Models Transforming Healthcare.","authors":"Mohamad-Hani Temsah, Rakan Nazer, Ibraheem Altamimi, Raniah Aldekhyyel, Amr Jamal, Mohammad Almansour, Fadi Aljamaan, Khalid Alhasan, Abdulkarim A Temsah, Ayman Al-Eyadhy, Bandar N Aljafen, Khalid H Malki","doi":"10.7759/cureus.77593","DOIUrl":null,"url":null,"abstract":"<p><p>The rapid evolution of generative artificial intelligence (AI) has introduced transformative technologies across various domains, with text-to-video (T2V) generation models emerging as transformative innovations in the field. This narrative review explores the potential of T2V AI generation models used in healthcare, focusing on their applications, challenges, and future directions. Advanced T2V platforms, such as Sora Turbo (OpenAI, Inc., San Francisco, California, United States) and Veo 2 (Google LLC, Mountain View, California, United States), both announced in December 2024, offer the capability to generate high-fidelity video contents. Such models could revolutionize healthcare by providing tailored videos for patient education, enhancing medical training, and possibly optimizing telemedicine. We conducted a comprehensive narrative literature search of databases including PubMed and Google Scholar, and identified 41 relevant studies published between 2020 and 2024. Our findings reveal significant possible benefits in improving patient education, standardizing customized medical training, and enhancing remote medical consultations. However, critical challenges persist, including risks of misinformation (or deepfake), privacy breaches, ethical concerns, and limitations in video authenticity. Detection mechanisms for deepfakes and regulatory frameworks remain underdeveloped, necessitating further interdisciplinary research and vigilant policy development. Future advancements in T2V AI generation models could enable real-time healthcare visualizations and augmented reality training. However, achieving these benefits will require addressing accessibility challenges to ensure equitable implementation and prevent potential disparities. By addressing these challenges and fostering collaboration among stakeholders, healthcare systems and AI technologists, T2V AI generation models could transform global healthcare into a more effective, universal, and innovative system while safeguarding against its potential misuse.</p>","PeriodicalId":93960,"journal":{"name":"Cureus","volume":"17 1","pages":"e77593"},"PeriodicalIF":1.0000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11741145/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cureus","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7759/cureus.77593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

The rapid evolution of generative artificial intelligence (AI) has introduced transformative technologies across various domains, with text-to-video (T2V) generation models emerging as transformative innovations in the field. This narrative review explores the potential of T2V AI generation models used in healthcare, focusing on their applications, challenges, and future directions. Advanced T2V platforms, such as Sora Turbo (OpenAI, Inc., San Francisco, California, United States) and Veo 2 (Google LLC, Mountain View, California, United States), both announced in December 2024, offer the capability to generate high-fidelity video contents. Such models could revolutionize healthcare by providing tailored videos for patient education, enhancing medical training, and possibly optimizing telemedicine. We conducted a comprehensive narrative literature search of databases including PubMed and Google Scholar, and identified 41 relevant studies published between 2020 and 2024. Our findings reveal significant possible benefits in improving patient education, standardizing customized medical training, and enhancing remote medical consultations. However, critical challenges persist, including risks of misinformation (or deepfake), privacy breaches, ethical concerns, and limitations in video authenticity. Detection mechanisms for deepfakes and regulatory frameworks remain underdeveloped, necessitating further interdisciplinary research and vigilant policy development. Future advancements in T2V AI generation models could enable real-time healthcare visualizations and augmented reality training. However, achieving these benefits will require addressing accessibility challenges to ensure equitable implementation and prevent potential disparities. By addressing these challenges and fostering collaboration among stakeholders, healthcare systems and AI technologists, T2V AI generation models could transform global healthcare into a more effective, universal, and innovative system while safeguarding against its potential misuse.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
OpenAI的Sora和b谷歌的Veo 2在行动:人工智能驱动的视频生成模型改变医疗保健的叙述回顾。
生成式人工智能(AI)的快速发展已经在各个领域引入了变革性技术,其中文本到视频(T2V)生成模型正在成为该领域的变革性创新。本文将探讨T2V人工智能生成模型在医疗保健领域的潜力,重点关注其应用、挑战和未来方向。先进的T2V平台,如Sora Turbo (OpenAI, Inc.,旧金山,加利福尼亚州,美国)和Veo 2(谷歌LLC,山景城,加利福尼亚州,美国),都在2024年12月宣布,提供生成高保真视频内容的能力。这些模型可以为患者教育提供量身定制的视频,加强医疗培训,并可能优化远程医疗,从而彻底改变医疗保健。我们对PubMed和谷歌Scholar等数据库进行了全面的叙述性文献检索,筛选出了2020 - 2024年间发表的41篇相关研究。我们的研究结果揭示了在改善患者教育、标准化定制医疗培训和加强远程医疗咨询方面可能带来的重大好处。然而,关键的挑战仍然存在,包括错误信息(或深度造假)的风险、隐私泄露、道德问题和视频真实性的限制。深度造假的检测机制和监管框架仍然不发达,需要进一步的跨学科研究和警惕的政策制定。T2V人工智能生成模型的未来发展可以实现实时医疗可视化和增强现实培训。然而,要实现这些好处,就需要解决无障碍挑战,以确保公平实施并防止潜在的差距。通过应对这些挑战并促进利益相关者、医疗系统和人工智能技术人员之间的合作,T2V人工智能生成模型可以将全球医疗保健转变为更有效、更普遍和更创新的系统,同时防止潜在的滥用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Comparative Efficacy and Safety of Netarsudil-Containing Interventions for Intraocular Pressure Control: A Systematic Review and Network Meta-Analysis. Dr. Krishna Manohar Soman Rema: A Legacy of Innovation, Education, and Compassion in Pediatric Cardiac Surgery. Secondary Organizing Pneumonia: A Case Report of a Noteworthy Complication of Breast Irradiation. Comparative Assessment of Antibiotics and Probiotics: Adjuvants in Nonsurgical Periodontal Treatment for Smokers With Generalized Periodontitis. Memory-Enhancing and Anxiolytic Effects of the Rose of Jericho on Sleep Deprivation-Related Cognitive and Behavioral Changes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1