Mertcan Sevgi, Eden Ruffell, Fares Antaki, Mark A Chia, Pearse A Keane
{"title":"Foundation models in ophthalmology: opportunities and challenges.","authors":"Mertcan Sevgi, Eden Ruffell, Fares Antaki, Mark A Chia, Pearse A Keane","doi":"10.1097/ICU.0000000000001091","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose of review: </strong>Last year marked the development of the first foundation model in ophthalmology, RETFound, setting the stage for generalizable medical artificial intelligence (GMAI) that can adapt to novel tasks. Additionally, rapid advancements in large language model (LLM) technology, including models such as GPT-4 and Gemini, have been tailored for medical specialization and evaluated on clinical scenarios with promising results. This review explores the opportunities and challenges for further advancements in these technologies.</p><p><strong>Recent findings: </strong>RETFound outperforms traditional deep learning models in specific tasks, even when only fine-tuned on small datasets. Additionally, LMMs like Med-Gemini and Medprompt GPT-4 perform better than out-of-the-box models for ophthalmology tasks. However, there is still a significant deficiency in ophthalmology-specific multimodal models. This gap is primarily due to the substantial computational resources required to train these models and the limitations of high-quality ophthalmology datasets.</p><p><strong>Summary: </strong>Overall, foundation models in ophthalmology present promising opportunities but face challenges, particularly the need for high-quality, standardized datasets for training and specialization. Although development has primarily focused on large language and vision models, the greatest opportunities lie in advancing large multimodal models, which can more closely mimic the capabilities of clinicians.</p>","PeriodicalId":50604,"journal":{"name":"Current Opinion in Ophthalmology","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Opinion in Ophthalmology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/ICU.0000000000001091","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose of review: Last year marked the development of the first foundation model in ophthalmology, RETFound, setting the stage for generalizable medical artificial intelligence (GMAI) that can adapt to novel tasks. Additionally, rapid advancements in large language model (LLM) technology, including models such as GPT-4 and Gemini, have been tailored for medical specialization and evaluated on clinical scenarios with promising results. This review explores the opportunities and challenges for further advancements in these technologies.
Recent findings: RETFound outperforms traditional deep learning models in specific tasks, even when only fine-tuned on small datasets. Additionally, LMMs like Med-Gemini and Medprompt GPT-4 perform better than out-of-the-box models for ophthalmology tasks. However, there is still a significant deficiency in ophthalmology-specific multimodal models. This gap is primarily due to the substantial computational resources required to train these models and the limitations of high-quality ophthalmology datasets.
Summary: Overall, foundation models in ophthalmology present promising opportunities but face challenges, particularly the need for high-quality, standardized datasets for training and specialization. Although development has primarily focused on large language and vision models, the greatest opportunities lie in advancing large multimodal models, which can more closely mimic the capabilities of clinicians.
期刊介绍:
Current Opinion in Ophthalmology is an indispensable resource featuring key up-to-date and important advances in the field from around the world. With renowned guest editors for each section, every bimonthly issue of Current Opinion in Ophthalmology delivers a fresh insight into topics such as glaucoma, refractive surgery and corneal and external disorders. With ten sections in total, the journal provides a convenient and thorough review of the field and will be of interest to researchers, clinicians and other healthcare professionals alike.