{"title":"评估人工智能(AI)大型语言模型回答问题能力的研究重复且过时:现在必须将人工智能应用于改善临床实践和患者护理。","authors":"Jacob F Oeding","doi":"10.1016/j.arthro.2024.10.020","DOIUrl":null,"url":null,"abstract":"<p><p>While artificial intelligence (AI) technologies like ChatGPT have demonstrated very real and powerful capabilities to date, this does not mean that research studying these technologies is immune from \"shiny object\" syndrome, a psychological phenomenon where we tend to focus on new and fashionable ideas only to be distracted from those that truly matter. In parallel with the increased publicity that AI has received since the release of large language models (LLMs) like ChatGPT has been an explosion in the number of studies evaluating LLMs' ability to answer hypothetical questions from patients on a variety of conditions. Nevertheless, these studies tend to leave us with the same conclusion: LLMs are generally capable of providing reliable and relevant responses to patient questions but are not without limitations. Given the abundance of studies demonstrating similar outcomes regardless of whether the LLMs are asked to respond to a patient's questions about their diabetes or about their shoulder dislocation, I'm afraid we are at risk of making AI more of a \"shiny object\" than a tool that can be used to change clinical practice and improve patient care. Specifically, we may be getting to a point where a \"publish or perish\" mindset has promoted studies with repetitive methodologies that only confirm well-established theories around the capabilities and limitations of AI and has created a distraction from new use-cases and more meaningful applications for patient care. We are now at a crossroads where we can either remain stuck in the past, repeating old studies' methodologies on a different procedure or injury, or progress by expanding the number and impact of applications these tools have in orthopaedic surgery. The capabilities of AI will continue to increase at rapid pace, but it will be up to those with intricate knowledge of orthopaedics and patient care to keep up.</p>","PeriodicalId":55459,"journal":{"name":"Arthroscopy-The Journal of Arthroscopic and Related Surgery","volume":" ","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Studies Evaluating Artificial Intelligence (AI) Large Language Models Ability to Respond to Questions Are Repetitive and Out-of-Date: AI Must Now Be Applied to Improving Clinical Practice and Patient Care.\",\"authors\":\"Jacob F Oeding\",\"doi\":\"10.1016/j.arthro.2024.10.020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>While artificial intelligence (AI) technologies like ChatGPT have demonstrated very real and powerful capabilities to date, this does not mean that research studying these technologies is immune from \\\"shiny object\\\" syndrome, a psychological phenomenon where we tend to focus on new and fashionable ideas only to be distracted from those that truly matter. In parallel with the increased publicity that AI has received since the release of large language models (LLMs) like ChatGPT has been an explosion in the number of studies evaluating LLMs' ability to answer hypothetical questions from patients on a variety of conditions. Nevertheless, these studies tend to leave us with the same conclusion: LLMs are generally capable of providing reliable and relevant responses to patient questions but are not without limitations. Given the abundance of studies demonstrating similar outcomes regardless of whether the LLMs are asked to respond to a patient's questions about their diabetes or about their shoulder dislocation, I'm afraid we are at risk of making AI more of a \\\"shiny object\\\" than a tool that can be used to change clinical practice and improve patient care. Specifically, we may be getting to a point where a \\\"publish or perish\\\" mindset has promoted studies with repetitive methodologies that only confirm well-established theories around the capabilities and limitations of AI and has created a distraction from new use-cases and more meaningful applications for patient care. We are now at a crossroads where we can either remain stuck in the past, repeating old studies' methodologies on a different procedure or injury, or progress by expanding the number and impact of applications these tools have in orthopaedic surgery. The capabilities of AI will continue to increase at rapid pace, but it will be up to those with intricate knowledge of orthopaedics and patient care to keep up.</p>\",\"PeriodicalId\":55459,\"journal\":{\"name\":\"Arthroscopy-The Journal of Arthroscopic and Related Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2024-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Arthroscopy-The Journal of Arthroscopic and Related Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.arthro.2024.10.020\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ORTHOPEDICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arthroscopy-The Journal of Arthroscopic and Related Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.arthro.2024.10.020","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
Studies Evaluating Artificial Intelligence (AI) Large Language Models Ability to Respond to Questions Are Repetitive and Out-of-Date: AI Must Now Be Applied to Improving Clinical Practice and Patient Care.
While artificial intelligence (AI) technologies like ChatGPT have demonstrated very real and powerful capabilities to date, this does not mean that research studying these technologies is immune from "shiny object" syndrome, a psychological phenomenon where we tend to focus on new and fashionable ideas only to be distracted from those that truly matter. In parallel with the increased publicity that AI has received since the release of large language models (LLMs) like ChatGPT has been an explosion in the number of studies evaluating LLMs' ability to answer hypothetical questions from patients on a variety of conditions. Nevertheless, these studies tend to leave us with the same conclusion: LLMs are generally capable of providing reliable and relevant responses to patient questions but are not without limitations. Given the abundance of studies demonstrating similar outcomes regardless of whether the LLMs are asked to respond to a patient's questions about their diabetes or about their shoulder dislocation, I'm afraid we are at risk of making AI more of a "shiny object" than a tool that can be used to change clinical practice and improve patient care. Specifically, we may be getting to a point where a "publish or perish" mindset has promoted studies with repetitive methodologies that only confirm well-established theories around the capabilities and limitations of AI and has created a distraction from new use-cases and more meaningful applications for patient care. We are now at a crossroads where we can either remain stuck in the past, repeating old studies' methodologies on a different procedure or injury, or progress by expanding the number and impact of applications these tools have in orthopaedic surgery. The capabilities of AI will continue to increase at rapid pace, but it will be up to those with intricate knowledge of orthopaedics and patient care to keep up.
期刊介绍:
Nowhere is minimally invasive surgery explained better than in Arthroscopy, the leading peer-reviewed journal in the field. Every issue enables you to put into perspective the usefulness of the various emerging arthroscopic techniques. The advantages and disadvantages of these methods -- along with their applications in various situations -- are discussed in relation to their efficiency, efficacy and cost benefit. As a special incentive, paid subscribers also receive access to the journal expanded website.