{"title":"从患者角度评价chatgpt - 40对髋关节镜问题的回答。","authors":"Gökhan Ayık, Niyazi Ercan, Yunus Demirtaş, Tuğrul Yıldırım, Gökhan Çakmak","doi":"10.52312/jdrs.2025.1961","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study aimed to evaluate the responses provided by ChatGPT-4o to the most frequently asked questions by patients regarding hip arthroscopy.</p><p><strong>Materials and methods: </strong>In this cross-sectional survey study, a new Google account without a search history was created to determine the 20 most frequently asked questions about hip arthroscopy via Google. These questions were asked to a new ChatGPT-4o account on June 1, 2024, and the responses were recorded. Ten orthopedic surgeons specializing in sports surgery rated the responses using a rating scale to assess relevance, accuracy, clarity, and completeness. The responses were scored on a scale from 1 to 5, with 1 being the worst and 5 being the best. The interrater reliability assessed via the intraclass correlation coefficient (ICC).</p><p><strong>Results: </strong>The lowest score given by the surgeons for any response was 4/5 in each subcategory. The highest mean scores were in accuracy and clarity, followed by relevance, with completeness receiving the lowest scores. The overall mean score was 4.49±0.16. Interrater reliability showed insufficient overall agreement (ICC=0.004, p=0.383), with the highest agreement in clarity (ICC=0.039, p=0.131) and the lowest in accuracy (ICC=-0.019, p=0.688).</p><p><strong>Conclusion: </strong>The study confirms our hypothesis that ChatGPT-4o provides above-average quality responses to frequently asked questions about hip arthroscopy, as evidenced by the high scores in relevance, accuracy, clarity, and completeness. However, it is still advisable to consult orthopedic specialists on the subject, incorporating ChatGPT's suggestions during the final decision-making process.</p>","PeriodicalId":73560,"journal":{"name":"Joint diseases and related surgery","volume":"36 1","pages":"193-199"},"PeriodicalIF":1.9000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11734852/pdf/","citationCount":"0","resultStr":"{\"title\":\"Evaluation of ChatGPT-4o's answers to questions about hip arthroscopy from the patient perspective.\",\"authors\":\"Gökhan Ayık, Niyazi Ercan, Yunus Demirtaş, Tuğrul Yıldırım, Gökhan Çakmak\",\"doi\":\"10.52312/jdrs.2025.1961\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>This study aimed to evaluate the responses provided by ChatGPT-4o to the most frequently asked questions by patients regarding hip arthroscopy.</p><p><strong>Materials and methods: </strong>In this cross-sectional survey study, a new Google account without a search history was created to determine the 20 most frequently asked questions about hip arthroscopy via Google. These questions were asked to a new ChatGPT-4o account on June 1, 2024, and the responses were recorded. Ten orthopedic surgeons specializing in sports surgery rated the responses using a rating scale to assess relevance, accuracy, clarity, and completeness. The responses were scored on a scale from 1 to 5, with 1 being the worst and 5 being the best. The interrater reliability assessed via the intraclass correlation coefficient (ICC).</p><p><strong>Results: </strong>The lowest score given by the surgeons for any response was 4/5 in each subcategory. 
The highest mean scores were in accuracy and clarity, followed by relevance, with completeness receiving the lowest scores. The overall mean score was 4.49±0.16. Interrater reliability showed insufficient overall agreement (ICC=0.004, p=0.383), with the highest agreement in clarity (ICC=0.039, p=0.131) and the lowest in accuracy (ICC=-0.019, p=0.688).</p><p><strong>Conclusion: </strong>The study confirms our hypothesis that ChatGPT-4o provides above-average quality responses to frequently asked questions about hip arthroscopy, as evidenced by the high scores in relevance, accuracy, clarity, and completeness. However, it is still advisable to consult orthopedic specialists on the subject, incorporating ChatGPT's suggestions during the final decision-making process.</p>\",\"PeriodicalId\":73560,\"journal\":{\"name\":\"Joint diseases and related surgery\",\"volume\":\"36 1\",\"pages\":\"193-199\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11734852/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Joint diseases and related surgery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52312/jdrs.2025.1961\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ORTHOPEDICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Joint diseases and related surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52312/jdrs.2025.1961","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/18 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
Evaluation of ChatGPT-4o's answers to questions about hip arthroscopy from the patient perspective.
Objectives: This study aimed to evaluate the responses provided by ChatGPT-4o to patients' most frequently asked questions about hip arthroscopy.
Materials and methods: In this cross-sectional survey study, a new Google account without a search history was created to identify the 20 most frequently asked questions about hip arthroscopy via Google. The questions were then posed to ChatGPT-4o through a newly created account on June 1, 2024, and the responses were recorded. Ten orthopedic surgeons specializing in sports surgery rated each response for relevance, accuracy, clarity, and completeness on a scale from 1 to 5, with 1 being the worst and 5 being the best. Interrater reliability was assessed using the intraclass correlation coefficient (ICC).
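The abstract does not state which software or ICC model was used. As a minimal sketch, assuming a Python workflow with the pingouin package, interrater reliability for one subcategory could be computed from the 10-surgeon by 20-question rating matrix as shown below; the column names and placeholder scores are illustrative assumptions, not the authors' actual data or pipeline.

# Hedged sketch (assumption, not the authors' pipeline): ICC for one
# subcategory from a 10-rater x 20-question matrix of 1-5 scores.
import numpy as np
import pandas as pd
import pingouin as pg

rng = np.random.default_rng(0)

# Long format: one row per (question, rater) pair.
# Placeholder scores of 4 or 5 mimic the narrow range reported in the study.
ratings = pd.DataFrame({
    "question": [q for q in range(1, 21) for _ in range(10)],  # 20 questions
    "rater": [r for _ in range(20) for r in range(1, 11)],     # 10 surgeons
    "score": rng.integers(4, 6, size=200),                     # e.g. relevance ratings
})

# pingouin reports several ICC forms (ICC1-ICC3 and their averaged-rater
# variants); the abstract does not specify which form the authors used.
icc = pg.intraclass_corr(
    data=ratings,
    targets="question",  # the rated items (the 20 questions)
    raters="rater",      # the 10 surgeons
    ratings="score",     # the 1-5 scores for one subcategory
)
print(icc[["Type", "ICC", "pval", "CI95%"]])

Note that when nearly all scores fall at 4 or 5, between-question variance is small, which tends to drive the ICC toward zero even when raters broadly agree.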
Results: The lowest score given by the surgeons for any response was 4/5 in each subcategory. The highest mean scores were in accuracy and clarity, followed by relevance, with completeness receiving the lowest scores. The overall mean score was 4.49±0.16. Interrater reliability showed insufficient overall agreement (ICC=0.004, p=0.383), with the highest agreement in clarity (ICC=0.039, p=0.131) and the lowest in accuracy (ICC=-0.019, p=0.688).
Conclusion: The study confirms our hypothesis that ChatGPT-4o provides above-average-quality responses to frequently asked questions about hip arthroscopy, as evidenced by the high scores in relevance, accuracy, clarity, and completeness. However, patients are still advised to consult orthopedic specialists, incorporating ChatGPT's suggestions into the final decision-making process.