Assessing the quality of AI information from ChatGPT regarding oral surgery, preventive dentistry, and oral cancer: An exploration study
Arwa A. Alsayed, Mariam B. Aldajani, Marwan H. Aljohani, Hamdan Alamri, Maram A. Alwadi, Bodor Z. Alshammari, Falah R. Alshammari
Saudi Dental Journal, Volume 36, Issue 11, Pages 1483-1489. Published 2024-11-01. DOI: 10.1016/j.sdentj.2024.09.009
https://www.sciencedirect.com/science/article/pii/S1013905224002621
Abstract
Aim
To evaluate the quality of dental information produced by the ChatGPT artificial intelligence language model in the context of oral surgery, preventive dentistry, and oral cancer.
Methodology
This study adopted a quantitative approach. Experts prepared 50 questions (covering risk factors, preventive measures, diagnostic methods, and treatment options) to present to ChatGPT. Its responses were rated with a standardized scoring rubric for accuracy, completeness, relevance, clarity or comprehensibility, and possible risks. The evaluation process also included feedback on the strengths, weaknesses, and potential areas of improvement in ChatGPT's responses.
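As a rough illustration of how rubric ratings of this kind could be rolled up into per-domain scores out of 5, the sketch below averages the five criteria for each response and then averages responses within a domain. The equal weighting, the 1-5 scale per criterion, the convention that a higher "risk" rating means a safer answer, and all function names and example ratings are assumptions made for illustration; the abstract does not describe the study's actual aggregation procedure.

```python
from statistics import mean

# The five rubric criteria named in the Methodology.
CRITERIA = ("accuracy", "completeness", "relevance", "clarity", "risk")


def response_score(ratings):
    """Mean of the ratings across all five criteria for one response.

    Assumes each criterion is rated 1-5 and weighted equally, and that a
    higher "risk" rating means a safer answer (assumptions, not the study's
    documented method).
    """
    missing = [c for c in CRITERIA if c not in ratings]
    if missing:
        raise ValueError(f"missing ratings for: {missing}")
    return mean(ratings[c] for c in CRITERIA)


def domain_scores(rated_responses):
    """Average the response-level scores within each domain, to 1 decimal."""
    by_domain = {}
    for domain, ratings in rated_responses:
        by_domain.setdefault(domain, []).append(response_score(ratings))
    return {d: round(mean(scores), 1) for d, scores in by_domain.items()}


# Hypothetical ratings invented for illustration; NOT data from the study.
example = [
    ("preventive dentistry",
     {"accuracy": 5, "completeness": 4, "relevance": 5, "clarity": 4, "risk": 4}),
    ("preventive dentistry",
     {"accuracy": 4, "completeness": 4, "relevance": 4, "clarity": 4, "risk": 4}),
    ("oral cancer",
     {"accuracy": 3, "completeness": 3, "relevance": 4, "clarity": 4, "risk": 3}),
]

if __name__ == "__main__":
    for domain, score in domain_scores(example).items():
        print(f"{domain}: {score}/5")
```

Keeping the per-response score separate from the per-domain average makes it easy to swap in a different weighting if a rubric treats some criteria, such as risk, as more important than others.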
Results
ChatGPT achieved its highest score in preventive dentistry (4.3/5) and communicated complex information coherently. Accuracy was lower for oral surgery (3.9/5) and oral cancer (3.6/5), with gaps in post-operative instructions, personalized risk assessments, and specialized diagnostic methods. Potential risks, such as a lack of individualized advice, were identified in 53% of the oral cancer responses and 40% of the oral surgery responses. While showing promise in some domains, ChatGPT had important limitations in specialized areas that require nuanced expertise.
Conclusion
The findings point to the need for professional supervision when using AI-generated information, and for ongoing evaluation as capabilities evolve, to ensure responsible implementation in the best interest of patient care.
About the journal:
Saudi Dental Journal is an English-language, peer-reviewed scholarly publication in the area of dentistry. It publishes original research and reviews on topics including, but not limited to: dental disease; clinical trials; dental equipment; new and experimental techniques; epidemiology and oral health; restorative dentistry; periodontology; endodontology; prosthodontics; paediatric dentistry; and orthodontics and dental education. Saudi Dental Journal is the official publication of the Saudi Dental Society. It is published by King Saud University in collaboration with Elsevier and is edited by an international group of eminent researchers.