Using Artificial Intelligence (AI) As An External Examiner

Esculapio Pub Date : 2023-11-08 DOI:10.51273/esc23.251319323
Tayyaba Azhar, Kinza Aslam, Zakia Saleem, Ahsan Sethi, Tahseen Fatima

Abstract

Objective: To assess the validity of ChatGPT as an AI-assisted tool for evaluating essay questions. Material and Methods: This cross-sectional quantitative study was conducted at University College of Medicine and Dentistry from June to August 2023. Eighteen questions were selected from fifteen exit tests of the Certificate in HPE course. Each answer was independently graded by two assessors holding doctorates in HPE. The same answers were then re-evaluated using ChatGPT. Inter-rater reliability was determined using the kappa test. Results: Agreement between ChatGPT and examiner scores varied across items. Weak agreement was observed for questions 8 and 9, moderate agreement for questions 2, 3, and 5, and strong agreement for questions 1, 4, 6, and 7. Conclusion: AI-assisted tools such as ChatGPT are a reality, but their use in assessing essay questions would require massive training data from expert assessors. Once appropriately trained, such a tool may replicate assessment decisions across the full range of subjects.
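
The abstract does not say which kappa variant was used or how the scores were coded. As an illustration only, the following minimal sketch computes Cohen's kappa for two raters on a single question. The function name `cohen_kappa` and the score lists are hypothetical and are not data from the study.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same items (categorical scores)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("Both raters must score the same non-empty set of items.")
    n = len(rater_a)

    # Observed agreement: fraction of items given identical scores.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of each rater's marginal proportions per category.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both raters used one identical category throughout; treat as perfect agreement.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical scores (0, 1, or 2 marks) for ten answers to one question.
examiner_scores = [2, 1, 0, 2, 1, 1, 0, 2, 2, 1]
chatgpt_scores = [2, 1, 0, 1, 1, 2, 0, 2, 2, 1]

kappa = cohen_kappa(examiner_scores, chatgpt_scores)
print(f"kappa = {kappa:.4f}")
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it is preferred over a simple percentage match when comparing an automated grader with human examiners. The cut-offs that separate weak, moderate, and strong agreement are not given in the abstract, so none are assumed here.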