Using IBM�s Watson to automatically evaluate student short answer responses

2022 Physics Education Research Conference Proceedings Pub Date : 2022-09-22 DOI:10.1119/perc.2022.pr.campbell

Jennifer Campbell, K. Ansell, Timothy J Stelzer

引用次数: 1

Abstract

Recent advancements in natural language processing (NLP) have generated interest in using computers to assist in the coding and analysis of students’ short answer responses for PER or classroom applications. We train a state-of-the-art NLP, IBM’s Watson, and test its agreement with humans in three varying experimental cases. By exploring these cases, we begin to understand how Watson behaves with ideal and more realistic data, across different levels of training, and across different types of categorization tasks. We find that Watson’s self-reported confidence for categorizing samples is reasonably well-aligned with its accuracy, although this can be impacted by features of the data being analyzed. Based on these results, we discuss implications and suggest potential applications of this technology to education research.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用IBM的沃森自动评估学生的简短回答

自然语言处理(NLP)的最新进展引起了人们对使用计算机协助编写和分析学生的简短回答以用于PER或课堂应用的兴趣。我们训练了一个最先进的自然语言处理系统，IBM的沃森，并在三个不同的实验案例中测试了它与人类的一致性。通过探索这些案例，我们开始了解沃森如何处理理想和更现实的数据，跨越不同级别的训练，以及不同类型的分类任务。我们发现，沃森对样本分类的自我报告信心与其准确性相当好，尽管这可能受到被分析数据特征的影响。基于这些结果，我们讨论了该技术在教育研究中的潜在应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 Physics Education Research Conference Proceedings

自引率

0.00%

发文量