{"title":"Strengths and weaknesses of automated scoring of free-text student answers","authors":"Marie Bexte, Andrea Horbach, Torsten Zesch","doi":"10.1007/s00287-024-01573-z","DOIUrl":null,"url":null,"abstract":"<p>Free-text tasks, where students need to write a short answer to a specific question, serve as a well-established method for assessing learner knowledge. To address the high cost of manually scoring these tasks, automated scoring models can be used. Such models come in various types, each with its own strengths and weaknesses. Comparing these models helps in selecting the most suitable one for a given problem. Depending on the assessment context, this decision can be driven by ethical or legal considerations. When implemented successfully, a scoring model has the potential to substantially reduce costs and enhance the reliability of the scoring process. This article compares the different categories of scoring models across a set of crucial criteria that have immediate relevance to model employment in practice.</p>","PeriodicalId":39769,"journal":{"name":"Informatik-Spektrum","volume":"7 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatik-Spektrum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00287-024-01573-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
Free-text tasks, where students need to write a short answer to a specific question, serve as a well-established method for assessing learner knowledge. To address the high cost of manually scoring these tasks, automated scoring models can be used. Such models come in various types, each with its own strengths and weaknesses. Comparing these models helps in selecting the most suitable one for a given problem. Depending on the assessment context, this decision can be driven by ethical or legal considerations. When implemented successfully, a scoring model has the potential to substantially reduce costs and enhance the reliability of the scoring process. This article compares the different categories of scoring models across a set of crucial criteria that have immediate relevance to model employment in practice.
期刊介绍:
Im Informatik Spektrum finden Sie aktuelle, praktisch verwertbare Informationen über technische und wissenschaftliche Trends und Entwicklungen aus allen Bereichen der Informatik. Die Zeitschrift enthält Übersichtsartikel und einführende Darstellungen sowie Berichte über Projekte und Fallstudien aus der Praxis. Interviews, Kolumnen und Buchrezensionen runden das Angebot ab.Bilden Sie sich weiter, erschließen Sie sich neue Sachgebiete oder verschaffen Sie sich einen Überblick. Informatik Spektrum richtet sich neben Informatikspezialisten auch an Praktiker und Studierende, die Interesse an der wissenschaftlichen Entwicklung und praktischen Anwendung der Informatik haben.Möchten Sie zu einem Heft beitragen, richten Sie Ihren Vorschlag gerne an den Chefredakteur Peter Pagel (peter.pagel@springer.com). Willkommen sind Beiträge zum jeweiligen Schwerpunkt ebenso wie Beiträge zum gesamten Themenspektrum der Informatik.