{"title":"口语评估中的评价者认知调查","authors":"Przemysław Krakowian","doi":"10.18778/1731-7533.20.3.04","DOIUrl":null,"url":null,"abstract":"In the assessment of spoken production, numerous reasons can be identified behind the decisions that raters make in evaluating samples of oral performance. Inter and intra rater factors are relatively well documented in various reliability and validity studies. Some that have been identified in literature involve the effects of examinee pairing or the familiarity with the examinees, others point in the direction of gender and gender role perceptions O’Sullivan (2008), others appear to be connected with body language and non-verbal cues that accompany oral production (cf.: Krahmer and Swerts 2004, Seiter, Weger, Jensen and Kinzer 2010). While some studies that address the assessment of speaking English in exam contexts suggest that raters may not feel as comfortable assessing pronunciation as they do other aspects of a speaker’s performance (Orr 2002, Hubbard, Gilbert and Pidcock 2006, Brown 2006, De Velle 2008), more recent investigations of rater behaviour involving electronic evidence from training, maintenance and online examination programmes tentatively show that pronunciation, in fact, is the first category examiners attend to (Hubbard 2011, Chambers and Ingham 2011, Krakowian 2011, Seed 2012, Tynan 2015, Kang and Ginther 2019). This paper looks at large collection of assessments stored in an electronic system to investigate what raters really seem to pay attention to when allegedly following rating scales.","PeriodicalId":38985,"journal":{"name":"Research in Language","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Investigating Rater Perceptions in the Assessment of Speaking\",\"authors\":\"Przemysław Krakowian\",\"doi\":\"10.18778/1731-7533.20.3.04\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the assessment of spoken production, numerous reasons can be identified behind the decisions that raters make in evaluating samples of oral performance. Inter and intra rater factors are relatively well documented in various reliability and validity studies. Some that have been identified in literature involve the effects of examinee pairing or the familiarity with the examinees, others point in the direction of gender and gender role perceptions O’Sullivan (2008), others appear to be connected with body language and non-verbal cues that accompany oral production (cf.: Krahmer and Swerts 2004, Seiter, Weger, Jensen and Kinzer 2010). While some studies that address the assessment of speaking English in exam contexts suggest that raters may not feel as comfortable assessing pronunciation as they do other aspects of a speaker’s performance (Orr 2002, Hubbard, Gilbert and Pidcock 2006, Brown 2006, De Velle 2008), more recent investigations of rater behaviour involving electronic evidence from training, maintenance and online examination programmes tentatively show that pronunciation, in fact, is the first category examiners attend to (Hubbard 2011, Chambers and Ingham 2011, Krakowian 2011, Seed 2012, Tynan 2015, Kang and Ginther 2019). This paper looks at large collection of assessments stored in an electronic system to investigate what raters really seem to pay attention to when allegedly following rating scales.\",\"PeriodicalId\":38985,\"journal\":{\"name\":\"Research in Language\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research in Language\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18778/1731-7533.20.3.04\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Arts and Humanities\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Language","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18778/1731-7533.20.3.04","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 0
摘要
在对口语制作的评估中,评分者在评估口语表现样本时所做的决定背后有许多原因。在各种信度和效度研究中,内部因素和内部因素的记录相对较好。一些在文献中被确定的影响涉及考生配对或与考生的熟悉程度的影响,其他的指向性别和性别角色认知的方向O 'Sullivan(2008),其他的似乎与伴随着口头生产的肢体语言和非语言线索有关(参见:Krahmer和Swerts 2004, Seiter, Weger, Jensen和Kinzer 2010)。虽然一些针对考试环境中英语口语评估的研究表明,评分者在评估说话者的其他方面表现时,可能不会像评估发音那样感到舒服(Orr 2002, Hubbard, Gilbert and Pidcock 2006, Brown 2006, De Velle 2008),但最近对评分者行为的调查初步表明,事实上,发音是一个重要的因素。是审查员关注的第一类(Hubbard 2011, Chambers and Ingham 2011, Krakowian 2011, Seed 2012, Tynan 2015, Kang and Ginther 2019)。本文着眼于存储在电子系统中的大量评估,以调查评级者在据称遵循评级量表时似乎真正关注的是什么。
Investigating Rater Perceptions in the Assessment of Speaking
In the assessment of spoken production, numerous reasons can be identified behind the decisions that raters make in evaluating samples of oral performance. Inter and intra rater factors are relatively well documented in various reliability and validity studies. Some that have been identified in literature involve the effects of examinee pairing or the familiarity with the examinees, others point in the direction of gender and gender role perceptions O’Sullivan (2008), others appear to be connected with body language and non-verbal cues that accompany oral production (cf.: Krahmer and Swerts 2004, Seiter, Weger, Jensen and Kinzer 2010). While some studies that address the assessment of speaking English in exam contexts suggest that raters may not feel as comfortable assessing pronunciation as they do other aspects of a speaker’s performance (Orr 2002, Hubbard, Gilbert and Pidcock 2006, Brown 2006, De Velle 2008), more recent investigations of rater behaviour involving electronic evidence from training, maintenance and online examination programmes tentatively show that pronunciation, in fact, is the first category examiners attend to (Hubbard 2011, Chambers and Ingham 2011, Krakowian 2011, Seed 2012, Tynan 2015, Kang and Ginther 2019). This paper looks at large collection of assessments stored in an electronic system to investigate what raters really seem to pay attention to when allegedly following rating scales.
期刊介绍:
Research in Language (RiL) is an international journal committed to publishing excellent studies in the area of linguistics and related disciplines focused on human communication. Language studies, as other scholarly disciplines, undergo two seemingly counteracting processes: the process of diversification of the field into narrow specialized domains and the process of convergence, strengthened by interdisciplinarity. It is the latter perspective that RiL editors invite for the journal, whose aim is to present language in its entirety, meshing traditional modular compartments, such as phonetics, phonology, morphology, syntax, semantics, and pragmatics, and offer a multidimensional perspective which exposes varied but relevant aspects of language, e.g. the cognitive, the psychological, the institutional aspect, as well as the social shaping of linguistic convention and creativity.