{"title":"The analysis of marking reliability through the approach of gauge repeatability and reproducibility (GR&R) study: a case of English-speaking test","authors":"Pornphan Sureeyatanapas, Panitas Sureeyatanapas, Uthumporn Panitanarak, Jittima Kraisriwattana, Patchanan Sarootyanapat, Daniel O’Connell","doi":"10.1186/s40468-023-00271-z","DOIUrl":"https://doi.org/10.1186/s40468-023-00271-z","url":null,"abstract":"","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"4 8","pages":"1-28"},"PeriodicalIF":2.8,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139438512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"EFL teachers’ cognition of social and psychological consequences of high-stake national language tests: role of teacher training workshops","authors":"Rahmatolah Allahyari, Mahmoud Moradi Abbasabady, Shamim Akhter, Goudarz Alibakhshi","doi":"10.1186/s40468-023-00262-0","DOIUrl":"https://doi.org/10.1186/s40468-023-00262-0","url":null,"abstract":"","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"18 1","pages":"1-16"},"PeriodicalIF":2.8,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139215304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-11-09 | DOI: 10.1186/s40468-023-00269-7
Dongil Shin, Soohyeon Park, Eunhae Cho
{"title":"Correction: A review study on discourse-analytical approaches to language testing policy in the South Korean context","authors":"Dongil Shin, Soohyeon Park, Eunhae Cho","doi":"10.1186/s40468-023-00269-7","DOIUrl":"https://doi.org/10.1186/s40468-023-00269-7","url":null,"abstract":"","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":" 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135242270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-11-07 | DOI: 10.1186/s40468-023-00266-w
Abu Nawas, I Gusti Ngurah Darmawan, Nina Maadad
Abstract Differences in English performance between school types have mainly been investigated across Asian countries. However, little is known about which language skills differentiate schools’ overall language achievement. Using a quantitative, comparative design, this study measured the reading and listening skills of 1319 Indonesian students, selected through a stratified sample design and grouped into secular ( Sekolah , n = 726) and Islamic ( Madrasah , n = 593) groups. The sample was drawn from a total population of 9205 secondary school students in Bone Regency, South Sulawesi, Indonesia. Three-way ANOVA results showed significant differences ( p < 0.05) in reading and listening subskills between the groups. The significantly higher scores of Madrasah students in reading and listening subskills indicate that, compared with students attending secular schools, they are better at constructing what a text means in a variety of contexts, both as a literary experience in reading texts and in obtaining general and specific information from listening tests. The poor performance of boys and of students enrolled in public secular schools may be the main explanation for the achievement gaps across the groups. The main and interaction effects of school system, sector, and gender on the tested subskills are also reported. Additionally, a DIF test confirmed the equity of the tested items across the groups.
{"title":"Indonesian secular vs. Madrasah schools: assessing the discrepancy in English reading and listening tests","authors":"Abu Nawas, I Gusti Ngurah Darmawan, Nina Maadad","doi":"10.1186/s40468-023-00266-w","DOIUrl":"https://doi.org/10.1186/s40468-023-00266-w","url":null,"abstract":"Abstract The greater emphasis on the significance and difference in English performance between the school types has mainly been investigated across Asian countries. However, not much is known about what language skills differentiate their overall language achievement. Using a quantitative study with comparative analysis, this study measured the reading and listening skills of 1319 Indonesian students who were selected using a stratified sample design and grouped them into secular ( Sekolah , n = 726) and Islamic ( Madrasah , n = 593) groups. The samples were selected from 9205 of the total population of secondary school students, in Bone Regency, South Sulawesi Indonesia. The three-way ANOVA results showed a significant difference ( p < 0.05) in reading and listening subskills between the groups. Highly significant results of Madrasah students in reading and listening subskills indicate they are better at constructing what text means in a variety of contexts, as a literary experience in reading texts and obtaining general and specific information from listening tests compared to those attending secular schools. Poor performance of boys and students who enrolled in public secular schools may become the main explanation for achievement gaps across the groups. The main and interaction effects of the school system, sectors, and gender on the tested subskills were also explained in this study. Additionally, the result of the DIF test confirmed that the equity of the tested items between them was supported.","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"52 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135476490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
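The stratified sampling design described in the abstract above (1319 students drawn from a population of 9205 across two school types) can be sketched with proportional allocation. This is a minimal illustration, not the study's procedure: the field name `school` and the stratum sizes below are assumptions chosen only so the allocation sums to the reported sample size.

```python
import random

def stratified_sample(population, strata_key, n):
    """Draw a proportionally allocated stratified sample of size ~n."""
    # Group population units by stratum
    strata = {}
    for unit in population:
        strata.setdefault(unit[strata_key], []).append(unit)
    total = len(population)
    sample = []
    for members in strata.values():
        # Proportional allocation: each stratum contributes its share of n
        k = round(n * len(members) / total)
        sample.extend(random.sample(members, min(k, len(members))))
    return sample

# Hypothetical population mirroring the study's two school types
population = (
    [{"school": "Sekolah"} for _ in range(5000)]
    + [{"school": "Madrasah"} for _ in range(4205)]
)
sample = stratified_sample(population, "school", 1319)
```

With these (made-up) stratum sizes, proportional allocation yields 716 Sekolah and 603 Madrasah students, totaling 1319.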
Pub Date: 2023-10-31 | DOI: 10.1186/s40468-023-00268-8
Karim Ibrahim
{"title":"Correction: Using AI-based detectors to control AI-assisted plagiarism in ESL writing: “The Terminator Versus the Machines”","authors":"Karim Ibrahim","doi":"10.1186/s40468-023-00268-8","DOIUrl":"https://doi.org/10.1186/s40468-023-00268-8","url":null,"abstract":"","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"298 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135814864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-10-26 | DOI: 10.1186/s40468-023-00265-x
Wenfeng Jia, Peixin Zhang
Abstract It is widely believed that raters’ cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. A critical review of previous research in this area shows that raters’ cognition can be boiled down to two fundamental issues: building text images and strategies for articulating scores. Compared to the scoring contexts of previous research, the TEM 8 integrated writing task scoring scale has unique features, so it is important to know how raters build text images and how they articulate scores for those images in the specific context of rating TEM 8 compositions. To answer these questions, the present study conducted qualitative research that treated raters as problem solvers in the light of problem-solving theory. Six highly experienced raters were asked to verbalize their thoughts while rating TEM 8 essays, supplemented by a retrospective interview. Analyzing the collected protocols, we found that, with regard to research question 1, the raters went through two stages, setting building text images as isolated nodes and building holistic text images for each dimension as two sub-goals, respectively. To achieve the first sub-goal, raters used strategies such as single-focus evaluating, diagnosing, and comparing; for the second sub-goal, they mainly used synthesizing and comparing. Regarding the second question, the results showed that the raters resorted to two groups of strategies: demarcating boundaries between scores within a dimension and discriminating between dimensions, each group consisting of more specific processes. Each of the extracted processes was clearly defined and their relationships delineated, on the basis of which a new working model of the rating process was finalized. Overall, the present study deepens our understanding of rating processes and provides evidence for the scoring validity of the TEM 8 integrated writing test. It also has implications for rating practice, such as the need to distinguish between two types of analytical rating scales.
{"title":"Rater cognitive processes in integrated writing tasks: from the perspective of problem-solving","authors":"Wenfeng Jia, Peixin Zhang","doi":"10.1186/s40468-023-00265-x","DOIUrl":"https://doi.org/10.1186/s40468-023-00265-x","url":null,"abstract":"Abstract It is widely believed that raters’ cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. Based on a critical review of previous research in this area, it is found that raters’ cognition can be boiled to two fundamental issues: building text images and strategies for articulating scores. Compared to the scoring contexts of previous research, the TEM 8 integrated writing task scoring scale has unique features. It is urgent to know how raters build text images and how they articulate scores for text images in the specific context of rating TEM8 compositions. In order to answer these questions, the present study conducted qualitative research by considering raters as problem solvers in the light of problem-solving theory. Hence, 6 highly experienced raters were asked to verbalize their thoughts simultaneously while rating TEM 8 essays, supplemented by a retrospective interview. Analyzing the collected protocols, we found that with regard to research question 1, the raters went through two stages by setting building text images as isolated nodes and building holistic text images for each dimension as two sub-goals, respectively. In order to achieve the first sub-goal, raters used strategies such as single foci evaluating, diagnosing, and comparing; for the second sub-goal, they mainly used synthesizing and comparing. Regarding the second question, the results showed that they resorted to two groups of strategies: demarcating boundaries between scores within a dimension and discriminating between dimensions, each group consisting of more specific processes. Each of the extracted processes was defined clearly and their relationships were delineated, on the basis of which a new working model of the rating process was finalized. Overall, the present study deepens our understanding of rating processes and provides evidence for the scoring validity of the TEM 8 integrated writing test. It also provides implications for rating practice, such as the need for the distinction between two types of analytical rating scales.","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"8 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136381599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-10-23 | DOI: 10.1186/s40468-023-00263-z
Yunlong Liu, Yaqiong Cui, Hua Yu
Abstract Many Chinese universities are implementing an educational reform that transforms their general English courses into academic English courses. Accordingly, how students’ English ability is assessed should also be reformed. In this test review, we introduce a school-based general academic English summative test developed by English instructors. An argument-based approach was adopted to analyze the test’s validity, drawing on students’ test data and their reflective responses to the test. By including the practices of course instructors and the voices of students, two important test stakeholders, in test design, this review can serve as a practical reference for developing a valid general academic English summative test.
{"title":"Test review: a general academic English summative test","authors":"Yunlong Liu, Yaqiong Cui, Hua Yu","doi":"10.1186/s40468-023-00263-z","DOIUrl":"https://doi.org/10.1186/s40468-023-00263-z","url":null,"abstract":"Abstract Many Chinese universities are implementing an educational reform to transform their general English courses into academic English courses. Accordingly, how to assess students’ English ability should also be reformed. In this test review, we introduce a school-based general academic English summative test developed by English instructors. An argument-based approach was adopted to analyze the test validity by obtaining students’ test data and their reflective responses to the test. This review can provide a practical reference for the development of a valid general academic English summative test by including practices of course instructors and voices of students, two important test stakeholders, in test design.","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"44 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135405800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-10-20 | DOI: 10.1186/s40468-023-00261-1
Soi Kei Ho, Zhengdong Gan
Abstract This comparative study investigated the associations between instructional practices and students’ reading performance in 10 top-performing regions that participated in the Program for International Student Assessment (PISA) 2018. A nationally representative sample of 80,016 15-year-old students from 5 Asian regions (B-S-J-Z [China], Singapore, Macao, Hong Kong, and Korea) and 5 Western regions (Estonia, Canada, Finland, Ireland, and Poland) was included. A secondary analysis of PISA survey and assessment data was conducted. T-test and ANOVA analyses revealed systematic differences in the instructional practices of the 10 regions. B-S-J-Z (China) had significantly higher levels of teacher support, teacher-directed instruction, and teacher stimulation than the other sampled regions. Asian regions tended to have higher levels of teacher support, teacher-directed instruction, teacher feedback, adaptive instruction, and teacher enthusiasm than Western regions, although variation was also found within each group. Hierarchical linear regression (HLR) analyses indicated that reading performance was positively predicted by teacher support, adaptive instruction, teacher stimulation, and teacher enthusiasm, but negatively predicted by teacher-directed instruction and teacher feedback. This study sheds light on effective instructional practices for optimizing students’ reading performance across different cultural contexts.
{"title":"Instructional practices and students’ reading performance: a comparative study of 10 top performing regions in PISA 2018","authors":"Soi Kei Ho, Zhengdong Gan","doi":"10.1186/s40468-023-00261-1","DOIUrl":"https://doi.org/10.1186/s40468-023-00261-1","url":null,"abstract":"Abstract This comparative study investigated the associations between instructional practices and students’ reading performance among 10 top performing regions that participated in the Program for International Student Assessment (PISA) 2018. A nationally representative sample consisting of 80,016 15-year-old students from 5 Asian regions (B-S-J-Z [China], Singapore, Macao, Hong Kong, and Korea) and 5 Western regions (Estonia, Canada, Finland, Ireland, and Poland) were included. A secondary analysis of PISA survey and assessment data was conducted. T test and ANOVA analyses revealed systematic differences in instructional practices of the 10 regions. B-S-J-Z (China) had significantly higher levels of teacher support, teacher-directed instruction, and teacher stimulation than the other sample regions. Asian regions tended to have higher levels of teacher support, teacher-directed instruction, teacher feedback, adaptive instruction, and teacher enthusiasm compared with Western regions, although variations were also found within Asian regions or within Western regions. Hierarchical linear regression (HLR) analyses indicated that reading performance was positively predicted by teacher support, adaptive instruction, teacher stimulation, and teacher enthusiasm, but negatively predicted by teacher-directed instruction and teacher feedback. This study sheds light on the effective instructional practices for optimizing students’ reading performance across different cultural contexts.","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135617037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-10-16 | DOI: 10.1186/s40468-023-00260-2
Karim Ibrahim
Abstract The release of ChatGPT marked the beginning of a new era of AI-assisted plagiarism that disrupts traditional assessment practices in ESL composition. In the face of this challenge, educators are left with little guidance in controlling AI-assisted plagiarism, especially when conventional methods fail to detect AI-generated texts. One approach to managing AI-assisted plagiarism is using fine-tuned AI classifiers, such as RoBERTa, to identify machine-generated texts; however, the reliability of this approach is yet to be established. To address the challenge of AI-assisted plagiarism in ESL contexts, the present cross-disciplinary descriptive study examined the potential of two RoBERTa-based classifiers to control AI-assisted plagiarism on a dataset of 240 human-written and ChatGPT-generated essays. Data analysis revealed that both platforms could identify AI-generated texts, but their detection accuracy was inconsistent across the dataset.
{"title":"Using AI-based detectors to control AI-assisted plagiarism in ESL writing: “The Terminator Versus the Machines”","authors":"Karim Ibrahim","doi":"10.1186/s40468-023-00260-2","DOIUrl":"https://doi.org/10.1186/s40468-023-00260-2","url":null,"abstract":"Abstract The release of ChatGPT marked the beginning of a new era of AI-assisted plagiarism that disrupts traditional assessment practices in ESL composition. In the face of this challenge, educators are left with little guidance in controlling AI-assisted plagiarism, especially when conventional methods fail to detect AI-generated texts. One approach to managing AI-assisted plagiarism is using fine-tuned AI classifiers, such as RoBERTa, to identify machine-generated texts; however, the reliability of this approach is yet to be established. To address the challenge of AI-assisted plagiarism in ESL contexts, the present cross-disciplinary descriptive study examined the potential of two RoBERTa-based classifiers to control AI-assisted plagiarism on a dataset of 240 human-written and ChatGPT-generated essays. Data analysis revealed that both platforms could identify AI-generated texts, but their detection accuracy was inconsistent across the dataset.","PeriodicalId":37050,"journal":{"name":"Language Testing in Asia","volume":"279 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136077859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
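The evaluation described in the AI-detection abstract above — checking whether a detector's accuracy holds up across subsets of a dataset — can be sketched as follows. The `accuracy_by_class` helper and the prediction values are hypothetical illustrations, not the study's classifiers or data; they simply show how per-class accuracy can diverge even when overall accuracy looks acceptable.

```python
def accuracy_by_class(labels, predictions):
    """Overall and per-class accuracy for a binary AI-text detector."""
    assert len(labels) == len(predictions)
    correct = {"human": 0, "ai": 0}
    total = {"human": 0, "ai": 0}
    for truth, pred in zip(labels, predictions):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1
    overall = sum(correct.values()) / len(labels)
    per_class = {c: correct[c] / total[c] for c in total if total[c]}
    return overall, per_class

# Hypothetical outcome: strong on AI-generated texts, weaker on human-written ones
labels = ["ai"] * 10 + ["human"] * 10
predictions = ["ai"] * 9 + ["human"] + ["human"] * 6 + ["ai"] * 4

overall, per_class = accuracy_by_class(labels, predictions)
```

In this toy run the detector scores 75% overall but only 60% on human-written texts, i.e. it misclassifies many human essays as AI-generated — the kind of inconsistency across the dataset the abstract reports.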