Pub Date: 2022-03-01 | DOI: 10.1080/13803611.2022.2041876
Michael W Crossley
The global expansion of higher education has been increasingly evident since the 1980s, driven partly by the emergence of the knowledge economy and inspiring the internationalisation of all dimensions of the sector worldwide. Western, neoliberal models, values, and assumptions have dominated this process, leading to the policy transfer of funding regimes favouring marketisation, the rise of the private sector, increased reliance upon student fees and governance processes seeking intergovernmental and intragovernmental coordination, and the harmonisation of higher education systems, standards, and qualifications frameworks. Within Europe this led to the signing of the Bologna Declaration in 1999 and the creation of the European Higher Education Area: “to facilitate student and staff mobility, to make higher education more inclusive and accessible, and to make higher education in Europe more attractive and competitive worldwide” (https://ec.europa.eu/education/policies/higher-education/bologna-process-and-european-higher-education-area_en). Beyond Europe and North America, the influence of these and related developments can be seen in the rapid growth and internationalisation of higher education in contexts as diverse as Africa, Latin America, the Indian subcontinent, Southeast Asia, Oceania, and Mainland China. In the latter case, the dramatic growth of the home sector has been combined with exponential increases in the numbers of Chinese students pursuing higher education abroad, most notably in English-speaking countries such as the United States, the United Kingdom, and Australia. While the internationalisation of higher education has also been intensified by the impact of competitive global league tables and university rankings influencing status, reputations, enrolments, and the flow of research funds, such processes and new “governance mechanisms” have been increasingly challenged and problematised. This is especially significant in the work of comparative and international researchers who have long called for greater context sensitivity and the critical interrogation of policy flows in all sectors of education.
Article: “Policy transfer, context sensitivity, and epistemic justice: commentary and overview” (Educational Research and Evaluation, 27(1), 357–363)
Pub Date: 2022-02-27 | DOI: 10.1080/13803611.2022.2041875
Sathya Chea, W. Lo
ABSTRACT Building employability skills in Cambodia is emphasised in the development of higher education within the globalisation context. Given that English proficiency has increasingly become an important credential in the more internationalised Cambodian labour market, this paper examines the development and revision of an English language education programme at a renowned university in Phnom Penh to exemplify how an emphasis on employability skills development emerges with the global trends in curriculum in Cambodian higher education. It also discusses how revisions are made in response to the changing national context and how corresponding changes in pedagogical strategy appear at the institutional and classroom levels. This discussion documents and analyses the interplay between curriculum development and changes at global, national, and institutional levels in Cambodia – a fast-growing country that actively engages with globalisation and regionalisation.
Article: “International connectivity and employability in Cambodian higher education: a case study of developing employability skills in English language education” (Educational Research and Evaluation, 27(1), 335–356)
Pub Date: 2022-02-17 | DOI: 10.1080/13803611.2021.2022307
Thomas Perry, B. See
The last two decades have seen many developments in education research, including the growth of robust studies testing education programmes and policies using experimental designs (Hedges, 2018), such as randomised controlled trials (RCTs). RCTs have been a focus for replication study in education, with researchers seeking to replicate similar programmes under trial conditions. These replications have had varying results and have raised questions about why some results have replicated successfully and others have not. Results from recent Education Endowment Foundation effectiveness trials are good examples of this. A number of education programmes have shown beneficial effects on young people’s learning outcomes in efficacy trials, but no effects in larger-scale effectiveness trials. Examples of these programmes include Philosophy for Children (Gorard et al., 2018), Switch-on (Reading Recovery) (Gorard et al., 2014), and Accelerated Reader (Gorard et al., 2015). Some may conclude that one of these evaluations must be wrong. It is important to realise that in all of these examples, the contexts and the fidelity of implementation differed. In the Philosophy for Children effectiveness trial, 53% of the schools did not implement the intervention as intended (Lord et al., 2021). This is the nature of effectiveness trials, where the programme is delivered in real-life conditions, whereas in efficacy trials the delivery would be closely monitored and controlled, and with a smaller sample. Similarly, with the Switch-on evaluation, although schools delivered the required number of sessions, they modified the content and the delivery format of the intervention (Patel et al., 2017). There were also important differences between the efficacy and effectiveness trials. The efficacy trial was conducted with first-year secondary school children, whereas the effectiveness trial was with primary school children. The tests used also differed in the two evaluations. In the efficacy trial, reading was measured using the GL New Group Reading Test (Gorard et al., 2014), but in the effectiveness trial the test used was the Hodder Group Reading Test (Patel et al., 2017). What these two examples suggest is that variations in the context and target population for the study, and variations in the measures and experimental conditions, can have an appreciable effect on the result. These examples also highlight the point that adherence to the fundamental principles of the original programme is essential for effective replication. Without this, it is difficult to know whether unsuccessful replication is because the programme does not work, or because it does not work with a certain population or under certain conditions. It is therefore worthwhile replicating these studies while maintaining high fidelity to the intervention and at the same time varying the population and instruments used, as suggested by Wiliam (2022). Related to these efforts are questions about the role of science in education.
Article: “Replication study in education” (Educational Research and Evaluation, 27(1), 1–7)
Pub Date: 2022-02-17 | DOI: 10.1080/13803611.2021.2022318
Noemí Suárez Monzón, Vanessa Gómez Suárez, Diego Gudberto Lara Paredes
ABSTRACT Previous studies have identified a positive relationship between students’ perceptions of student evaluations of teaching (SET) and the grades that students provide in SET, controlling for other bias factors. The research by Spooren and Christiaens in 2017 at the University of Antwerp supported this finding. In this study, the methodology used by Spooren and Christiaens was replicated at the Technological Indoamerica University in Ecuador, in a close conceptual replication. In the replicated study, 967 undergraduate participants answered the questionnaires used by the original authors. The replication study sample was very similar in size, seniority, and gender to the original study but not in academic disciplines studied. Most of the students agreed that the evaluation was relevant and could improve teaching practices. Results show a statistically significant but small positive relation between perceptions of SET and SET scores (0.20 for the Belgian university and 0.27 for the Ecuadorian university).
Article: “Is my opinion important in evaluating lecturers? Students’ perceptions of student evaluations of teaching (SET) and their relationship to SET scores” (Educational Research and Evaluation, 27(1), 117–140)
Pub Date: 2022-02-07 | DOI: 10.1080/13803611.2021.2022316
P. Starkey, Alice E. Klein, Ben Clarke, S. Baker, Jaime Thomas
ABSTRACT A socioeconomic status (SES)-related achievement gap in mathematics emerges prior to school entry, and increases in elementary school. This gap makes implementation of demanding mathematics standards (e.g., the Common Core State Standards) an ongoing challenge. Early educational intervention is a strategy for addressing this challenge. A randomised controlled trial was conducted in public American preschools to (1) replicate the efficacy of an intervention, Pre-K Mathematics, for low-SES children, and (2) test the combined impact of this intervention and a Common-Core-aligned kindergarten intervention, Early Learning in Mathematics. Forty-one clusters of pre-kindergarten and kindergarten classrooms, containing a sample of 389 low-SES children from an agricultural region, were randomly assigned to treatment and control conditions. The original impact findings were replicated: Child mathematics outcomes in pre-kindergarten were positive and significant. Gains were maintained in kindergarten. Thus, the gap can be reduced and gains maintained by sustained early intervention.
Article: “Effects of early mathematics intervention for low-SES pre-kindergarten and kindergarten students: a replication study” (Educational Research and Evaluation, 27(1), 61–82)
Pub Date: 2022-02-01 | DOI: 10.1080/13803611.2021.2022320
D. Foung, Lucas Kohnke
ABSTRACT Replication studies are uncommon in education, and replications of validation studies are rarer. This study aimed to replicate, reproduce, and expand the study by Jellicoe and Forsythe published in 2019 that validated the Feedback in Learning Scale. We followed the original procedures, conducting a full validation process. We found only an 87% agreement between our model parameters and those of the original study. The differences were derived from the number of factors retained and the fit indices of alternative models. Fuller details of the methods used in the original study would have helped us to better ensure replicability. We also suggest that feedback in higher education (the context for our study) might be more effective if it were less personal and more task-related than workplace feedback (the context from which the Feedback in Learning Scale was derived).
Article: “The development and validation of the Feedback in Learning Scale (FLS): a replication study” (Educational Research and Evaluation, 27(1), 164–187)
Pub Date: 2022-01-31 | DOI: 10.1080/13803611.2021.2022314
K. Morrison
ABSTRACT Conceptual replications have received increased coverage in the educational research agenda. This article argues for clarity in, and justification of, the definition, scope, and boundaries of a conceptual replication and what it can and cannot do. It argues for clear justifications when changing components from those of the original study. The article raises issues concerning internal validity and construct validity which arise from the elision of replication with applicability and generalisability in a conceptual replication, and questions how far the “concept” needs, and can obtain, greater separation from context. It indicates limits to the power of conceptual replications to falsify and verify the original study, and argues for greater specificity, precision, accuracy, and attention to contexts, conditions, and causality and their influence on outcomes. Implications are drawn for preparing, planning, conducting, analysing, judging, and reporting “fair” conceptual replications in education, identifying 10 “rules” for a fair conceptual replication.
Article: “Conceptual replications, research, and the ‘what works’ agenda in education” (Educational Research and Evaluation, 27(1), 35–60)
Pub Date: 2022-01-31 | DOI: 10.1080/13803611.2021.2022319
C. Bokhove
ABSTRACT An article by Kim et al. from 2014 examined individual- and school-level variables affecting the information and communication technology (ICT) literacy level of Korean elementary school students, finding differential gender effects. In this secondary data replication, we used data from the 2018 International Computer and Information Literacy Study, focusing on data from Korea as main replication. As many characteristics of the study as possible, such as variables and analytical strategy, were modelled in the analysis. Additional analyses included 13 countries and jurisdictions, varied centring techniques for variables, and missing data treatment. The replication and analyses were pre-registered via the Open Science Framework. The main analysis did not replicate the main gender finding. However, it was also clear that, despite care taken in a rigorous replication, analytical variability still plays a large role in replications of findings, and with secondary datasets. We discuss the implications of this for secondary data replications.
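The varied “centring techniques” referred to in this abstract can be illustrated with a minimal sketch (the school names and scores below are invented for illustration; a real multilevel analysis would be fitted with dedicated software):

```python
import statistics

# Toy clustered data: ICT literacy scores for pupils nested in schools
# (hypothetical values, not from the ICILS dataset).
data = {
    "school_a": [52.0, 55.0, 61.0],
    "school_b": [70.0, 74.0, 78.0],
}

all_scores = [s for scores in data.values() for s in scores]
grand_mean = statistics.mean(all_scores)

# Grand-mean centring: subtract the overall mean from every score,
# preserving between-school differences in the centred variable.
grand_centred = {
    school: [s - grand_mean for s in scores] for school, scores in data.items()
}

# Group-mean centring: subtract each school's own mean instead,
# removing between-school differences entirely.
group_centred = {
    school: [s - statistics.mean(scores) for s in scores]
    for school, scores in data.items()
}
```

Which centring is appropriate depends on whether within-cluster or between-cluster effects are of interest, which is one reason such seemingly minor choices contribute to the analytical variability the study documents.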
Article: “The role of analytical variability in secondary data replications: a replication of Kim et al. (2014)” (Educational Research and Evaluation, 27(1), 141–163)
Pub Date: 2022-01-31 | DOI: 10.1080/13803611.2021.2022309
D. Wiliam
For anyone who understands the logic of null-hypothesis significance testing, the so-called “replication crisis” in the behavioural sciences (Bryan et al., 2021) would not have come as much of a surprise. Since the pioneering work of Carlo Bonferroni (1935) – and subsequent work in the 1950s by Henry Scheffé (1953), John Tukey (1953/1994), and Olive Jean Dunn (1961) – statisticians have repeatedly pointed out the logically obvious fact that the probability of making a Type I error (mistakenly rejecting the null hypothesis) increases when multiple comparisons are made. And yet, studies in leading psychology and education journals commonly present dozens if not hundreds of comparisons of means, correlations, or other statistics, and then go on to claim that any statistic that has a probability of less than 0.05 is “significant”. However, as Gelman and Loken (2013) point out, even when researchers do not engage in such “fishing expeditions”, if decisions about the analysis are made after the data are collected – “hypothesizing after results are known” or “HARKing” (Kerr, 1998) – then the probability of Type 1 errors is increased. At each stage in the analysis, the researcher is presented with many choices – what Gelman and Loken call “the garden of forking paths” after a short story by Argentinian author Jorge Luis (Borges, 1941/1964) – that can profoundly influence the results obtained. Some of these, such as cleaning data, or eliminating outliers, seem innocent, but nevertheless, because these decisions are taken after the results are seen, they are inconsistent with the assumptions of nullhypothesis significance testing. Other, more egregious, examples include outcome switching, collecting additional data, or changing the analytical approach when the desired level of statistical significance is not reached. 
A good example of how these issues play out in practice is provided by Bokhove (2022) in his replication of a study on gender differences in computer literacy, where he found that different, reasonable, analytical choices lead to very different conclusions.
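The arithmetic behind the multiple-comparisons point above is simple: across m independent tests each run at level α, the probability of at least one Type I error (the family-wise error rate) is 1 − (1 − α)^m. A minimal sketch of this calculation, with the classical Bonferroni correction mentioned in the editorial (function names here are illustrative, not from any cited source):

```python
# Family-wise error rate (FWER) for m independent tests at level alpha:
# P(at least one Type I error) = 1 - (1 - alpha)**m

def fwer(m, alpha=0.05):
    """Probability of at least one false positive across m independent tests."""
    return 1 - (1 - alpha) ** m

def bonferroni_alpha(m, alpha=0.05):
    """Bonferroni-corrected per-test level that caps the FWER at alpha."""
    return alpha / m

# With 20 comparisons at the conventional 0.05 level, the chance of at
# least one spurious "significant" result is already about 64%:
print(f"FWER for 20 tests at alpha=0.05: {fwer(20):.3f}")
print(f"Bonferroni per-test level:       {bonferroni_alpha(20)}")
```

For the dozens or hundreds of comparisons the editorial describes, the uncorrected family-wise error rate approaches 1, which is why a scattering of p < 0.05 results in such studies carries little evidential weight.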
{"title":"How should educational research respond to the replication “crisis” in the social sciences? Reflections on the papers in the Special Issue","authors":"D. Wiliam","doi":"10.1080/13803611.2021.2022309","DOIUrl":"https://doi.org/10.1080/13803611.2021.2022309","url":null,"abstract":"For anyone who understands the logic of null-hypothesis significance testing, the so-called “replication crisis” in the behavioural sciences (Bryan et al., 2021) would not have come as much of a surprise. Since the pioneering work of Carlo Bonferroni (1935) – and subsequent work in the 1950s by Henry Scheffé (1953), John Tukey (1953/1994), and Olive Jean Dunn (1961) – statisticians have repeatedly pointed out the logically obvious fact that the probability of making a Type I error (mistakenly rejecting the null hypothesis) increases when multiple comparisons are made. And yet, studies in leading psychology and education journals commonly present dozens if not hundreds of comparisons of means, correlations, or other statistics, and then go on to claim that any statistic that has a probability of less than 0.05 is “significant”. However, as Gelman and Loken (2013) point out, even when researchers do not engage in such “fishing expeditions”, if decisions about the analysis are made after the data are collected – “hypothesizing after results are known” or “HARKing” (Kerr, 1998) – then the probability of Type I errors is increased. At each stage in the analysis, the researcher is presented with many choices – what Gelman and Loken call “the garden of forking paths” after a short story by Argentinian author Jorge Luis Borges (1941/1964) – that can profoundly influence the results obtained. Some of these, such as cleaning data or eliminating outliers, seem innocent, but nevertheless, because these decisions are taken after the results are seen, they are inconsistent with the assumptions of null-hypothesis significance testing. 
Other, more egregious, examples include outcome switching, collecting additional data, or changing the analytical approach when the desired level of statistical significance is not reached. A good example of how these issues play out in practice is provided by Bokhove (2022) in his replication of a study on gender differences in computer literacy, where he found that different, reasonable, analytical choices lead to very different conclusions.","PeriodicalId":47025,"journal":{"name":"Educational Research and Evaluation","volume":"27 1","pages":"208 - 214"},"PeriodicalIF":1.4,"publicationDate":"2022-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43997167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2022-01-31DOI: 10.1080/13803611.2021.2022308
John F. Brown
ABSTRACT This paper discusses adapting Churches’ approach to large-scale teacher/researcher conceptual replications of major “science of learning” findings, in order to increase teachers’ engagement with empirical research and to build research networks for gathering data on the science of learning. The project demonstrated the feasibility of teacher-led randomised controlled trials for conceptually replicating the effects of cognitive science findings on learning, as specified by researchers. It also indicated high levels of interest among teachers in applying more of the science of learning in their practice. The approach gave teachers freedom to design interventions, choose research methods, and measure outcomes, even though such freedom is in tension with scientific research that relies on constraining sources of variation. This paper discusses how a balance can be struck between the objectives of teachers and researchers engaged in replicating cognitive science findings, and how teacher engagement in conceptual replication research can be promoted.
{"title":"Replication studies: an essay in praise of ground-up conceptual replications in the science of learning","authors":"John F. Brown","doi":"10.1080/13803611.2021.2022308","DOIUrl":"https://doi.org/10.1080/13803611.2021.2022308","url":null,"abstract":"ABSTRACT This paper discusses adapting Churches’ approach to large-scale teacher/researcher conceptual replications of major “science of learning” findings, to increase teachers’ engagement with empirical research on, and building research networks for, gathering data on the science of learning. The project here demonstrated the feasibility of teacher-led randomised controlled trials for conceptually replicating the effects of cognitive science on learning, as specified by researchers. It also indicated high levels of interest by teachers in applying more science of learning in their practice. The approach gave freedom to teachers to design interventions, choose research methods, and measure outcomes, even though such freedom would be in tension with some scientific research which relies on constraining the sources of variation. This paper discusses how a balance can be struck between the objectives of teachers and researchers engaged in replicating cognitive science findings, and promoting teacher engagement in conceptual replication research.","PeriodicalId":47025,"journal":{"name":"Educational Research and Evaluation","volume":"27 1","pages":"188 - 207"},"PeriodicalIF":1.4,"publicationDate":"2022-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47534372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}