The emergence and development of Bayesian psychometrics stem from psychometrics' drive to reduce measurement error. This book is the first to present a systematic account of the Bayesian approach in psychometric research. It consists of two parts: the first presents the main principles of the Bayesian approach (Foundations), and the second covers their application in psychometric modeling (Psychometrics). The reviewer believes the book will be useful for readers accustomed to working within the frequentist approach who would like to learn about the Bayesian one. At the same time, she recommends that readers unsure of their mathematical background also consult other sources that are less heavy on mathematical detail. The book has not been translated into Russian.
{"title":"Power of Probability in Psychometrics. Review of the book “Bayesian Psychometric Modeling“","authors":"Ирина Угланова","doi":"10.17323/vo-2023-17952","DOIUrl":"https://doi.org/10.17323/vo-2023-17952","url":null,"abstract":"The emergence and development of Bayesian psychometrics is a result of psychometrics' desire to reduce measurement error. This book is the first to present a systematic description of the Bayesian approach in psychometric research. The book consists of two parts: the first one presents the main principles of the Bayesian approach (Foundations), the second one includes their application in psychometric modeling (Psychometrics). The reviewer believes that the publication will be useful for those who used to work in the frequentist approach and would like to learn about the Bayesian approach. At the same time, she recommends those who are not sure of the quality of their mathematical background to additionally turn to other sources that are not equipped with such a detailed mathematical description. The book has not been translated into Russian.
","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"51 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135545200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Psychological theories regarding ability and personality traits often rely on the results of psychometric modelling. The latter is assumed to link responses to test items to an unobserved 'construct' (trait, ability), which is 'modelled' from the test data. However, does agreement between the data and the model indicate that the model represents a psychological construct? To what extent is 'psychometric modelling' modelling in the general scientific sense of the term? The validity of using modelling results to understand psychological phenomena depends on the answers to these questions. The article analyses the logic of psychometric modelling in comparison with modelling in other sciences and argues that psychological phenomena, as the subject of modelling, are involved neither in the construction nor in the correction of models. The article raises the problem of unjustified interpretations of modelling results in psychology and their undesirable consequences for psychological theory. At the same time, the use of psychometric modelling for human resource decision-making has yet to be evaluated.
{"title":"Is Psychometrics So Useful for Academic Psychology?","authors":"Юлия Тюменева","doi":"10.17323/vo-2023-16781","DOIUrl":"https://doi.org/10.17323/vo-2023-16781","url":null,"abstract":"Psychological theories regarding ability and personality traits often rely on the results of psychometric modelling. The latter is assumed to link responses to test items to an unobserved 'construct' (trait, ability), which is 'modelled' from the test data. However, does the agreement between the data and the model indicate that the model represents a psychological construct? To what extent is ‘psychometric modelling’ modelling in the general scientific sense of the term? The validity of using modelling data to understand psychological phenomena depends on the answer to these questions. The article analyses the logic of psychometric modelling in comparison with modelling in other sciences and argues that psychological phenomena as a subject of modelling are not involved neither in the construction nor in the correction of models. The problem of unjustified interpretations of modelling results in psychology and their undesirable consequences for psychological theory is raised. At the same time, the use of psychometric modelling for human resource decision-making is still waiting for its evaluation.","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The article considers several issues in the relationship between cognitive psychology and psychometrics. Cognitive psychology has developed mainly within the experimental paradigm in psychology, whereas psychometrics has developed within a different paradigm, that of assessing individual differences and of correlational studies. The article reviews the history of the relationship between experimental research and psychometrics from the end of the 19th century to the present. This historical view helps explain the problems in using experimental tasks to assess individual differences, as well as the obstacles to the widespread use of psychometric models in experimental studies. Several recommendations are proposed, from a psychometric perspective, to improve the accuracy of measurement of individual differences in cognitive abilities and processes.
{"title":"Psychometrics and Cognitive Research: Contradictions and Possibility for Cooperation","authors":"Юлия Кузьмина","doi":"10.17323/vo-2023-16875","DOIUrl":"https://doi.org/10.17323/vo-2023-16875","url":null,"abstract":"The article considered several issues of the relationships between cognitive psychology and psychometrics. Cognitive psychology has mainly developed within experimental paradigm in psychology, whereas psychometrics has developed within a different paradigm – assessment of individual differences and correlational studies. In the article it has been considered a brief history of the development of relationships between experimental studies and psychometrics, from the end of 19th century to the present. The historical view allows understanding problems in the use of experimental tasks for assessing individual differences and obstacles to the widespread of use psychometric models in experimental studies. Several recommendations are proposed to improve the accuracy of measurements of individual differences in cognitive abilities and processes, from psychometric perspectives. 
","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ability to cope with the challenges of cultural diversity is one of the key competencies that students need to develop in the contemporary world. This makes the question of intercultural learning, and of students' attitudes towards it, especially relevant. The study aimed to develop and validate an instrument measuring students' attitudes towards intercultural learning. Taking into account the "sources" and processes of intercultural learning available to students, as well as the structure of attitudes, an instrument consisting of four scales was created; each scale covers three aspects of attitudes: cognitive, affective, and behavioral. The reliability and validity of each scale of the instrument were confirmed on a sample of 399 students of Russian universities using statistical procedures such as exploratory and confirmatory factor analysis, internal-consistency analysis with Cronbach's alpha and McDonald's omega, the average variance extracted, and the heterotrait-monotrait ratio. It seems promising to examine various aspects of intercultural learning in further research, adapting and using the scales not only in the higher education system but also in other contexts and age groups.
{"title":"Approbation of the Scale of Attitudes towards Intercultural Learning in Higher Education","authors":"Мария Бульцева, Соня Алехандра Берриос Кальехас","doi":"10.17323/vo-2023-16819","DOIUrl":"https://doi.org/10.17323/vo-2023-16819","url":null,"abstract":"The ability to cope with the challenges of cultural diversity is one of the key competencies that students need to develop in the contemporary world. This actualizes the question of intercultural learning and students' attitudes towards it. The study aimed at development and validation of the instrument to measure students’ attitudes towards intercultural learning. Taking into account the “sources” and processes of intercultural learning available to students, as well as the structure of the attitude, an instrument consisting of four scales were created - each of them includes three aspects of the attitudes: cognitive, affective and behavioral ones. The reliability and validity of each of the instrument were confirmed on a sample of 399 students of Russian universities using statistical procedures such as exploratory and confirmatory factor analysis, analysis of the consistency of the scales according to Cronbach's alpha and McDonald’s omega, analysis of extracted mean variance and heterotrait-monotrait ratio. It seems promising to consider various aspects of intercultural learning in further research, adapting and using the scales not only in the higher education system, but also in other contexts and age groups.","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Measuring students’ growth and change is considered one of the main ways for evidence-based development of educational systems. However, it is a non-trivial methodological task, despite the numerous approaches available for its conceptualization and statistical realization. In this article, we describe the main features of measuring students' growth and change using Item Response Theory (IRT) in detail. We then expand this approach to allow for the modeling of cognitive operations with the Linear Logistic Test Model (LLTM). We show that the synthesis of traditional IRT models for measuring growth and change with LLTM significantly enriches the interpretability of ability estimates while preserving the advantages of the traditional approach. To illustrate this approach, we use a set of monitoring tests to measure educational progress in mathematics in secondary school.
{"title":"Measuring Learning Progress Based on Cognitive Operations","authors":"Сергей Тарасов, Ирина Зуева, Денис Федерякин","doi":"10.17323/vo-2023-16902","DOIUrl":"https://doi.org/10.17323/vo-2023-16902","url":null,"abstract":"Measuring students’ growth and change is considered one of the main ways for evidence-based development of educational systems. However, it is a non-trivial methodological task, despite the numerous approaches available for its conceptualization and statistical realization. In this article, we describe the main features of measuring students' growth and change using Item Response Theory (IRT) in detail. We then expand this approach to allow for the modeling of cognitive operations with the Linear Logistic Test Model (LLTM). We show that the synthesis of traditional IRT models for measuring growth and change with LLTM significantly enriches the interpretability of ability estimates while preserving the advantages of the traditional approach. To illustrate this approach, we use a set of monitoring tests to measure educational progress in mathematics in secondary school.","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"43 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
One significant shortcoming of questionnaires is the distortion of scores on the measured constructs associated with social desirability effects. Social desirability poses an even greater threat to the validity of decisions in high-stakes assessment, such as selection for a position. Moreover, the issue of the relationship between different components of social desirability and the most frequently measured personal constructs remains debatable. Using the authors' normative questionnaire of universal competencies as material, the article considers an approach to adjusting the final scores on the measured constructs by means of the developed scales of egoistic and moralistic social desirability. It also discusses the prospect of using statements that are neutral with respect to social desirability or that express the most positive degree of the measured indicators.
The empirical basis of the study is data gathered in a pilot conducted in the spring of 2022, which yielded responses from 579 respondents on 49 measured competencies. The analysis aimed to assess the quality of the developed social desirability scales; each of the universal competency scales was then modeled with a social desirability scale included. The data were analyzed within the framework of structural modeling, namely confirmatory factor analysis (CFA) using bifactor models for each of the measured competencies.
According to the results, using the egoistic social desirability scale as a measure for adjusting factor scores on the competencies shows generally satisfactory psychometric statistics, although the relatively large measurement error is a concern. The paper discusses the advantages and disadvantages of this approach and of other practices most often used to reduce the effects of social desirability in academic and business settings.
{"title":"Experience of Using Bifactor Models to Reduce the Effects of Social Desirability on the Normative Questionnaire of Universal Competencies","authors":"Егор Сагитов, Ирина Брун, Станислав Павлов","doi":"10.17323/vo-2023-16827","DOIUrl":"https://doi.org/10.17323/vo-2023-16827","url":null,"abstract":"One of the significant lack of questionnaires is a scores distortion for the measured constructs, associated with the social desirability effects. An even greater threat to the validity of decisions is social desirability in high-stakes evaluation, such as selection for a position. Moreover the issue of the relationship between different components of social desirability and the most frequently measured personal constructs remains debatable. In the material of the author's normative questionnaire of universal competencies, an approach is considered for making adjustments to the final scores for measured constructs using the developed scales of egoistic and moralistic social desirability. Also discussed the prospect of using statement formulations that are neutral to social desirability or express the most positive degree of measured indicators.
 The empirical basis of this study is data gathered within a pilot conducted in the spring of 2022, during which data were obtained from 579 respondents in 49 measurable competencies. The analysis was aimed at assessing the quality of the developed scales of social desirability and modeling of each of the universal competencies scales was carried out with the inclusion of a scale of social desirability. The data were analyzed in the framework of structural modeling - confirmatory factor analysis (CFA) using bifactor models for each of the measured competencies.
 According to the results of this study, the use of the scale of egoistic social desirability as a measure for adjusting factor scores for the competencies has generally satisfactory psychometric statistics, but there is concern about the relatively large measurement error. The paper discusses the advantages and disadvantages of this approach and other practices that are most often used to reduce the effects of social desirability in the academic and business environment.
","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This book continues a series of books on the methodology of educational testing and assessment written by leading psychometricians and researchers in the field of educational assessment. Computational psychometrics is defined as the combination of computer science methods and psychometric measurement principles for analysing data obtained as a result of testing using technologically advanced test formats. The first part of the book discusses the changes that have occurred in teaching and educational assessment under the influence of digital technologies. The second part provides an overview of computational psychometric methods: from traditional psychometric models to machine learning technologies.
The material in the book can be useful to students and researchers in psychometrics who are involved in the development, design and analysis of learning systems and of measurements that use complex test formats and data. A strength of the book is an electronic supplement containing R or Python code for the methodological chapters.
{"title":"Computational Psychometrics: Near Future or Reality","authors":"Дарья Грачева, Ксения Тарасова","doi":"10.17323/vo-2023-17938","DOIUrl":"https://doi.org/10.17323/vo-2023-17938","url":null,"abstract":"This book continues a series of books on the methodology of educational testing and assessment written by leading psychometricians and researchers in the field of educational assessment. Computational psychometrics is defined as the combination of computer science methods and psychometric measurement principles for analysing data obtained as a result of testing using technologically advanced test formats. The first part of the book discusses the changes that have occurred in teaching and educational assessment under the influence of digital technologies. The second part provides an overview of computational psychometric methods: from traditional psychometric models to machine learning technologies.
 The material in the book can be useful to students and researchers in the field of psychometrics who are involved in the development, design and analysis of learning systems and measurements using complex test formats and data. The strength of the book is an electronic application containing the code of the R or Python programming environment for the methodological chapters.","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
High-quality measurement is a fundamental requirement for research practice in the social sciences. The quality of measurement determines the validity of the interpretations, conclusions and decisions that can be made based on the data it yields. High-quality measurement in the social sciences requires assessment tools as well as data analysis methods that link observed measurement results to theoretical attributes. The scientific basis for their development is provided by psychometrics.
Introducing the special issue of the journal "Voprosy obrazovaniya / Educational Studies Moscow" devoted to psychometrics, the guest editors cover the main milestones in the history of psychometrics, highlight some significant publications, and note the professional institutions and authors who have made valuable contributions to the development of this branch of science. They pay special attention to the history of psychometrics in Russia. Assessing the possibilities, prospects and limitations of psychometrics, the guest editors express their own point of view on its debatable issues, which does not always coincide with the opinions of the authors of the special issue. The issue presents examples of using modern psychometric methods to solve pressing problems in education research, as well as in research at the intersection of education and psychology and of education and various spheres of business. All the authors of the presented articles are united by the desire to improve research practice in the social sciences through truly high-quality measurement.
{"title":"Psychometric Research: Modern Methods and New Opportunities for Education","authors":"Елена Карданова, Алина Иванова","doi":"10.17323/vo-2023-17951","DOIUrl":"https://doi.org/10.17323/vo-2023-17951","url":null,"abstract":"Qualitative measurement is a fundamental requirement for research practice in the social sciences. The quality of measurements determines the validity of the interpretations, conclusions and decisions we can make based on the data obtained from the measurements. Qualitative measurement in the social sciences requires assessment tools as well as data analysis methods to link observed measurement results to theoretical attributes. The scientific basis for their development is provided by psychometrics.
 Preceding the special issue of the journal \"Voprosy obrazovaniya / Educational Studies Moscow\" devoted to psychometrics, the guest editors of this issue cover the main milestones of the history of psychometrics, highlight some significant publications, note the professional institutions and authors who have made their valuable contribution to the development of this branch of science. The authors pay special attention to the history of psychometrics in Russia. Assessing the possibilities, prospects and limitations of psychometrics, the authors express their point of view on the debatable issues of psychometrics, and it does not always coincide with the opinion of the authors of the special issue. This issue presents examples of using modern psychometric methods to solve actual problems in education research, as well as in research at the intersection of education and psychology, education and different spheres of business. All the authors of the presented articles are united by the desire to improve research practice in the social sciences through truly qualitative measurements.
","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The study investigates the decomposition of test item difficulty depending on item characteristics (such as format and the type of text to which the item belongs) and on the reader's actions required to answer the item (searching for information in the text, making simple inferences, making complex inferences, and critically interpreting the text). The sample consisted of fourth-grade elementary school students in Krasnoyarsk who completed the computerized reading literacy test "Progress" in the spring of 2022. The research method is psychometric modeling with the LLTM+e model. The research hypothesis was that decomposing item difficulties would show that the reading actions required to complete the tasks form a hierarchy of difficulty similar to traditional taxonomies (B. Bloom): reading skills aimed at analyzing, synthesizing and interpreting information make tasks more difficult than simple inferences, which in turn make tasks more difficult than actions to find information in the text. The results show that the assignment of items to a group of reader's actions is a significant factor. The size of the effects does not allow us to speak of a strict hierarchy, but, with other attributes controlled, tasks requiring retrieval of explicitly stated information are easier for students than tasks requiring complex inferences or critical understanding of the text.
{"title":"Decomposing Difficulty of Reading Literacy Test Items","authors":"Алина Иванова, Инна Антипкина","doi":"10.17323/vo-2023-16925","DOIUrl":"https://doi.org/10.17323/vo-2023-16925","url":null,"abstract":"The current study investigates the question of test difficulty decomposition depending on the characteristics of items (such as: format, belonging to the type of text to which the item belongs) and the reader's actions required to answer it (search for information in the text, simple conclusions, complex conclusions, critical interpretation of the text). The sample of the study consisted of fourth grade elementary school students in Krasnoyarsk, who completed the computerized test of reading literacy \"Progress\" in the spring of 2022. Research method: psychometric modeling using the LLTM+e model. Research hypothesis: the decomposition of item difficulties will help to prove that the reading actions required to complete the tasks will form a hierarchy of difficulties similar to traditional taxonomies (B. Bloom), that is, reading skills aimed at analyzing, synthesizing, interpreting information will give tasks greater difficulty than simple conclusions, and those, in turn, will make tasks more difficult than the reader's actions to find information in the text. The results show that the assignment of items to the group of reader's actions is a significant factor. The size of the effects does not allow us to speak of a strict hierarchy, but when other attributes are controlled, the tasks for information retrieval in an explicit form are easier for students than the tasks for complex conclusions and for critical understanding of the text.","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"44 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In education, much attention is paid to developing and assessing schoolchildren's universal skills. At the same time, assessing universal skills requires new test formats based on the student's observed actions in a digital environment. Scenario-based contextual tasks are a promising format. However, the contextual diversity of such tasks can make it difficult to compare results obtained from different scenario tasks. This article analyzes the role of scenario-task context in measuring two universal skills: critical thinking and communication. The work uses the methods of Generalizability Theory, which make it possible to analyze the extent to which results can be generalized to other scenario-task contexts and how, by changing the number of indicators or scenario contexts, satisfactory measurement reliability can be ensured. The study is based on data from fourth-grade students who were tested with various scenario-based tasks of the "4K" instrument. The results showed that test-takers' behavior differs across scenarios with different contexts, while the difficulties of the contexts are almost the same. To achieve satisfactory reliability, it is recommended to use at least two scenarios with different contexts; using three or more scenarios with different contexts makes it possible to reduce the number of indicators without loss of reliability. The study also evaluated the role of context when alternative forms of scenario-based tasks were used. The alternative forms shared the main problem and plot of the scenario but differed in topic (content). Changing only the content of the scenario makes it possible to generalize results across scenario forms, that is, alternative forms can be used interchangeably. The study demonstrates how Generalizability Theory can be used to optimize task development while taking measurement reliability requirements into account.
{"title":"The Role of Context in Scenario-Based Tasks for Measuring Universal Skills: The Use of Generalizability Theory","authors":"Дарья Грачева","doi":"10.17323/vo-2023-16901","DOIUrl":"https://doi.org/10.17323/vo-2023-16901","url":null,"abstract":"In education, much attention is paid to the development and evaluation of universal skills in schoolchildren. At the same time, the assessment of universal skills requires new test formats based on the observed actions of the student in the digital environment. Scenario-based contextual tasks serve as a promising format. However, the contextual diversity of such tasks can make it difficult to compare results obtained from different scenario tasks. This article aims to analyze the role of scenario task context in measuring two universal skills: critical thinking and communication. The work uses the methods of Generalizability Theory, which allows to analyze to what extent the results can be generalized for other contexts of scenario tasks, and how, by changing the number of indicators or scenario contexts, to ensure satisfied measurement reliability. The study is based on data from fourth-grade students who were tested with various scenario-based tasks of the “4K” instrument. The results of the analysis showed that the behavior of the test-takers differs in scenarios with different contexts, while the difficulties of the contexts are almost the same. To achieve satisfactory reliability, it is recommended to use at least two scenarios with different contexts, and the use of three or more scenarios with different contexts will reduce the number of indicators without loss of reliability. Also, the study evaluated the role of context when using alternative scenario-based tasks forms were used. The alternative forms were similar in the main problem and plot of the scenario, but differed in topic (content). Changing only the content of the scenario makes it possible to generalize the results across scenario forms, that is, alternative forms can be used interchangeably. This study demonstrates how Generalization Theory can be used to optimize the development of tasks, taking into account the requirements for measurement reliability.
 
","PeriodicalId":54119,"journal":{"name":"Voprosy Obrazovaniya-Educational Studies Moscow","volume":"43 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}