{"title":"在模拟测试环境中开发错误路径:如何指导评估专业人员","authors":"Sean P. Gyll","doi":"10.1002/cbe2.1198","DOIUrl":null,"url":null,"abstract":"<p>In competency-based education (CBE), students move through their degree programs one course at a time by demonstrating mastery through a variety of content and professional domains. Institutions of higher learning are beginning to realize the importance of real-world, performance-based assessments like computer simulations, where demonstrations of competence are key to success. The greatest advantage of this mode of testing is the real-world relevance that can be incorporated into the assessment, and technological innovations continue to expand these opportunities. However, without a standard-based approach, computer simulation examinations will remain challenging for those institutions.</p><p>Central to the development of computer simulations is the need for high-fidelity assessments of learner aptitudes and competences that include a significant emphasis on not only cognitive abilities (knowing what) but also the performance of those abilities (demonstrating how). As students work within an individual domain of knowledge and skill, they are assessed on one or more topics, each consisting of a series of competencies with associated test objectives and performance tasks (Gyll & Ragland, <span>2018</span>). As the popularity of CBE programs continues to grow, its integrity will be scrutinized by students and employers alike; and its credibility is largely dependent upon the quality of the assessments that are used (McClarty & Gaertner, <span>2015</span>).</p><p>A simulation is a mock representation of a real-world process or system that represents the key characteristics of the system, such as its behavior, functions, and physical properties. A simulation is like a case study (the examination of a problem used to illustrate a thesis or principle), but with the participants on the inside, not on the outside. Simulations are used across many disciplines. For example, stochastic (probabilistic) simulations are used when some variable or process is estimated based on a statistical technique, such as can be used to show the eventual real effects or forecasted conditions of weather patterns. The military uses for simulation can involve the use of aircraft simulators for training pilots or simulated armored fighting vehicles that imitate real-world combat conditions. Similarly, simulations are an essential part of today's medical education, as many universities have simulation centers (or skill laboratories), where students can practice diagnostics and procedures on life-like mannequins (Roterman-Konieczna, <span>2015</span>).</p><p>In many instances, the differences between computer simulations and live environments seem trivial. In fact, many computer simulations have gotten so sophisticated, that, if optimized, they could serve as the actual workstation (i.e., a computer system designed for individual use). In some instances, however, it may neither be possible nor desirable to simulate the full functionality of an application. For example, when development costs increase in relationship to the number of simulation paths. 
Although there are a number of benefits to computer simulation testing including reduced development time and administration costs, less sophisticated scoring rubrics, and increased stability over testing occasions, several concerns remain unanswered.</p><p>The purpose of this paper was to discuss one of the primary dilemmas impeding the development of high-fidelity computer simulation examinations; chiefly, the determination of the appropriate number of errant paths that render a computer simulation examination valid. Here, validity refers to the type of external validity or generalizability that Mitchell and Jolley (<span>2001</span>) describe as the conclusions of a scientific study outside the context of that study. In other words, the extent to which the results of the computer simulation will generalize to and across real-world applications. This paper briefly explores the history of simulations as examinations and differentiates between low versus high-fidelity assessments in a simulation environment. End-user navigation requirements and its relationship to developing the appropriate number of errant paths within a computer simulation is also explored. Finally, several tools and templates are provided to aid assessment professionals in the development process.1</p><p>Since the early 2000s, there have been attempts to evaluate various aspects of human performance using simulation technology. Most of these studies focused on the cognitive, perceptual, and motor performance of end users, and paid little attention to simulation fidelity and its relationship to errant paths; that is, the look, feel, and functionality of a computer simulation and how end users navigate within the simulation environment (Nash, Edwards, Thompson, & Barfield, <span>2000</span>). For the purpose of this paper, the definition of an errant path is operationalized as “any pathway leading an end user further away from a correct response in a computer simulation environment.”</p><p>Computer simulations are now reasonably well-established in the certification and educational fields as an assessment tool for learning. The technique fits well into educational philosophies that stress the importance of the learner as an interactive participant in the learning process. Some useful examples of how computer simulation examinations are being used at one competency-based institution include CompTIA's performance-based assessments (A<sup>+</sup>, Network<sup>+</sup>, Security<sup>+</sup>, Cloud<sup>+</sup>, and Cybersecurity Analyst<sup>+</sup>), which assess candidates' ability to solve problems in a simulated environment.</p><p>The simulation examination is often described by misleading labels, such as game, case study, or exercise. Perhaps the most useful approach, therefore, is to put forward a definition and description of the computer simulation followed by some examples of how and why it is used for assessment purposes both inside and outside of education. It is important to note that a simulated environment is very different from a virtual reality (VR) one. In VR environments, users react to certain simulated conditions within the environment (e.g., aircraft simulators), but the environment is completely imaginative and false. This “false” environment is created to appear very real, yet users can distinguish between reality and imagination (Keshav, <span>2017</span>). In order for it to be a simulation, there can be no real interaction with a genuine environment. 
An information technology learner demonstrating the functionality of a software application through Power Point is not participating in a simulation. To be a computer simulation, the relevant functions of the software environment must be presented adequately through the computer equipment. Without reality of function, the activity is no more than an exercise or Power Point presentation.</p><p>Reality of function includes the acceptance of the relevant duties and responsibilities of the simulation. For example, if a simulation is about the work of news editors on social media then the materials will provide news items for the participants to work on within a social media platform. However, if a teacher tells the class, “pretend you are news editors on social media and invent your own news items,” then there would be no reality of function; the participants would be authors and inventors, not editors.</p><p>Developing high-fidelity simulations involves the understanding of navigational awareness and how learners acquire knowledge. Knowledge acquisition is typically measured at the task level (i.e., what the user is doing within an environment) by assessing speed and accuracy in participant response. As a rule of thumb, as knowledge increases both speed and accuracy also increase (Fitts, <span>1964</span>). The pre-eminent method for assessing speed/accuracy performance, especially within highly complex tasks involving multiple decision paths, is click-stream analysis. Within this method of click-stream analysis, end-user response patterns are analyzed by measuring post hoc data from frequently travelled paths and locations to which the participant frequently returned. Since these methods often prove costly and time consuming, techniques that utilize qualitative procedures like training situation analysis (TSA) and subject matter expert (SME) judgments represent an alternative approach. The remaining sections of this paper focus on this TSA and SME approach.</p><p>Although not specifically designed to assess navigational tracking and errant paths in computer simulation testing, a technique that has proven useful and generalizable to errant path identification is TSA. TSA includes a systematic process for identifying tasks that comprise a job, describing in detail the operational components of each of those tasks, ranking the importance in cost-benefit terms of each task component, and describing the functional training requirements of the tasks (Jonassen, Tessmer, & Hannum, <span>1989</span>).</p><p>Training situation analysis describes in detail the human performance for which systems design should be developed. These techniques were generalized to include test development processes performed by SMEs during the item writing and/or product task analysis phases. The following details the steps required to develop a computer simulation examination using a typical office product graphical user interface, comprised of menu, dropdown, and command prompt options. Appendices A and B include useful templates to aid in the development process.</p><p>Computer simulations as examinations represent a much-needed effort to move beyond the shortcomings of today's form-based assessments and an opportunity to reengineer the way in which we measure performance. Within computer simulation examinations, we assess for competency and problem-solving skills versus the content memorization typically supported by multiple-choice assessments. 
Certainly, competency-based institutions are looking for ways to add value to their credentials and all are unanimous in recognizing that asking test takers to do something is far more indicative of skills and ability than asking them to remember and recall something.</p><p>For any assessment program, construct validity is the paramount component of validity evidence. Put into context, this means that in order to support the inferences drawn from simulation examination scores, standard-based principles that focus on high fidelity must be maintained throughout the development life cycle. In this paper, several useful prototypes to aid assessment professionals in the development of computer simulation examinations are provided. Computer simulation examinations offer several benefits over traditional methods including reduced development time and administration costs, less sophisticated scoring rubrics, and increased stability over testing occasions.</p><p>An even more critical advantage of computer simulation examinations rests in its ability to assess higher-order cognitive skill within a simulation environment. Today, access to some kind of hands-on encounter with systems and software has become an essential part of the learning experience, especially within CBE programs where demonstrations of competence are required for success. Be it in the context of a diagnostic pre-assessment or simply while preparing to tackle examination objectives where test takers must go interactive, use of computer simulations as examinations are now both expected and routine.</p><p>Within CBE, a necessary part of taking responsibility for students' development is the ability to identify what standards should appropriately apply. Unless the expectation is set that raising questions about appropriate standards is a normal part of approaching any learning task, this is unlikely to be sufficiently developed. As discussed in this paper, it appears that the range, functionality, power, and utility of computer simulation examinations will continue to expand into the foreseeable future, especially as the popularity and legitimacy of CBE programs continue to grow. However, without an established set of standards, this loose structure provides little room for a consensus to emerge as to which method provides the best measure of validity evidence within a computer simulation environment.</p><p>No conflicts declared.</p>","PeriodicalId":101234,"journal":{"name":"The Journal of Competency-Based Education","volume":"4 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cbe2.1198","citationCount":"0","resultStr":"{\"title\":\"Developing errant paths in a simulation testing environment: A how to guide for assessment professionals\",\"authors\":\"Sean P. Gyll\",\"doi\":\"10.1002/cbe2.1198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In competency-based education (CBE), students move through their degree programs one course at a time by demonstrating mastery through a variety of content and professional domains. Institutions of higher learning are beginning to realize the importance of real-world, performance-based assessments like computer simulations, where demonstrations of competence are key to success. 
The greatest advantage of this mode of testing is the real-world relevance that can be incorporated into the assessment, and technological innovations continue to expand these opportunities. However, without a standard-based approach, computer simulation examinations will remain challenging for those institutions.</p><p>Central to the development of computer simulations is the need for high-fidelity assessments of learner aptitudes and competences that include a significant emphasis on not only cognitive abilities (knowing what) but also the performance of those abilities (demonstrating how). As students work within an individual domain of knowledge and skill, they are assessed on one or more topics, each consisting of a series of competencies with associated test objectives and performance tasks (Gyll & Ragland, <span>2018</span>). As the popularity of CBE programs continues to grow, its integrity will be scrutinized by students and employers alike; and its credibility is largely dependent upon the quality of the assessments that are used (McClarty & Gaertner, <span>2015</span>).</p><p>A simulation is a mock representation of a real-world process or system that represents the key characteristics of the system, such as its behavior, functions, and physical properties. A simulation is like a case study (the examination of a problem used to illustrate a thesis or principle), but with the participants on the inside, not on the outside. Simulations are used across many disciplines. For example, stochastic (probabilistic) simulations are used when some variable or process is estimated based on a statistical technique, such as can be used to show the eventual real effects or forecasted conditions of weather patterns. The military uses for simulation can involve the use of aircraft simulators for training pilots or simulated armored fighting vehicles that imitate real-world combat conditions. Similarly, simulations are an essential part of today's medical education, as many universities have simulation centers (or skill laboratories), where students can practice diagnostics and procedures on life-like mannequins (Roterman-Konieczna, <span>2015</span>).</p><p>In many instances, the differences between computer simulations and live environments seem trivial. In fact, many computer simulations have gotten so sophisticated, that, if optimized, they could serve as the actual workstation (i.e., a computer system designed for individual use). In some instances, however, it may neither be possible nor desirable to simulate the full functionality of an application. For example, when development costs increase in relationship to the number of simulation paths. Although there are a number of benefits to computer simulation testing including reduced development time and administration costs, less sophisticated scoring rubrics, and increased stability over testing occasions, several concerns remain unanswered.</p><p>The purpose of this paper was to discuss one of the primary dilemmas impeding the development of high-fidelity computer simulation examinations; chiefly, the determination of the appropriate number of errant paths that render a computer simulation examination valid. Here, validity refers to the type of external validity or generalizability that Mitchell and Jolley (<span>2001</span>) describe as the conclusions of a scientific study outside the context of that study. In other words, the extent to which the results of the computer simulation will generalize to and across real-world applications. 
This paper briefly explores the history of simulations as examinations and differentiates between low versus high-fidelity assessments in a simulation environment. End-user navigation requirements and its relationship to developing the appropriate number of errant paths within a computer simulation is also explored. Finally, several tools and templates are provided to aid assessment professionals in the development process.1</p><p>Since the early 2000s, there have been attempts to evaluate various aspects of human performance using simulation technology. Most of these studies focused on the cognitive, perceptual, and motor performance of end users, and paid little attention to simulation fidelity and its relationship to errant paths; that is, the look, feel, and functionality of a computer simulation and how end users navigate within the simulation environment (Nash, Edwards, Thompson, & Barfield, <span>2000</span>). For the purpose of this paper, the definition of an errant path is operationalized as “any pathway leading an end user further away from a correct response in a computer simulation environment.”</p><p>Computer simulations are now reasonably well-established in the certification and educational fields as an assessment tool for learning. The technique fits well into educational philosophies that stress the importance of the learner as an interactive participant in the learning process. Some useful examples of how computer simulation examinations are being used at one competency-based institution include CompTIA's performance-based assessments (A<sup>+</sup>, Network<sup>+</sup>, Security<sup>+</sup>, Cloud<sup>+</sup>, and Cybersecurity Analyst<sup>+</sup>), which assess candidates' ability to solve problems in a simulated environment.</p><p>The simulation examination is often described by misleading labels, such as game, case study, or exercise. Perhaps the most useful approach, therefore, is to put forward a definition and description of the computer simulation followed by some examples of how and why it is used for assessment purposes both inside and outside of education. It is important to note that a simulated environment is very different from a virtual reality (VR) one. In VR environments, users react to certain simulated conditions within the environment (e.g., aircraft simulators), but the environment is completely imaginative and false. This “false” environment is created to appear very real, yet users can distinguish between reality and imagination (Keshav, <span>2017</span>). In order for it to be a simulation, there can be no real interaction with a genuine environment. An information technology learner demonstrating the functionality of a software application through Power Point is not participating in a simulation. To be a computer simulation, the relevant functions of the software environment must be presented adequately through the computer equipment. Without reality of function, the activity is no more than an exercise or Power Point presentation.</p><p>Reality of function includes the acceptance of the relevant duties and responsibilities of the simulation. For example, if a simulation is about the work of news editors on social media then the materials will provide news items for the participants to work on within a social media platform. 
However, if a teacher tells the class, “pretend you are news editors on social media and invent your own news items,” then there would be no reality of function; the participants would be authors and inventors, not editors.</p><p>Developing high-fidelity simulations involves the understanding of navigational awareness and how learners acquire knowledge. Knowledge acquisition is typically measured at the task level (i.e., what the user is doing within an environment) by assessing speed and accuracy in participant response. As a rule of thumb, as knowledge increases both speed and accuracy also increase (Fitts, <span>1964</span>). The pre-eminent method for assessing speed/accuracy performance, especially within highly complex tasks involving multiple decision paths, is click-stream analysis. Within this method of click-stream analysis, end-user response patterns are analyzed by measuring post hoc data from frequently travelled paths and locations to which the participant frequently returned. Since these methods often prove costly and time consuming, techniques that utilize qualitative procedures like training situation analysis (TSA) and subject matter expert (SME) judgments represent an alternative approach. The remaining sections of this paper focus on this TSA and SME approach.</p><p>Although not specifically designed to assess navigational tracking and errant paths in computer simulation testing, a technique that has proven useful and generalizable to errant path identification is TSA. TSA includes a systematic process for identifying tasks that comprise a job, describing in detail the operational components of each of those tasks, ranking the importance in cost-benefit terms of each task component, and describing the functional training requirements of the tasks (Jonassen, Tessmer, & Hannum, <span>1989</span>).</p><p>Training situation analysis describes in detail the human performance for which systems design should be developed. These techniques were generalized to include test development processes performed by SMEs during the item writing and/or product task analysis phases. The following details the steps required to develop a computer simulation examination using a typical office product graphical user interface, comprised of menu, dropdown, and command prompt options. Appendices A and B include useful templates to aid in the development process.</p><p>Computer simulations as examinations represent a much-needed effort to move beyond the shortcomings of today's form-based assessments and an opportunity to reengineer the way in which we measure performance. Within computer simulation examinations, we assess for competency and problem-solving skills versus the content memorization typically supported by multiple-choice assessments. Certainly, competency-based institutions are looking for ways to add value to their credentials and all are unanimous in recognizing that asking test takers to do something is far more indicative of skills and ability than asking them to remember and recall something.</p><p>For any assessment program, construct validity is the paramount component of validity evidence. Put into context, this means that in order to support the inferences drawn from simulation examination scores, standard-based principles that focus on high fidelity must be maintained throughout the development life cycle. In this paper, several useful prototypes to aid assessment professionals in the development of computer simulation examinations are provided. 
Computer simulation examinations offer several benefits over traditional methods including reduced development time and administration costs, less sophisticated scoring rubrics, and increased stability over testing occasions.</p><p>An even more critical advantage of computer simulation examinations rests in its ability to assess higher-order cognitive skill within a simulation environment. Today, access to some kind of hands-on encounter with systems and software has become an essential part of the learning experience, especially within CBE programs where demonstrations of competence are required for success. Be it in the context of a diagnostic pre-assessment or simply while preparing to tackle examination objectives where test takers must go interactive, use of computer simulations as examinations are now both expected and routine.</p><p>Within CBE, a necessary part of taking responsibility for students' development is the ability to identify what standards should appropriately apply. Unless the expectation is set that raising questions about appropriate standards is a normal part of approaching any learning task, this is unlikely to be sufficiently developed. As discussed in this paper, it appears that the range, functionality, power, and utility of computer simulation examinations will continue to expand into the foreseeable future, especially as the popularity and legitimacy of CBE programs continue to grow. However, without an established set of standards, this loose structure provides little room for a consensus to emerge as to which method provides the best measure of validity evidence within a computer simulation environment.</p><p>No conflicts declared.</p>\",\"PeriodicalId\":101234,\"journal\":{\"name\":\"The Journal of Competency-Based Education\",\"volume\":\"4 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1002/cbe2.1198\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Competency-Based Education\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cbe2.1198\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Competency-Based Education","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cbe2.1198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Developing errant paths in a simulation testing environment: A how-to guide for assessment professionals
In competency-based education (CBE), students move through their degree programs one course at a time by demonstrating mastery across a variety of content and professional domains. Institutions of higher learning are beginning to realize the importance of real-world, performance-based assessments like computer simulations, where demonstrations of competence are key to success. The greatest advantage of this mode of testing is the real-world relevance that can be incorporated into the assessment, and technological innovations continue to expand these opportunities. However, without a standards-based approach, computer simulation examinations will remain challenging for those institutions.
Central to the development of computer simulations is the need for high-fidelity assessments of learner aptitudes and competencies that emphasize not only cognitive abilities (knowing what) but also the performance of those abilities (demonstrating how). As students work within an individual domain of knowledge and skill, they are assessed on one or more topics, each consisting of a series of competencies with associated test objectives and performance tasks (Gyll & Ragland, 2018). As the popularity of CBE programs continues to grow, their integrity will be scrutinized by students and employers alike, and their credibility is largely dependent upon the quality of the assessments that are used (McClarty & Gaertner, 2015).
A simulation is a mock representation of a real-world process or system that captures the system's key characteristics, such as its behavior, functions, and physical properties. A simulation is like a case study (the examination of a problem used to illustrate a thesis or principle), but with the participants on the inside, not the outside. Simulations are used across many disciplines. For example, stochastic (probabilistic) simulations are used when a variable or process is estimated with a statistical technique, such as when forecasting the likely effects of weather patterns. Military uses of simulation include aircraft simulators for training pilots and simulated armored fighting vehicles that imitate real-world combat conditions. Similarly, simulations are an essential part of today's medical education: many universities have simulation centers (or skills laboratories) where students can practice diagnostics and procedures on life-like mannequins (Roterman-Konieczna, 2015).
In many instances, the differences between computer simulations and live environments seem trivial. In fact, many computer simulations have become so sophisticated that, if optimized, they could serve as the actual workstation (i.e., a computer system designed for individual use). In some instances, however, it may be neither possible nor desirable to simulate the full functionality of an application; development costs, for example, rise with the number of simulation paths, because every additional decision point multiplies the number of distinct paths that must be authored and scored. Although computer simulation testing offers a number of benefits, including reduced development time and administration costs, less sophisticated scoring rubrics, and increased stability across testing occasions, several concerns remain unresolved.
The purpose of this paper is to discuss one of the primary dilemmas impeding the development of high-fidelity computer simulation examinations: chiefly, determining the appropriate number of errant paths needed to render a computer simulation examination valid. Here, validity refers to the kind of external validity or generalizability that Mitchell and Jolley (2001) describe as the extent to which the conclusions of a scientific study hold outside the context of that study; in other words, the extent to which the results of the computer simulation will generalize to and across real-world applications. This paper briefly explores the history of simulations as examinations and differentiates between low- and high-fidelity assessments in a simulation environment. End-user navigation requirements and their relationship to developing the appropriate number of errant paths within a computer simulation are also explored. Finally, several tools and templates are provided to aid assessment professionals in the development process.
Since the early 2000s, there have been attempts to evaluate various aspects of human performance using simulation technology. Most of these studies focused on the cognitive, perceptual, and motor performance of end users and paid little attention to simulation fidelity and its relationship to errant paths; that is, the look, feel, and functionality of a computer simulation and how end users navigate within the simulation environment (Nash, Edwards, Thompson, & Barfield, 2000). For the purposes of this paper, an errant path is operationalized as “any pathway leading an end user further away from a correct response in a computer simulation environment.”
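To make this operational definition concrete, the following is a minimal sketch, not drawn from the article, of one way an assessment developer might encode a simulation task as a directed graph of interface states: transitions that SMEs have marked as lying on an accepted solution route are flagged as correct, and every other transition the interface allows is counted as errant. The task, state names, and API are illustrative assumptions.

```python
# A minimal sketch (not from the article) of one way to operationalize the
# errant-path definition: model the simulation as a directed graph of
# interface states, mark the transitions that sit on an accepted solution
# route, and treat every other reachable transition as errant.

from collections import defaultdict

class SimulationTask:
    def __init__(self):
        self.transitions = defaultdict(set)   # state -> {next states the UI allows}
        self.correct_edges = set()             # (state, next_state) on an accepted route

    def allow(self, state, next_state, correct=False):
        self.transitions[state].add(next_state)
        if correct:
            self.correct_edges.add((state, next_state))

    def errant_edges(self):
        """Every allowed transition that moves the user off an accepted route."""
        return {(s, n) for s, nexts in self.transitions.items()
                for n in nexts if (s, n) not in self.correct_edges}

# Hypothetical "save as PDF" task in an office-style interface.
task = SimulationTask()
task.allow("start", "File menu", correct=True)
task.allow("File menu", "Export dialog", correct=True)
task.allow("Export dialog", "PDF saved", correct=True)
task.allow("start", "Insert menu")          # plausible but errant first click
task.allow("File menu", "Print dialog")     # errant branch inside the menu

print(task.errant_edges())
# e.g., {('start', 'Insert menu'), ('File menu', 'Print dialog')}
```

Enumerating the errant edges in this way gives developers an explicit inventory from which to decide how many errant paths the examination should actually support.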
Computer simulations are now reasonably well established in the certification and educational fields as an assessment tool for learning. The technique fits well with educational philosophies that stress the importance of the learner as an interactive participant in the learning process. Useful examples of how computer simulation examinations are being used at one competency-based institution include CompTIA's performance-based assessments (A+, Network+, Security+, Cloud+, and Cybersecurity Analyst+), which assess candidates' ability to solve problems in a simulated environment.
The simulation examination is often described by misleading labels, such as game, case study, or exercise. Perhaps the most useful approach, therefore, is to put forward a definition and description of the computer simulation, followed by examples of how and why it is used for assessment purposes both inside and outside of education. It is important to note that a simulated environment is very different from a virtual reality (VR) one. In VR environments, users react to certain simulated conditions within the environment (e.g., aircraft simulators), but the environment is entirely artificial. This “false” environment is created to appear very real, yet users can distinguish between reality and imagination (Keshav, 2017). For an activity to be a simulation, there can be no real interaction with a genuine environment. At the same time, an information technology learner demonstrating the functionality of a software application through a PowerPoint presentation is not participating in a simulation. To be a computer simulation, the relevant functions of the software environment must be presented adequately through the computer equipment. Without reality of function, the activity is no more than an exercise or a PowerPoint presentation.
Reality of function includes the acceptance of the relevant duties and responsibilities of the simulation. For example, if a simulation is about the work of news editors on social media, then the materials will provide news items for the participants to work on within a social media platform. However, if a teacher tells the class, “pretend you are news editors on social media and invent your own news items,” then there is no reality of function; the participants would be authors and inventors, not editors.
Developing high-fidelity simulations involves understanding navigational awareness and how learners acquire knowledge. Knowledge acquisition is typically measured at the task level (i.e., what the user is doing within an environment) by assessing the speed and accuracy of participant responses. As a rule of thumb, as knowledge increases, both speed and accuracy also increase (Fitts, 1964). The pre-eminent method for assessing speed/accuracy performance, especially within highly complex tasks involving multiple decision paths, is click-stream analysis, in which end-user response patterns are analyzed post hoc by measuring frequently travelled paths and the locations to which a participant frequently returns. Because these methods often prove costly and time consuming, techniques that rely on qualitative procedures, such as training situation analysis (TSA) and subject matter expert (SME) judgments, represent an alternative approach. The remaining sections of this paper focus on this TSA and SME approach.
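As a rough illustration of the post hoc click-stream measures described above, the sketch below assumes the simulation writes a log of timestamped screen visits per participant; the log format, field names, and example data are hypothetical.

```python
# A minimal sketch (assumed log format, hypothetical data) of post hoc
# click-stream measures: frequently travelled transitions, screens that
# participants return to, and a crude per-participant speed measure.

from collections import Counter

# Each event: (participant_id, seconds_since_task_start, screen_visited)
log = [
    ("p1", 0.0, "start"), ("p1", 4.2, "File menu"), ("p1", 9.8, "Export dialog"),
    ("p2", 0.0, "start"), ("p2", 3.1, "Insert menu"), ("p2", 7.5, "start"),
    ("p2", 11.0, "File menu"), ("p2", 16.4, "Export dialog"),
]

transitions = Counter()      # frequently travelled paths
revisits = Counter()         # locations participants frequently return to
completion_time = {}         # crude speed measure

for pid in {event[0] for event in log}:
    events = sorted((t, screen) for p, t, screen in log if p == pid)
    seen = set()
    for (t0, a), (t1, b) in zip(events, events[1:]):
        transitions[(a, b)] += 1
        if b in seen:        # returning to a screen already visited
            revisits[b] += 1
        seen.update([a, b])
    completion_time[pid] = events[-1][0] - events[0][0]

print(transitions.most_common(3))
print(revisits.most_common(3))
print(completion_time)
```

Even a small analysis like this makes the cost issue visible: gathering enough attempts to estimate stable path frequencies requires pilot participants and instrumented logging, which is exactly the expense the TSA and SME approach seeks to avoid.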
Although TSA was not specifically designed to assess navigational tracking and errant paths in computer simulation testing, it has proven useful and generalizable to errant path identification. TSA comprises a systematic process for identifying the tasks that make up a job, describing in detail the operational components of each task, ranking each task component's importance in cost-benefit terms, and describing the functional training requirements of the tasks (Jonassen, Tessmer, & Hannum, 1989).
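The following sketch illustrates, under assumed field names and an assumed weighting scheme (it does not reproduce the article's Appendix A or B templates), how an SME might record a TSA breakdown of a job into tasks and operational components and rank each component in simple cost-benefit terms.

```python
# An illustrative sketch (field names and weighting are assumptions, not the
# article's appendix templates) of a TSA record: a job broken into tasks,
# each task into operational components, with a simple cost-benefit ranking.

from dataclasses import dataclass, field

@dataclass
class TaskComponent:
    description: str
    benefit: int                  # SME-judged importance to job performance, 1-5
    cost: int                     # SME-judged development/administration cost, 1-5
    training_requirement: str = ""

    def priority(self) -> float:
        # Higher benefit and lower cost -> higher priority for simulation coverage.
        return self.benefit / self.cost

@dataclass
class Task:
    name: str
    components: list = field(default_factory=list)

job = Task("Configure a small office network", components=[
    TaskComponent("Assign static IP addresses", benefit=5, cost=2,
                  training_requirement="Hands-on practice in the simulator"),
    TaskComponent("Document the cable plan", benefit=2, cost=4,
                  training_requirement="Job aid; not simulated"),
])

for c in sorted(job.components, key=lambda c: c.priority(), reverse=True):
    print(f"{c.priority():.2f}  {c.description}")
```

Ranking components this way lets SMEs decide which parts of the job justify full simulation fidelity (and a richer set of errant paths) and which can be assessed more cheaply.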
Training situation analysis describes in detail the human performance for which systems design should be developed. These techniques were generalized to include the test development processes performed by SMEs during the item-writing and/or product task analysis phases. The following details the steps required to develop a computer simulation examination using a typical office-product graphical user interface composed of menu, dropdown, and command prompt options. Appendices A and B include useful templates to aid in the development process.
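Because the step-by-step procedure and the appendix templates themselves are not reproduced in this excerpt, the sketch below is only a hypothetical illustration of the end point of that process: once SMEs have enumerated the accepted menu, dropdown, and command prompt routes for a task, a recorded attempt can be replayed against them and any action falling outside every accepted route counted as an errant step. The task, routes, and penalty scheme are assumptions.

```python
# A hypothetical illustration (the task, routes, and penalty scheme are assumptions)
# of scoring one office-GUI task once SMEs have enumerated its accepted routes.
# A candidate may reach the objective via the menu, a dropdown, or the command
# prompt; any recorded action outside every accepted route counts as an errant step.

ACCEPTED_ROUTES = {
    "menu":     ["File menu", "Export", "PDF", "Save"],
    "dropdown": ["Quick actions dropdown", "Export as PDF", "Save"],
    "prompt":   ["Command prompt", "export --format pdf", "Save"],
}

def score(actions, objective_reached):
    """Return (score 0-1, errant step count) for one recorded attempt."""
    accepted_steps = {step for route in ACCEPTED_ROUTES.values() for step in route}
    errant = [a for a in actions if a not in accepted_steps]
    if not objective_reached:
        return 0.0, len(errant)
    # Deduct a fixed penalty per errant step, floored at zero (illustrative only).
    return max(0.0, 1.0 - 0.2 * len(errant)), len(errant)

attempt = ["File menu", "Print", "File menu", "Export", "PDF", "Save"]
print(score(attempt, objective_reached=True))   # -> (0.8, 1): "Print" was errant
```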
Computer simulations as examinations represent a much-needed effort to move beyond the shortcomings of today's form-based assessments and an opportunity to reengineer the way in which we measure performance. Within computer simulation examinations, we assess competency and problem-solving skills rather than the content memorization typically supported by multiple-choice assessments. Certainly, competency-based institutions are looking for ways to add value to their credentials, and they are unanimous in recognizing that asking test takers to do something is far more indicative of skill and ability than asking them to remember and recall something.
For any assessment program, construct validity is the paramount component of validity evidence. Put into context, this means that in order to support the inferences drawn from simulation examination scores, standards-based principles that focus on high fidelity must be maintained throughout the development life cycle. This paper provides several useful prototypes to aid assessment professionals in the development of computer simulation examinations. Computer simulation examinations offer several benefits over traditional methods, including reduced development time and administration costs, less sophisticated scoring rubrics, and increased stability across testing occasions.
An even more critical advantage of computer simulation examinations rests in their ability to assess higher-order cognitive skills within a simulation environment. Today, some kind of hands-on encounter with systems and software has become an essential part of the learning experience, especially within CBE programs where demonstrations of competence are required for success. Be it in the context of a diagnostic pre-assessment or simply in preparing to tackle examination objectives where test takers must go interactive, the use of computer simulations as examinations is now both expected and routine.
Within CBE, a necessary part of taking responsibility for students' development is the ability to identify which standards should appropriately apply. Unless the expectation is set that raising questions about appropriate standards is a normal part of approaching any learning task, this ability is unlikely to be sufficiently developed. As discussed in this paper, the range, functionality, power, and utility of computer simulation examinations appear likely to continue expanding into the foreseeable future, especially as the popularity and legitimacy of CBE programs continue to grow. However, without an established set of standards, this loose structure provides little room for a consensus to emerge as to which method provides the best measure of validity evidence within a computer simulation environment.