Developing a New Task to Measure Speech Perception Ability: Is the Word Count Task Valid and Reliable?
DOI: 10.20622/jltajournal.23.0_17
Toshihide O’ki
To become an advanced second language listener, a learner needs good speech perception ability. Previous research on measuring this ability often used integrated-skills tasks (e.g., repetition tasks and dictation tasks), but their validity and reliability are questionable because learners' productive skills affect their task performance. This study attempted to develop an original discrete-point task, the word count task, in which learners count and report the number of words in blanks. To evaluate the task's validity and reliability, two comparable studies with dictation tasks were conducted with university students in Japan. The second study, revised based on the first, found that the reliability coefficient of the word count task, expressed by Cronbach's α, reached .85, slightly exceeding the reliability of the dictation tasks. Moreover, correlations with the dictation tasks were significantly positive, with moderate to strong relationships, indicating that the word count task has sufficient criterion-related validity. In addition, a listening strategy survey conducted to explore the cognitive processes involved in the task showed that phonological processing is more dominant than meaning processing in the word count task. These findings seem to corroborate the applicability of the word count task to research and classroom assessment, but further research is necessary to reevaluate its validity using the other methods mentioned in this study. The first study attempted to examine the validity and reliability of a prototype of the word count task (WCT) and to reveal how the task should be revised. As explained, there is no established task for measuring speech perception ability; therefore, the validity of the WCT was tested through its relationship with a dictation task (DT). As already stated, DTs have shortcomings as a measure of speech perception ability, but given their popularity in educational settings in Japan, they are the best available counterpart to the WCT.
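As a rough illustration of the statistics the abstract reports, the sketch below (Python, with entirely invented data; `wct_items` and `dt_totals` are hypothetical placeholders, not the study's dataset) computes Cronbach's α from an item-score matrix and a criterion correlation between WCT and DT total scores.

```python
import numpy as np

def cronbach_alpha(item_scores: np.ndarray) -> float:
    """item_scores: (n_examinees, n_items) matrix of item-level scores."""
    k = item_scores.shape[1]
    item_var_sum = item_scores.var(axis=0, ddof=1).sum()  # sum of per-item variances
    total_var = item_scores.sum(axis=1).var(ddof=1)       # variance of total scores
    return (k / (k - 1)) * (1 - item_var_sum / total_var)

rng = np.random.default_rng(0)
ability = rng.normal(size=(110, 1))                        # latent listening ability
# Hypothetical dichotomous WCT item responses driven by that ability
wct_items = (ability + rng.normal(size=(110, 30)) > 0).astype(float)
# Hypothetical DT totals correlated with the same underlying ability
dt_totals = wct_items.sum(axis=1) + rng.normal(0, 3, size=110)

print(f"Cronbach's alpha = {cronbach_alpha(wct_items):.2f}")
r = np.corrcoef(wct_items.sum(axis=1), dt_totals)[0, 1]
print(f"WCT-DT criterion correlation r = {r:.2f}")
```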
{"title":"Developing a New Task to Measure Speech Perception Ability: Is the Word Count Task Valid and Reliable?","authors":"Toshihide O’ki","doi":"10.20622/jltajournal.23.0_17","DOIUrl":"https://doi.org/10.20622/jltajournal.23.0_17","url":null,"abstract":"To become an advanced second language listener, a learner needs to have good speech perception ability. Previous research that focused on measuring this ability often utilized integrated-skills tasks (e.g., repetition tasks and dictation tasks), but their validity and reliability are questionable because learners’ productive skills affect their task performance. This study attempted to develop an original discrete-point task called the word count task, in which learners count and report the number of words in blanks. To evaluate the task’s validity and reliability, two comparable studies with dictation tasks were conducted with university students in Japan. The second study, which was revised based on the first study, revealed that the reliability coefficient of the word count task expressed by Cronbach’s (cid:302)(cid:3) reached .85, slightly exceeding the reliability of the dictation tasks. Moreover, correlations with dictation tasks were found to be significantly positive with moderate to strong relationships, meaning the word count task demonstrated sufficient criterion-related validity. Moreover, the listening strategy survey conducted to explore cognitive processes involved in the task showed that phonological processing is more dominant than meaning processing in the word count task. These findings seem to corroborate the applicability of the word count task to research and classroom assessment, but further research is necessary to reevaluate its validity using other methods mentioned in this study. The first study attempted to examine the validity and reliability of a prototype of word count task (WCT) and reveal how the task should be revised. As explained, there is no established task to measure speech perception ability; therefore, the validity of WCT was tested based on its relationship with a dictation task (DT). As already stated, DTs have shortcomings as a measure of speech perception ability, but due to their popularity in educational settings in Japan, using one is the best counterpart to the WCT.","PeriodicalId":249185,"journal":{"name":"JLTA Journal","volume":"306 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122695240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

Evaluating Language Assessments From an Ethics Perspective: Suggestions for a New Agenda
DOI: 10.20622/jltajournal.23.0_3
Antony Kunnan
Introduction
The dominant 20th-century approach to the evaluation of language assessments was the Standards-based approach. The Standards most evaluators referred to are the American Psychological Association (APA), American Educational Research Association (AERA), and National Council on Measurement in Education (NCME) Standards (1999, 2014). These standards (mainly a list of test qualities such as validity and reliability and, of late, consequences and fairness) were developed from best practices at assessment institutions and had loose connections to theories of educational and psychological measurement. The "Test Usefulness" concept proposed by Bachman and Palmer (1996) was a popular example of the Standards approach. In the early part of the 21st century, Kane (1992) and Bachman and Palmer (2010) proposed an Argument-based approach using Toulmin's way of structuring arguments with claims, warrants, backing, and rebuttals. This approach provided a framework for evaluating language assessments; Bachman and Palmer's (2010) "Assessment Use Argument" (AUA) is an example. While both approaches give researchers ways to conduct evaluations, they share a weakness: they generally lack an articulated philosophical grounding. In the Standards approach, it is never articulated why the listed standards, and not others, are important. In the Argument approach, which aspects are to be included as claims and warrants is left to the assessment developer, with the evaluator following the developer's lead, which is a critical problem. To remedy this situation, I propose an Ethics-based approach to assessment evaluation. The framework that implements the approach harnesses the dual concepts of fair assessments and just institutions, leading to the Principle of Fairness and the Principle of Justice, respectively.
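To make the Toulmin structure concrete, here is a minimal illustrative sketch representing one link of an argument (claim, warrants, backing, rebuttals) as a plain data structure. This is not Kunnan's or Bachman and Palmer's formal machinery, and all field values are invented examples.

```python
from dataclasses import dataclass, field

@dataclass
class ToulminLink:
    claim: str                  # what the assessment use argument asserts
    warrants: list[str]         # why the evidence licenses the claim
    backing: list[str] = field(default_factory=list)    # support for the warrants
    rebuttals: list[str] = field(default_factory=list)  # conditions that would defeat the claim

# Hypothetical example of one link in an argument about a speaking test
link = ToulminLink(
    claim="Scores reflect speaking ability relevant to the target domain.",
    warrants=["Tasks elicit the abilities defined in the construct."],
    backing=["Expert judgment study", "Pilot discourse analysis"],
    rebuttals=["Raters attend to accent rather than the rubric criteria."],
)
print(link.claim)
```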
{"title":"Evaluating Language Assessments From an Ethics Perspective: Suggestions for a New Agenda","authors":"Antony Kunnan","doi":"10.20622/jltajournal.23.0_3","DOIUrl":"https://doi.org/10.20622/jltajournal.23.0_3","url":null,"abstract":"Introduction The dominant 20th century approach to the evaluation of language assessments was the Standards-based approach. The Standards most evaluators referred to are the American Psychological Association (APA), American Educational Research Association (AERA), National Council on Measurement in Education (NCME) Standards (1999, 2014). These standards (mainly a list of test qualities such as validity and reliability, and of late, consequences and fairness) were developed from best practices at assessment institutions and had loose connections to theories of educational and psychological measurement. The “Test Usefulness” concept proposed by Bachman and Palmer (1996) was a popular example of the Standards approach. In the early part of the 21st century, Kane (1992) and Bachman and Palmer (2010) proposed an Argument-based approach using Toulmin’s way of structuring arguments with claims, warrants, backing and rebuttals. This approach provided a framework for evaluating language assessments. Bachman and Palmer’s (2010) “Assessment Use Argument” (AUA) is an example of this approach. While both approaches provide ways for researchers to conduct evaluations, they have a weakness, and that is they generally lack an articulated philosophical grounding. This lack of philosophical grounding can be seen in the Standards approach in which why the listed standards are important and not others is not articulated. In the Argument approach, what aspects are to be included as claims and warrants is left the assessment developer with the evaluator following them which is a critical problem. To remedy this situation, I am proposing an Ethics-based approach to assessment evaluation. The framework that implements the approach harnesses the dual concepts of fair assessments and just institutions leading to the Principle of Fairness and Principle of Justice, respectively.","PeriodicalId":249185,"journal":{"name":"JLTA Journal","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127291797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

The Effects of Retelling on Reading Comprehension: Focusing on Different Levels of Comprehension and Non-Textual Information in Retelling Protocols
DOI: 10.20622/jltajournal.24.0_23
S. Ito
{"title":"The Effects of Retelling on Reading Comprehension: Focusing on Different Levels of Comprehension and Non-Textual Information in Retelling Protocols","authors":"S. Ito","doi":"10.20622/jltajournal.24.0_23","DOIUrl":"https://doi.org/10.20622/jltajournal.24.0_23","url":null,"abstract":"","PeriodicalId":249185,"journal":{"name":"JLTA Journal","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129351778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

Comparison Between Holistic and Analytic Rubrics of a Paired Oral Test
DOI: 10.20622/jltajournal.23.0_57
Rie Koizumi, Yo In’nami, Makoto Fukazawa
The current study aimed to reveal similarities and differences between a holistic and an analytic rubric used to assess speaking performance on a paired oral test. To this end, the speaking performances of 110 Japanese university students in paired oral interaction were rated both holistically and analytically. Comparisons between the two rubrics using many-facet Rasch measurement showed that both worked effectively, with the analytic rubric working slightly better in terms of global fit, test-taker and task separation, test-taker and task reliability, standard errors, and the percentage of test takers with overfit. Correlation and regression analyses indicated a strong relationship between the two rubrics (r = .84), and the Interactive communication and Fluency analytic criteria substantially explained holistic scores (adjusted R² = .71). The results suggest that teachers can obtain similar results with either rubric type and that, if they select an analytic rubric, a priority would be to include the Interactive communication and Fluency criteria.
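As a rough illustration of the reported analyses, the sketch below (Python with numpy and statsmodels, using invented scores rather than the authors' data) computes a Pearson correlation between holistic and analytic totals and an OLS regression of holistic scores on two analytic criteria, reporting adjusted R² as in the abstract.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 110
# Hypothetical analytic criterion scores for each test taker
interactive = rng.normal(3, 1, n)
fluency = 0.6 * interactive + rng.normal(0, 0.8, n)
holistic = 0.5 * interactive + 0.4 * fluency + rng.normal(0, 0.5, n)
analytic_total = interactive + fluency     # stand-in for the summed analytic rubric

# Strength of the holistic-analytic relationship
r = np.corrcoef(holistic, analytic_total)[0, 1]
print(f"r = {r:.2f}")

# Regress holistic scores on the two analytic criteria
X = sm.add_constant(np.column_stack([interactive, fluency]))
model = sm.OLS(holistic, X).fit()
print(f"adjusted R^2 = {model.rsquared_adj:.2f}")
```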
{"title":"Comparison Between Holistic and Analytic Rubrics of a Paired Oral Test","authors":"Rie Koizumi, Yo In’nami, Makoto Fukazawa","doi":"10.20622/jltajournal.23.0_57","DOIUrl":"https://doi.org/10.20622/jltajournal.23.0_57","url":null,"abstract":"The current study aimed to reveal similarities and differences between a holistic and an analytic rubric used in assessing speaking performance in a paired oral test. To this end, speaking performances of 110 Japanese university students produced in paired oral interaction were evaluated by raters, holistically and analytically. The comparisons made between the two rubrics using many-facet Rasch measurement showed that both worked effectively, with the analytic rubric working slightly better in terms of a better global fit, a better test-taker and task separation, higher test-taker and task reliability, smaller standard errors, and a smaller percentage of test takers with overfit. Correlation and regression analysis indicated a strong relationship between the two (r = .84) and the Interactive communication and Fluency analytic criteria substantially explained holistic scores (adjusted R2 = .71). Results suggest that teachers can obtain similar results with either rubric type and, if they select an analytic one, a priority would be to include Interactive communication and Fluency criteria.","PeriodicalId":249185,"journal":{"name":"JLTA Journal","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128564312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

Japanese language testing in American universities: Research agenda in the new normal
DOI: 10.20622/jltajournal.25.0_3
Kimi Kondo-Brown (近藤ブラウン妃美)
{"title":"Japanese language testing in American universities: Research agenda in the new normal","authors":"妃美 近藤ブラウン","doi":"10.20622/jltajournal.25.0_3","DOIUrl":"https://doi.org/10.20622/jltajournal.25.0_3","url":null,"abstract":"","PeriodicalId":249185,"journal":{"name":"JLTA Journal","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133079644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}