Pub Date: 2024-10-01 | Epub Date: 2024-07-10 | DOI: 10.3758/s13428-024-02464-7
Haoran Li
Generalized linear mixed models (GLMMs) have great potential for handling count data in single-case experimental designs (SCEDs). However, applied researchers face challenges in making the many statistical decisions such advanced techniques require. This study investigated a critical issue: selecting an appropriate distribution for the different types of count data that arise in SCEDs due to overdispersion and/or zero-inflation. To this end, I proposed two model selection frameworks, one based on information criteria (AIC and BIC) and one based on a multistage model-selection procedure. Four data scenarios were simulated: Poisson, negative binomial (NB), zero-inflated Poisson (ZIP), and zero-inflated negative binomial (ZINB). The same set of models (i.e., Poisson, NB, ZIP, and ZINB) was fitted in each scenario. In the simulation, I evaluated 10 model selection strategies within the two frameworks by assessing model selection bias and its consequences for the accuracy of treatment effect estimates and inferential statistics. Based on the simulation results and previous work, I provide recommendations on which model selection methods to adopt in different scenarios. The implications, limitations, and future research directions are also discussed.
Title: Model selection of GLMMs in the analysis of count data in single-case studies: A Monte Carlo simulation. Journal: Behavior Research Methods.
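The information-criterion framework described above can be illustrated with a minimal, self-contained sketch. This is not the paper's GLMM code (which involves random effects across cases); it simply shows how AIC discriminates between a Poisson and a negative binomial fit on overdispersed counts, using method-of-moments NB estimates as a stand-in for maximum likelihood. The data vector is invented for illustration.

```python
import math

def poisson_aic(counts):
    """AIC of a Poisson fit; the MLE of the mean is the sample mean."""
    mu = sum(counts) / len(counts)
    ll = sum(y * math.log(mu) - mu - math.lgamma(y + 1) for y in counts)
    return 2 * 1 - 2 * ll  # one estimated parameter

def nb_aic(counts):
    """AIC of a negative binomial fit (size r, prob p) via method of moments."""
    n = len(counts)
    mu = sum(counts) / n
    var = sum((y - mu) ** 2 for y in counts) / n
    if var <= mu:  # no overdispersion: NB degenerates to Poisson
        return float("inf")
    r = mu * mu / (var - mu)
    p = r / (r + mu)
    ll = sum(math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
             + r * math.log(p) + y * math.log(1 - p) for y in counts)
    return 2 * 2 - 2 * ll  # two estimated parameters

overdispersed = [0, 1, 0, 2, 9, 0, 1, 14, 3, 0, 7, 1]  # variance >> mean
print(poisson_aic(overdispersed), nb_aic(overdispersed))
```

For these counts the NB AIC is markedly lower than the Poisson AIC, which is the kind of evidence the information-criterion framework uses to pick a distribution; zero-inflated variants add a mixture component on top of this.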
Pub Date: 2024-10-01 | Epub Date: 2024-05-15 | DOI: 10.3758/s13428-024-02420-5
Frouke Hermens
Observational studies of human behaviour often require the annotation of objects in video recordings. Automatic object detection has been strongly facilitated by the development of YOLO ('you only look once') and particularly by YOLOv8 from Ultralytics, which is easy to use. The present study examines the conditions required for accurate object detection with YOLOv8. The results show almost perfect object detection even when the model was trained on a small dataset (100 to 350 images). The detector, however, does not extrapolate well to the same object against other backgrounds. Training the detector on images from a variety of backgrounds restores excellent object detection. YOLOv8 could be a game changer for behavioural research that requires object annotation in video recordings.
Title: Automatic object detection for behavioural research using YOLOv8. Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362367/pdf/
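Detection accuracy of the kind reported here is conventionally scored by intersection-over-union (IoU) between predicted and ground-truth bounding boxes; a prediction typically counts as correct when IoU exceeds a threshold such as 0.5. The sketch below shows the standard metric, not code from the study:

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # partial overlap: 25/175
```

In practice, training a custom YOLOv8 detector on a small annotated dataset is done through the Ultralytics Python API (roughly: load a pretrained checkpoint with `YOLO("yolov8n.pt")`, then call `.train()` with a dataset config); consult the Ultralytics documentation for the exact arguments.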
Pub Date: 2024-10-01 | Epub Date: 2024-06-11 | DOI: 10.3758/s13428-024-02414-3
Nadine Fitzpatrick, Caroline Floccia
Investigating how infants first establish relationships between words is a necessary step towards understanding how an interconnected network of semantic relationships develops in the adult lexical-semantic system. Stimuli selection for these child studies is critical, since words must be both familiar and highly imageable. However, English infant studies to date have relied on adult word association norms to inform stimuli selection, as no resource currently exists for child-specific word associations. We present three experiments that explore the strength of word-word relationships in 3-year-olds. Experiment 1 collected children's word associations (WAs) (N = 150; female = 84, L1 = British English) and compared them to adult associative norms (Moss & Older, 1996; Nelson et al., 2004, Behavior Research Methods, Instruments, & Computers, 36(3), 402-407). Experiment 2 replicated the WAs from Experiment 1 in an online adaptation of the task (N = 24: 13 female, L1 = British English). Both experiments indicated a high proportion of child-specific WAs not represented in the adult norms (Moss & Older, 1996; Nelson et al., 2004). Experiment 3 tested noun-noun WAs from these responses in an online semantic priming study (N = 40: 19 female, L1 = British English) and found that association type modulated priming (F(2.57, 100.1) = 13.13, p < .0001, generalized η² = .19). This research provides a resource of child-specific, imageable noun-noun word pair stimuli suitable for testing young children in word recognition and semantic priming studies.
Title: Comparing child word associations to adult associative norms: Evidence for child-specific associations with a strong priming effect in 3-year-olds. Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362254/pdf/
Pub Date: 2024-10-01 | Epub Date: 2024-07-25 | DOI: 10.3758/s13428-024-02468-3
Debora de Chiusole, Umberto Granziol, Andrea Spoto, Luca Stefanutti
Indexes for estimating the overall reliability of a test in the framework of knowledge space theory (KST) are proposed and analyzed. First, the possibility of applying existing classical test theory (CTT) methods in KST, based on the ratio of true score variance to the total variance of the measure, was explored. However, these methods are not suitable because, in KST, error and true score are not independent. Therefore, two new indexes based on the concepts of entropy and conditional entropy are developed. One index estimates the reliability of the response pattern given the knowledge state, while the second refers to the reliability of a person's estimated knowledge state. Theoretical considerations, simulations, and an empirical example on real data are provided in a study of the behavior of these indexes under a range of conditions.
Title: Reliability of a probabilistic knowledge structure. Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362261/pdf/
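The entropy-based idea behind such indexes can be sketched in a few lines: the lower the conditional entropy of responses given the underlying state, the more determined (reliable) the responses are. The code below shows only the textbook Shannon quantities on toy joint distributions, not the paper's specific KST indexes:

```python
import math

def entropy(probs):
    """Shannon entropy H(X) = -sum p * log2(p) of a distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def conditional_entropy(joint):
    """H(Y|X) from a joint distribution given as {(x, y): p}."""
    px = {}
    for (x, _), p in joint.items():
        px[x] = px.get(x, 0.0) + p
    return -sum(p * math.log2(p / px[x])
                for (x, _), p in joint.items() if p > 0)

# Perfectly reliable: the (toy) knowledge state fully determines the response.
perfect = {(0, 0): 0.5, (1, 1): 0.5}
# Maximally noisy: each state yields either response half the time.
noisy = {(0, 0): 0.25, (0, 1): 0.25, (1, 0): 0.25, (1, 1): 0.25}
print(conditional_entropy(perfect), conditional_entropy(noisy))
```

Here H(response | state) is 0 bits for the perfectly reliable case and 1 bit for the noisy one; a reliability index can be built by normalizing such quantities.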
Pub Date: 2024-10-01 | Epub Date: 2024-08-02 | DOI: 10.3758/s13428-024-02458-5
Alice Xu, Ji Y Son, Catherine M Sandhofer
This paper introduces the A Library for Innovative Category Exemplars (ALICE) database, a resource that enhances research efficiency in cognitive and developmental studies by providing printable 3D objects representing 30 novel categories. Our research consists of three experiments validating the novelty and complexity of the objects in ALICE. Experiment 1 assessed the novelty of the objects through adult participants' subjective familiarity ratings and their agreement on object naming and descriptions; the results confirm the general novelty of the objects. Experiment 2 employed multidimensional scaling (MDS) to analyze perceived similarities between objects, revealing a three-dimensional structure based solely on shape, indicative of their complexity. Experiment 3 used two clustering techniques to categorize the objects: k-means clustering to create nonoverlapping global categories, and hierarchical clustering to allow global categories that overlap and have a hierarchical structure. Through stability tests, we verified the robustness of each clustering method and observed moderate to good consensus between them, affirming the strength of our dual approach in effectively and accurately delineating meaningful object categories. By offering easy access to customizable novel stimuli, ALICE provides a practical solution to the challenges of creating novel physical objects for experimental purposes.
Title: A library for innovative category exemplars (ALICE) database: Streamlining research with printable 3D novel objects. Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362262/pdf/
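The k-means step used in Experiment 3 is standard Lloyd's algorithm: alternate between assigning points to their nearest centroid and moving each centroid to its cluster mean. A minimal sketch on invented 2D coordinates (the study clustered perceived-similarity coordinates from MDS):

```python
def kmeans(points, centroids, iters=20):
    """Plain Lloyd's algorithm on 2D points with caller-supplied initial centroids."""
    for _ in range(iters):
        # Assignment step: each point goes to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            d = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[d.index(min(d))].append(p)
        # Update step: centroids move to the mean of their cluster.
        centroids = [
            (sum(x for x, _ in cl) / len(cl), sum(y for _, y in cl) / len(cl))
            if cl else c
            for cl, c in zip(clusters, centroids)
        ]
    return centroids, clusters

pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
cents, clus = kmeans(pts, [(0, 0), (10, 10)])
```

Hierarchical clustering, the second technique, instead merges the closest clusters iteratively and yields a dendrogram that can be cut to allow the overlapping, nested categories described above.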
Pub Date: 2024-10-01 | Epub Date: 2024-03-19 | DOI: 10.3758/s13428-024-02365-9
Ruoxuan Li, Lijuan Wang
Causal-formative indicators are often used in social science research. To achieve identification in causal-formative indicator modeling, constraints must be applied. One conventional method is to constrain the weight of a single formative indicator to be 1; the choice of which indicator receives the fixed weight, however, may influence statistical inferences about the structural path coefficients from the causal-formative construct to outcomes. Another conventional method uses equal weights (e.g., 1), assuming that all indicators contribute equally to the latent construct, which can be a strong assumption. To address the limitations of these methods, we proposed an alternative constraint in which the sum of the weights is fixed to a constant. We analytically studied the relations and interpretations of the structural path coefficients under each constraint method, and the results showed that the proposed method yields better interpretations of the path coefficients. Simulation studies compared the performance of the weight constraint methods in causal-formative indicator modeling with one or two outcomes. The conventional methods produced higher biases in the path coefficient estimates than the proposed method, which showed negligible bias and satisfactory coverage rates in the studied conditions. This study emphasizes the importance of using an appropriate weight constraint method in causal-formative indicator modeling.
Title: Investigating weight constraint methods for causal-formative indicator modeling. Journal: Behavior Research Methods.
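The two identification strategies contrasted above amount to different rescalings of the same raw weight vector. A toy sketch (invented weights; the actual constraints are imposed during model estimation, not applied after the fact):

```python
def fix_one_weight(raw, index=0):
    """Conventional identification: rescale so one indicator's weight is 1."""
    return [w / raw[index] for w in raw]

def fix_weight_sum(raw, total=1.0):
    """Sum-constraint identification: rescale so the weights sum to a constant."""
    s = sum(raw)
    return [w * total / s for w in raw]

raw = [2.0, 1.0, 1.0]
print(fix_one_weight(raw))        # depends on which indicator is anchored
print(fix_one_weight(raw, 1))     # anchoring a different indicator rescales all paths
print(fix_weight_sum(raw))        # one scale regardless of the anchor choice
```

The point of the contrast: under the fix-one-weight constraint, the construct's scale (and hence the structural path coefficient) changes with the arbitrary choice of anchor indicator, whereas the sum constraint fixes a single scale for the construct.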
Pub Date: 2024-10-01 | Epub Date: 2024-03-25 | DOI: 10.3758/s13428-024-02388-2
Stefan Schneider, Raymond Hernandez, Doerte U Junghaenel, Haomiao Jin, Pey-Jiuan Lee, Hongxin Gao, Danny Maupin, Bart Orriens, Erik Meijer, Arthur A Stone
Questionnaires are ever-present in survey research. In this study, we examined whether an indirect indicator of general cognitive ability could be developed from response patterns in questionnaires. We drew on two established phenomena characterizing connections between cognitive ability and people's performance on basic cognitive tasks and examined whether they apply to questionnaire responses. (1) The worst performance rule (WPR) states that people's worst performance on multiple sequential tasks is more indicative of their cognitive ability than their average or best performance. (2) The task complexity hypothesis (TCH) suggests that relationships between cognitive ability and performance increase with task complexity. We conceptualized the items of a questionnaire as a series of cognitively demanding tasks. A graded response model was used to estimate respondents' performance on each item from the difference between the observed and model-predicted response ("response error" scores). Analyzing data from 102 items (21 questionnaires) collected from a large-scale nationally representative sample of people aged 50+ years, we found robust associations of cognitive ability with a person's largest but not smallest response error scores (supporting the WPR), and stronger associations of cognitive ability with response errors for more complex than for less complex questions (supporting the TCH). Results replicated across two independent samples and six assessment waves. A latent variable of response errors estimated for the most complex items correlated .50 with a latent cognitive ability factor, suggesting that response patterns can be used to extract a rough indicator of general cognitive ability in survey research.
Title: Can you tell people's cognitive ability level from their response patterns in questionnaires? Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362444/pdf/
Pub Date: 2024-10-01 | Epub Date: 2024-05-06 | DOI: 10.3758/s13428-024-02429-w
Tanja Kutscher, Michael Eid
Rating scales are susceptible to response styles that undermine scale quality. Optimizing a rating scale can tailor it to individuals' cognitive abilities, thereby preventing response styles related to a suboptimal response format. However, the discrimination ability of individuals in a sample may vary, suggesting that different rating scales may suit different individuals. This study examines (1) whether response styles can be avoided when individuals are allowed to choose a rating scale and (2) whether the psychometric properties of self-chosen rating scales improve relative to given rating scales. To address these objectives, data from the flourishing scale were used as an illustrative example. Workers on Amazon's Mechanical Turk platform (N = 7042) completed an eight-item flourishing scale twice: (1) using a randomly assigned four-, six-, or 11-point rating scale, and (2) using a self-chosen rating scale. Applying the restrictive mixed generalized partial credit model (rmGPCM) allowed examination of category use across the conditions. Correlations with external variables were calculated to assess the effects of the rating scales on criterion validity. The results revealed consistent use of self-chosen rating scales, with approximately equal proportions of the three response styles. Ordinary response behavior was observed in 55-58% of individuals, an increase of 12-15% compared to assigned rating scales. The self-chosen rating scales also exhibited superior psychometric properties. The implications of these findings are discussed.
Title: Psychometric benefits of self-chosen rating scales over given rating scales. Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362426/pdf/
Pub Date: 2024-10-01 | Epub Date: 2024-05-21 | DOI: 10.3758/s13428-024-02440-1
Valeria A Pfeifer, Trish D Chilton, Matthew D Grilli, Matthias R Mehl
Human transcription has long been the gold standard for preparing spoken language corpora for text analysis in psychology. However, this standard comes at considerable cost and creates barriers to quantitative spoken language analysis that recent advances in speech-to-text technology could address. The current study quantifies the accuracy of AI-generated transcripts relative to human-corrected transcripts across younger (n = 100) and older (n = 92) adults and two spoken language tasks. Further, it evaluates the validity of Linguistic Inquiry and Word Count (LIWC) features extracted from these two kinds of transcripts, as well as from transcripts specifically prepared for LIWC analyses via tagging. We find that, overall, AI-generated transcripts are highly accurate, with a word error rate of 2.50% to 3.36%, albeit slightly less accurate for younger than for older adults. LIWC features extracted from either type of transcript are highly correlated, while the tagging procedure significantly alters filler word categories. Based on these results, automatic speech-to-text appears ready for psychological language research using spoken language tasks in relatively quiet environments, unless filler words are of interest to researchers.
Title: How ready is speech-to-text for psychological language research? Evaluating the validity of AI-generated English transcripts for analyzing free-spoken responses in younger and older adults. Journal: Behavior Research Methods. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11365748/pdf/
Pub Date : 2024-10-01Epub Date: 2024-05-23DOI: 10.3758/s13428-024-02434-z
Serena Marchesi, Davide De Tommaso, Kyveli Kompatsiari, Yan Wu, Agnieszka Wykowska
In the last decade, scientists investigating human social cognition have started bringing traditional laboratory paradigms more "into the wild" to examine how socio-cognitive mechanisms of the human brain work in real-life settings. As this implies transferring 2D observational paradigms to 3D interactive environments, there is a risk of compromising experimental control. In this context, we propose a methodological approach that uses humanoid robots as proxies for social interaction partners and embeds them in experimental protocols that adapt classical paradigms of cognitive psychology to interactive scenarios. This allows for a relatively high degree of "naturalness" of interaction and excellent experimental control at the same time. Here, we present two case studies in which our methods and tools were applied and replicated across two different laboratories, namely the Italian Institute of Technology in Genova (Italy) and the Agency for Science, Technology and Research in Singapore. In the first case study, we present a replication of an interactive version of a gaze-cueing paradigm reported in Kompatsiari et al. (J Exp Psychol Gen 151(1):121-136, 2022). The second case study presents a replication of a "shared experience" paradigm reported in Marchesi et al. (Technol Mind Behav 3(3):11, 2022). As both studies replicate results across labs and different cultures, we argue that our methods allow for reliable and replicable setups, even though the protocols are complex and involve social interaction. We conclude that our approach can benefit the field of social cognition research and afford higher replicability, for example, in cross-cultural comparisons of social cognition mechanisms.
{"title":"Tools and methods to study and replicate experiments addressing human social cognition in interactive scenarios.","authors":"Serena Marchesi, Davide De Tommaso, Kyveli Kompatsiari, Yan Wu, Agnieszka Wykowska","doi":"10.3758/s13428-024-02434-z","DOIUrl":"10.3758/s13428-024-02434-z","url":null,"abstract":"<p><p>In the last decade, scientists investigating human social cognition have started bringing traditional laboratory paradigms more \"into the wild\" to examine how socio-cognitive mechanisms of the human brain work in real-life settings. As this implies transferring 2D observational paradigms to 3D interactive environments, there is a risk of compromising experimental control. In this context, we propose a methodological approach which uses humanoid robots as proxies of social interaction partners and embeds them in experimental protocols that adapt classical paradigms of cognitive psychology to interactive scenarios. This allows for a relatively high degree of \"naturalness\" of interaction and excellent experimental control at the same time. Here, we present two case studies where our methods and tools were applied and replicated across two different laboratories, namely the Italian Institute of Technology in Genova (Italy) and the Agency for Science, Technology and Research in Singapore. In the first case study, we present a replication of an interactive version of a gaze-cueing paradigm reported in Kompatsiari et al. (J Exp Psychol Gen 151(1):121-136, 2022). The second case study presents a replication of a \"shared experience\" paradigm reported in Marchesi et al. (Technol Mind Behav 3(3):11, 2022). As both studies replicate results across labs and different cultures, we argue that our methods allow for reliable and replicable setups, even though the protocols are complex and involve social interaction. 
We conclude that our approach can be of benefit to the research field of social cognition and grant higher replicability, for example, in cross-cultural comparisons of social cognition mechanisms.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":null,"pages":null},"PeriodicalIF":4.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11362199/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141086535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}