Pub Date: 2025-10-29. DOI: 10.3758/s13428-025-02783-3
Andrew F Hayes, Paul D Allison, Sean M Alexander
Mediation analysis, popular in many disciplines that rely on behavioral science data analysis techniques, is often conducted using ordinary least squares (OLS) regression methods. Given that one of OLS regression's weaknesses is its susceptibility to estimation bias resulting from unaccounted-for random measurement error in variables on the right-hand sides of the equations, many published mediation analyses almost certainly contain some, and perhaps substantial, bias in the direct, indirect, and total effects. In this manuscript, we offer errors-in-variables (EIV) regression as an easy-to-use alternative when a researcher has reasonable estimates of the reliability of the variables in the analysis. In three real-data examples, we show that EIV regression-based mediation analysis produces estimates equivalent to those obtained using an alternative, more analytically complex approach that accounts for measurement error (single-indicator latent variable structural equation modeling), yet quite different from the results generated by standard OLS regression, which ignores random measurement error. In a small-scale simulation, we also establish that EIV regression successfully recovers the parameters of a mediation model involving variables adulterated by random measurement error, while OLS regression generates biased estimates. To facilitate the adoption of EIV regression, we describe an implementation in the PROCESS macro for SPSS, SAS, and R that we believe eliminates most any excuse one can conjure for not accounting for random measurement error when conducting a mediation analysis.
Title: Errors-in-variables regression as a viable approach to mediation analysis with random error-tainted measurements: Estimation, effectiveness, and an easy-to-use implementation. (Behavior Research Methods, 57(12), 323)
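The core of the EIV idea can be illustrated outside of PROCESS (whose internals are not shown here): with independent random measurement error, the observed variance of a predictor overstates its true variance by a factor of 1/reliability, so a method-of-moments correction rescales the diagonal of the observed covariance matrix before solving the normal equations. The following is a hypothetical Python sketch under those assumptions, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000  # large n so sampling noise is negligible

# Generating (latent) mediation model: X -> M -> Y
a, b, cp = 0.5, 0.4, 0.2          # a-path, b-path, direct effect
X = rng.normal(size=n)
M = a * X + rng.normal(size=n)
Y = b * M + cp * X + rng.normal(size=n)

# Observed scores contaminated by independent random measurement error,
# scaled so that reliability = var(true) / var(observed)
rel_x, rel_m = 0.7, 0.8
Xo = X + rng.normal(scale=np.sqrt((1 - rel_x) / rel_x), size=n)
Mo = M + rng.normal(scale=np.sqrt(M.var() * (1 - rel_m) / rel_m), size=n)

def paths(x, m, y, rel_x=1.0, rel_m=1.0):
    """Mediation paths from covariances; reliabilities of 1 give plain OLS,
    smaller values apply the EIV (method-of-moments) correction."""
    S = np.cov(np.column_stack([x, m]), rowvar=False)
    S[0, 0] *= rel_x                  # strip error variance from diagonals
    S[1, 1] *= rel_m
    a_hat = S[0, 1] / S[0, 0]
    s_xy = np.array([np.cov(x, y)[0, 1], np.cov(m, y)[0, 1]])
    cp_hat, b_hat = np.linalg.solve(S, s_xy)
    return a_hat, b_hat, cp_hat

print("OLS (ignores error):", np.round(paths(Xo, Mo, Y), 3))
print("EIV (corrected):    ", np.round(paths(Xo, Mo, Y, rel_x, rel_m), 3))
```

With reliabilities of .70 and .80, the uncorrected a-path is attenuated toward roughly a × rel_x, while the corrected estimates recover the generating parameters; the indirect effect is then a_hat × b_hat.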
Pub Date: 2025-10-29. DOI: 10.3758/s13428-025-02834-9
Jianyong Cai, Yuting Han, Xin Jiang
For decades, there has been a dearth of efficient tools for researchers to measure L2 Chinese proficiency. This study introduces ClozCHI, a cloze test developed to assess L2 Chinese proficiency across a range of levels from novice to advanced. Unlike existing Chinese cloze tests, ClozCHI comprises three passages with varying levels of difficulty. Its effectiveness was assessed with 225 L2 Chinese learners who participated in the Hanyu Shuiping Kaoshi (HSK) at Levels 3 to 6 and completed ClozCHI within 2 weeks before or after their HSK tests. Additionally, supplementary data were collected from 97 learners below HSK Level 3 who had not taken the HSK. The psychometric analysis of ClozCHI using both classical test theory (CTT) and item response theory (IRT) revealed that the test demonstrated appropriate difficulty, good discrimination, and high reliability from novice to advanced levels. ClozCHI scores showed strong correlations with HSK levels, demonstrating criterion-related validity. Confirmatory factor analysis (CFA) further supported its unidimensional structure. ClozCHI was more effective for assessing reading than listening or writing. These findings suggest that ClozCHI is a reliable and valid instrument for assessing L2 Chinese proficiency in research settings. ClozCHI is freely available for researchers from the Open Science Framework repository: https://osf.io/5kcrq/ .
Title: ClozCHI: A cloze test for measuring L2 Chinese proficiency from novice to advanced levels. (Behavior Research Methods, 57(12), 325)
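As a rough illustration of the CTT side of such an analysis (the IRT and CFA steps are not reproduced here): item difficulty is the proportion correct, and discrimination is the corrected item-total correlation. The snippet below is a generic Python sketch on simulated dichotomous data, not the ClozCHI analysis code:

```python
import numpy as np

def ctt_item_stats(responses):
    """CTT item statistics for a 0/1-scored matrix (rows = examinees)."""
    responses = np.asarray(responses, dtype=float)
    difficulty = responses.mean(axis=0)              # proportion correct
    total = responses.sum(axis=1)
    discrimination = np.array([
        np.corrcoef(responses[:, j], total - responses[:, j])[0, 1]
        for j in range(responses.shape[1])           # corrected item-total r
    ])
    return difficulty, discrimination

def cronbach_alpha(responses):
    responses = np.asarray(responses, dtype=float)
    k = responses.shape[1]
    return k / (k - 1) * (1 - responses.var(axis=0, ddof=1).sum()
                          / responses.sum(axis=1).var(ddof=1))

# Simulated examinees whose latent proficiency drives all items
rng = np.random.default_rng(1)
theta = rng.normal(size=500)
items = (theta[:, None] + rng.normal(size=(500, 10)) > 0).astype(int)
difficulty, discrimination = ctt_item_stats(items)
alpha = cronbach_alpha(items)
```

Because every simulated item loads on the same latent proficiency, all discriminations come out positive and alpha is high, mirroring the unidimensional structure the CFA supported.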
Pub Date: 2025-10-29. DOI: 10.3758/s13428-025-02833-w
Mirka Henninger, Simone Malejka, Johannes Titz
Researchers in psychology traditionally use analysis of variance to examine differences between multiple groups or conditions. A less well-known but valuable alternative is contrast analysis, a simple statistical method for testing directional, theoretically motivated hypotheses that are defined prior to data collection. In this article, we review the core concepts of contrast analysis for testing hypotheses in between-subjects and within-subjects designs. We also outline and demonstrate the largely unknown possibility of directly testing two competing contrasts against each other. In the tutorial part of the article, we show how such competing-contrast analyses can be conducted in the free, open-source software R using the package cofad. Because competing-contrast analysis is a straightforward, flexible, highly powered, and hypothesis-driven approach, it is a valuable tool for extending the understanding of cognitive and behavioral processes in psychological research.
Title: Contrast analysis for competing hypotheses: A tutorial using the R package cofad. (Behavior Research Methods, 57(12), 326; open access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12572084/pdf/)
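A between-subjects contrast test of the kind cofad automates can be written out directly: the contrast estimate is the weighted sum of condition means, with a standard error built from the pooled ANOVA error variance. The following is a hypothetical Python sketch of that computation (cofad itself is an R package and offers far more, including the competing-contrast tests described above):

```python
import numpy as np
from scipy import stats

def contrast_test(groups, weights):
    """Planned between-subjects contrast: t-test of L = sum(w_j * mean_j)."""
    weights = np.asarray(weights, dtype=float)
    assert abs(weights.sum()) < 1e-12, "contrast weights must sum to zero"
    means = np.array([g.mean() for g in groups])
    ns = np.array([len(g) for g in groups])
    df = ns.sum() - len(groups)
    mse = sum(((g - g.mean()) ** 2).sum() for g in groups) / df  # pooled error
    L = weights @ means
    se = np.sqrt(mse * np.sum(weights ** 2 / ns))
    t = L / se
    return L, t, df, 2 * stats.t.sf(abs(t), df)

# Hypothesis of a linear increase across three conditions
rng = np.random.default_rng(0)
groups = [rng.normal(loc=m, size=100) for m in (0.0, 0.5, 1.0)]
L, t, df, p = contrast_test(groups, [-1, 0, 1])
```

The one-degree-of-freedom test concentrates power on the predicted ordering, which is the main argument for contrasts over an omnibus F-test.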
Pub Date: 2025-10-29. DOI: 10.3758/s13428-025-02839-4
Lauren Petley, Julia Brzac, Lauren Meyers, Sarah Jane Gascoyne
Flanker tasks are among the most popular paradigms for studying conflict resolution, particularly the intuitive "arrows" flanker task. To date, there is no auditory equivalent of this task, potentially due to the risk of perceptual masking. Using the strategy of reducing masking through vocal differences between two competing talkers, the present study explores the validity of a speech-based flanker task with co-located talkers. One hundred forty-two healthy young adults (aged 18-24) completed three auditory flanker tasks using naturally produced speech with increasing spectral separation. Stimuli were the command words 'left' and 'right,' as well as two neutral flankers designed to impose masking effects, one containing only acoustic-phonetic information and another containing lexical information. On each trial, the target and flanker words were heard concurrently, with participants responding to the direction spoken by the target talker. Classic conflict effects were observed for the incongruent flanker, and these could not be attributed to masking. Both masking and conflict effects were somewhat reduced by spectral separation. A final experiment with 62 healthy adults (aged 18-59) compared the test-retest reliability and distributional properties of this auditory directional command flanker task with those of the visual arrows flanker task. The auditory task demonstrated test-retest reliability and evolution of conflict effects over time similar to those of the visual task. This pattern of results suggests that the directional command flanker task is comparable to the arrows flanker task and can be utilized to quantify inhibitory control in the auditory modality.
Title: Validating the directional command flanker task: An auditory analog of the arrows flanker. (Behavior Research Methods, 57(12), 324)
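For readers implementing such a task, the conflict effect in paradigms like this is conventionally the per-participant mean incongruent-trial RT minus mean congruent-trial RT, tested against zero across participants. A minimal Python sketch under that standard definition (not the authors' analysis code; the variable names and simulated magnitudes are illustrative):

```python
import numpy as np
from scipy import stats

def conflict_effect(rt, condition, subject):
    """Per-subject conflict effect (incongruent minus congruent mean RT)
    and a one-sample t-test of the effects against zero."""
    rt, condition, subject = map(np.asarray, (rt, condition, subject))
    effects = np.array([
        rt[(subject == s) & (condition == "incongruent")].mean()
        - rt[(subject == s) & (condition == "congruent")].mean()
        for s in np.unique(subject)
    ])
    t, p = stats.ttest_1samp(effects, 0.0)
    return effects, t, p

# Simulated data: a 50-ms conflict cost on incongruent trials
rng = np.random.default_rng(2)
rt, condition, subject = [], [], []
for s in range(20):
    base = rng.normal(450, 30)  # subject's baseline RT in ms
    for cond in ("congruent", "incongruent"):
        cost = 50 if cond == "incongruent" else 0
        for r in base + cost + rng.normal(0, 60, size=40):
            rt.append(r); condition.append(cond); subject.append(s)

effects, t, p = conflict_effect(rt, condition, subject)
```

Computing the effect within subject first, then testing across subjects, removes between-subject baseline RT differences from the error term.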
Pub Date: 2025-10-27. DOI: 10.3758/s13428-025-02857-2
Chi Ma, Xiao Liu
Partially nested designs, in which clustering occurs in some but not all study arms, are common in psychological and behavioral research. In these designs, clustering often arises in the treatment arm due to the treatment delivery, such as individuals clustered within treatment facilitators. This clustering structure raises substantive interest in exploring how treatment effects may vary across clusters. Yet, existing tools for estimating cluster-specific treatment effects in such designs remain limited, especially when clustering is treatment-induced and the assignment of treatment arm individuals to clusters is nonrandomized. To address this gap and to make the estimation accessible, we introduce an R package, PND.heter.cluster, which implements methods described in Liu (2024) for estimating cluster-specific treatment effects in 2/1 partially nested designs with treatment-induced clustering. The package supports the use of machine learning methods to relax modeling assumptions and enhance estimation flexibility. This tutorial describes the main function of the package and illustrates its use through a step-by-step application example using data from a partially nested design evaluating a teacher professional development program.
Title: PND.heter.cluster: An R package for estimating cluster-specific treatment effects in partially nested designs. (Behavior Research Methods, 57(12), 321)
Pub Date: 2025-10-27. DOI: 10.3758/s13428-025-02842-9
Guangyu Zeng, Yeojin Amy Ahn, Tiffany S Leung, Sarah E Maylott, Arushi Malik, Daniel S Messinger, Elizabeth A Simpson
Automated behavioral measurement using machine learning is gaining ground in psychological research. Automated approaches have the potential to reduce the labor and time associated with manual behavioral coding and to enhance measurement objectivity; yet their application in young infants remains limited. We asked whether automated measurement can accurately identify newborn mouth opening, a facial gesture involved in infants' communication and expression, in videos of 29 newborns (age range 9-29 days; 55.2% female; 58.6% White; 51.7% Hispanic/Latino) during neonatal imitation testing. We employed a three-dimensional cascade regression computer vision algorithm to automatically track and register newborn faces. The facial landmark coordinates of each frame were input into a support vector machine (SVM) classifier, trained to recognize the presence and absence of mouth openings at the frame level as identified by expert human coders. The SVM classifier was trained using leave-one-infant-out cross-validation (training: N = 22 newborns, 95 videos, 354,468 frames), and the best classifier showed an average validation accuracy of 75%. The final SVM classifier was tested on newborns not in the training set (testing: N = 7 newborns, 29 videos, 118,615 frames) and demonstrated 76% overall accuracy in identifying mouth opening. An intraclass correlation coefficient of .81 between the SVM classifier and human experts indicated that the classifier was, on a practical level, in agreement with experts in quantifying newborns' total rates of mouth opening across videos. Results highlight the potential of automated measurement approaches for objectively identifying the presence and absence of mouth opening in newborn infants.
Title: Automated detection of mouth opening in newborn infants. (Behavior Research Methods, 57(12), 322)
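The key validation choice here, generalizing to new infants rather than new frames, amounts to group-wise cross-validation: every frame from the held-out infant is excluded from training. The Python sketch below illustrates leave-one-infant-out validation on toy feature vectors, substituting a simple nearest-centroid classifier for the paper's landmark-tracking-plus-SVM pipeline (all names and data here are illustrative):

```python
import numpy as np

def leave_one_infant_out(X, y, groups, fit, predict):
    """Group-wise CV: all frames from one infant are held out per fold, so
    no infant contributes to both training and test data."""
    accs = {}
    for g in np.unique(groups):
        test = groups == g
        model = fit(X[~test], y[~test])
        accs[g] = float((predict(model, X[test]) == y[test]).mean())
    return accs

# Stand-in classifier: assign each frame to the nearest class centroid
def fit_centroids(X, y):
    return {c: X[y == c].mean(axis=0) for c in np.unique(y)}

def predict_centroids(model, X):
    classes = sorted(model)
    dists = np.stack([np.linalg.norm(X - model[c], axis=1) for c in classes])
    return np.array(classes)[dists.argmin(axis=0)]

# Toy per-frame "landmark" features with one group label per infant
rng = np.random.default_rng(0)
infant = rng.integers(0, 6, size=600)
y = rng.integers(0, 2, size=600)                 # mouth open vs. closed
X = rng.normal(size=(600, 20)) + y[:, None]      # class-separated features

accs = leave_one_infant_out(X, y, infant, fit_centroids, predict_centroids)
```

Frame-level random splits would leak nearly identical adjacent frames of the same infant across folds and overstate accuracy; grouping by infant avoids that.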
Pub Date: 2025-10-22. DOI: 10.3758/s13428-025-02840-x
Kyle Cox, Benjamin Kelcey
Bayesian and structural-after-measurement (SAM) approaches have been developed, in part, to address limitations of conventional estimators in the context of structural equation models (SEMs) with latent interactions. Although both approaches have shown promise in a variety of contexts including small-sample studies, there is very little literature systematically comparing the relative benefits, limitations, and trade-offs among these approaches. In this study, we (a) compared the performance of estimators under each approach in multilevel SEMs with a within-, between-, or cross-level latent interaction and (b) demonstrated the flexibility of SAM approaches by extending and investigating them in partially nested SEMs with latent moderated mediation. The results suggest substantial differences between estimator performance as a function of the type of latent interaction. SAM approaches performed well with a variety of latent interactions in multilevel and partially nested SEMs, while Bayesian approaches, including those with informative priors, struggled with models that included a cross-level latent interaction and were not easily extended to partially nested SEMs. Overall, the results suggest that SAM approaches are a versatile and highly adaptable alternative or complement to conventional full-information estimators. To conclude, we outline estimator considerations based on the SEM type, latent interaction, and data structure.
Title: Comparing Bayesian estimation and structural-after-measurement approaches for structural equation models with latent interactions and complex data structures. (Behavior Research Methods, 57(11), 320; open access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12546545/pdf/)
Pub Date: 2025-10-21. DOI: 10.3758/s13428-025-02806-z
Charles B Chang, Sunyoung Ahn, Youngjoo Kim
To facilitate objective measures of proficiency for language users of diverse backgrounds, recent research in second language acquisition and multilingualism has developed short, yet reliable, tests of lexical knowledge in a wide range of languages. In this paper, we describe the development of LexKO, a brief lexically based test of Korean language proficiency, including its underlying logic, composition, intended use, and limitations. Three rounds of pilot and validation testing with first- and second-language Korean users resulted in a highly reliable Korean test comprising 60 items that can be completed in a few minutes. Freely available for other researchers to use, LexKO produces scores that correlate significantly with both first- and second-language Korean users' scores on a standardized proficiency test (an abridged version of the Test of Proficiency in Korean) and may thus be helpful in multi-part studies for obtaining a quick, valid measure of proficiency in Korean, one of the world's fastest-growing foreign languages.
Title: LexKO: A quick, reliable lexical test of Korean language proficiency. (Behavior Research Methods, 57(11), 317; open access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12540534/pdf/)
Pub Date: 2025-10-21. DOI: 10.3758/s13428-025-02858-1
Natham Aguirre
In this work, I develop a mathematical formalization of the Replaced Elements Model (Wagner. Quarterly Journal of Experimental Psychology: Section B, 56(1), 7, 2003) within a general framework proposed by Ghirlanda (Journal of Mathematical Psychology, 64/65, 8-16, 2015; Journal of Mathematical Psychology, 85, 55-61, 2018), which provides a new way to apply and study the model. The result derived here has the novelty of explicitly stating how the model computes associative values without requiring either the application of complex algorithms or the use of special software. To show how to use this formalization, I apply it to the study of varied learning phenomena and several models, by either analytic means or simulations. In the process, I reproduce conclusions drawn previously for the Replaced Elements Model by other methods (Glautier. Behavior Research Methods, 39(4), 993-1000, 2007; Schultheis et al. Behavior Research Methods, 40, 435-441, 2008; Wagner. Quarterly Journal of Experimental Psychology: Section B, 56(1), 7, 2003). As an interesting byproduct, I provide a general algorithm that may be applied to simulate the predictions of the Replaced Elements Model, the Rescorla-Wagner model (Rescorla & Wagner. Classical conditioning, Current research and theory, 2, 64-69, 1972), and Pearce's configural model (Pearce. Psychological Review, 94(1), 61, 1994), among others. Concrete instances of the algorithm, coded in Python, are provided in the Appendix.
{"title":"A mathematical formalization of the replaced elements model.","authors":"Natham Aguirre","doi":"10.3758/s13428-025-02858-1","DOIUrl":"10.3758/s13428-025-02858-1","url":null,"abstract":"<p><p>In this work, I develop a mathematical formalization of the Replaced Elements Model (Wagner. Quarterly Journal of Experimental Psychology: Section B, 56(1), 7, 2003), within a general framework proposed by Ghirlanda (Journal of Mathematical Psychology, 64/65, 8-16, 2015, Journal of Mathematical Psychology, 85, 55-61, 2018), which provides a new way to apply and study the model. The result derived here has the novelty of explicitly stating how the model computes associative values without requiring either the application of complex algorithms or the use of special software. As a way of showing how to use this formalization, I apply it to the study of varied learning phenomena and several models, by either analytic means or simulations. In the process, I reproduce conclusions drawn previously for the Replaced Elements Model by other methods (Glautier. Behavior Research Methods, 39(4), 993-1000, 2007; Schultheis et al. Behavior Research Methods, 40, 435-441, 2008; Wagner. Experimental Psychology: Section B, 56(1), 7, 2003). As an interesting byproduct, I provide a general algorithm which may be applied to simulate the predictions of the replaced elements model, Rescorla-Wagner's model (Rescorla & Wagner. Classical conditioning, Current research and theory, 2, 64-69, 1972), and Pearce's configural model (Pearce. Psychological Review, 94(1), 61, 1994) among others. 
Concrete instances of the algorithm, coded in Python, are provided in the Appendix.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 11","pages":"318"},"PeriodicalIF":3.9,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145342913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-21DOI: 10.3758/s13428-025-02824-x
J W C Harris, M J Murphy, P A Chouinard
The recording and interpretation of lifting force data - such as load and grip forces - are central to studying weight perception. Typically, such data are collected using force transducer-embedded handles placed on top of objects. While effective, these handles may be impractical or undesirable for certain experimental paradigms. A potential alternative is a force transducer-embedded platform, but validation is needed to determine whether it captures force data with the same consistency and interpretability as the handle-based method, particularly given the potential for data loss around lift-off. In two experiments, we compared these methods by having participants lift light and heavy objects off a platform either directly or via handles to assess the convergent validity of experimental outcomes and the concurrent validity of the recorded data. Our findings indicate that the experimental outcomes and data from both methods were highly comparable, but only for the heavy objects. However, for the light object, platform-recorded force data showed lower agreement with handle-based measures, and several anticipated sensorimotor effects were not observed in the platform data. These discrepancies resulted in differences in experimental outcomes, particularly in the detection of switch effects, highlighting the platform's limitations for capturing lighter-weight interactions. Therefore, we suggest that while handles remain preferable for capturing rich force data, the platform method broadens methodological options and presents a viable and valid alternative.
{"title":"Validation of a force transducer-embedded platform as an alternative to handles in weight perception research.","authors":"J W C Harris, M J Murphy, P A Chouinard","doi":"10.3758/s13428-025-02824-x","DOIUrl":"10.3758/s13428-025-02824-x","url":null,"abstract":"<p><p>The recording and interpretation of lifting force data - such as load and grip forces - are central to studying weight perception. Typically, such data are collected using force transducer-embedded handles placed on top of objects. While effective, these handles may be impractical or undesirable for certain experimental paradigms. A potential alternative is a force transducer-embedded platform, but validation is needed to determine whether it captures force data with the same consistency and interpretability as the handle-based method, particularly given the potential for data loss around lift-off. In two experiments, we compared these methods by having participants lift light and heavy objects off a platform either directly or via handles to assess the convergent validity of experimental outcomes and the concurrent validity of the recorded data. Our findings indicate that the experimental outcomes and data from both methods were highly comparable, but only for the heavy objects. However, for the light object, platform-recorded force data showed lower agreement with handle-based measures, and several anticipated sensorimotor effects were not observed in the platform data. These discrepancies resulted in differences in experimental outcomes, particularly in the detection of switch effects, highlighting the platform's limitations for capturing lighter-weight interactions. 
Therefore, we suggest that while handles remain preferable for capturing rich force data, the platform method broadens methodological options and presents a viable and valid alternative.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 11","pages":"319"},"PeriodicalIF":3.9,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12540532/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145342866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}