Title: Measurement Invariance of and Mean Differences on the Kessler Psychological Distress Scale (K6) for LGBT and Cisgender, Heterosexual Individuals
Pub Date: 2025-09-25 | DOI: 10.1177/10731911251376235
Journal: Assessment
Authors: Anna L Gilmour, Brian A Feinstein, Mark A Whisman
The Kessler Psychological Distress Scale (K6) is used as a self-report measure of nonspecific psychological distress. Although research documents higher K6 scores among lesbian, gay, bisexual, and transgender (LGBT) individuals relative to cisgender, heterosexual individuals, measurement invariance of the K6 has not been established between these groups. We used multigroup confirmatory factor analysis to examine factorial invariance of the K6 between 1,765 LGBT and 20,632 cisgender, heterosexual individuals who completed the Well-Being and Basic Needs Survey. The K6 exhibited configural, weak/metric, and strong/scalar measurement invariance between groups, suggesting that it operates equivalently for both groups. We then examined differences in latent mean K6 scores between groups and differences in the percentage of individuals in each group who met the threshold for serious psychological distress (scores ≥ 13). The latent K6 mean and the percentage of individuals who met the threshold for serious psychological distress were both significantly higher for LGBT than for cisgender, heterosexual individuals.
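The serious-distress threshold used in this study follows standard K6 scoring: six items rated 0-4 are summed to a 0-24 total, and totals of 13 or higher are flagged as serious psychological distress. A minimal scoring sketch:

```python
def score_k6(item_responses):
    """Sum six K6 items (each rated 0-4) into a 0-24 total and
    flag serious psychological distress at the >= 13 threshold."""
    if len(item_responses) != 6:
        raise ValueError("The K6 has exactly six items")
    if any(not 0 <= r <= 4 for r in item_responses):
        raise ValueError("K6 items are rated on a 0-4 scale")
    total = sum(item_responses)
    return total, total >= 13

total, serious = score_k6([3, 2, 4, 2, 1, 3])
print(total, serious)  # 15 True -- this respondent meets the threshold
```

Note that a group difference in the percentage flagged is only interpretable because scalar invariance held; without it, the same total could reflect different latent distress levels across groups.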
Title: Psychometric Validation and Preliminary Clinical Correlation of an Experiential Foraging Task
Pub Date: 2025-09-25 | DOI: 10.1177/10731911251376214
Authors: Aaron N McInnes, Christi R P Sullivan, Angus W MacDonald, Alik S Widge
Reliably measuring the function of decision-making systems is a key goal in assessing the cognitive functions that underlie psychopathology. However, few metrics are demonstrably reliable, clinically relevant, and able to capture complex, overlapping cognitive domains while quantifying heterogeneity across individuals. The WebSurf task is a reverse-translational human experiential foraging paradigm that indexes naturalistic and clinically relevant decision-making. To determine its potential clinical utility, we examined the psychometric properties and clinical correlates of behavioral parameters extracted from WebSurf in an initial exploratory experiment (N = 132) and a preregistered validation experiment (N = 109). Behavior was stable over repeated administrations of the task, as were individual differences. The ability to measure decision-making consistently supports WebSurf's potential utility to predict treatment response, monitor clinical change, and define neurocognitive profiles associated with psychopathology. Moreover, specific WebSurf metrics were predicted by psychiatric symptoms in a replicable manner: mania and externalizing symptom profiles predicted variability in reward pursuit, while externalizing profiles also predicted reward evaluation. These replicable results suggest that WebSurf and similar paradigms offer promising platforms for computational psychological methods, providing reliable, clinically relevant metrics of decision-making that may enhance psychiatric assessment and personalize treatment approaches.
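Stability over repeated administrations, as reported for WebSurf, is commonly summarized as a correlation between the same parameter measured in two sessions. A minimal sketch with made-up session data (the parameter values are illustrative, not from the study):

```python
import statistics

def pearson_r(x, y):
    """Pearson correlation between scores from two administrations."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

# Hypothetical per-participant task parameters from two sessions
session1 = [4.2, 6.1, 5.5, 7.8, 3.9, 6.4]
session2 = [4.5, 5.8, 5.9, 7.4, 4.1, 6.7]
print(round(pearson_r(session1, session2), 2))
```

A high correlation here means the rank ordering of individuals is preserved across sessions, which is the property needed to use a task parameter as an individual-differences measure.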
Title: The Development and Validation of a Dimensional Childhood Adversity Measure
Pub Date: 2025-09-24 | DOI: 10.1177/10731911251367142
Authors: Angelina Pei-Tzu Tsai, Peter F Halpin, Lucy Lurie, Meredith Gruhn, Maya Rosen, Donald H Baucom, Michael B Sarabosing, Sneha Sai Boda, Katie A McLaughlin, Margaret A Sheridan
Research on the developmental consequences of early adversity has grown rapidly, yet measures of childhood adversity have not kept pace with evolving theoretical models. Existing measures often lack comprehensive assessment and psychometric evidence. This study addresses these gaps by developing the Deprivation and Threat-Adult Self-report (DT-AS) measure, a psychometrically sound scale assessing childhood threat and deprivation exposure, evaluated in young adults. Psychometric analysis was performed in waves on a total sample of N = 796 participants. Pilot data (n1 = 210; n2 = 208) were analyzed using Classical Test Theory (CTT) and Item Response Theory (IRT) to refine item selection and optimize response formats. The final sample (n3 = 378) confirmed a correlated factor structure of threat and deprivation with excellent psychometric properties. The DT-AS consists of 33 items measuring threat and 30 measuring deprivation, offering a robust tool to examine associations between childhood adversity and psychopathology outcomes in adulthood.
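One standard CTT step in the kind of item refinement described above is the corrected item-total correlation, which flags items that track poorly with the rest of the scale. A simplified sketch with hypothetical response data (not the DT-AS items):

```python
import numpy as np

def corrected_item_total(responses):
    """Corrected item-total correlations: each item correlated with
    the sum of the remaining items (a CTT discrimination index)."""
    responses = np.asarray(responses, dtype=float)  # persons x items
    total = responses.sum(axis=1)
    corrs = []
    for j in range(responses.shape[1]):
        rest = total - responses[:, j]  # total excluding item j
        corrs.append(np.corrcoef(responses[:, j], rest)[0, 1])
    return corrs

# Hypothetical 0-3 ratings: 5 respondents x 3 items
data = [[0, 1, 0], [1, 1, 2], [2, 2, 2], [3, 2, 3], [1, 0, 1]]
print([round(c, 2) for c in corrected_item_total(data)])
```

Items with low or negative corrected correlations are candidates for revision or removal; IRT analyses then add information about item difficulty and discrimination along the latent trait.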
Title: Identifying Latent Cognitive Profiles in Autism Spectrum Disorder and Attention-Deficit/Hyperactivity Disorder Using the Stanford-Binet Intelligence Scales-5th Edition
Pub Date: 2025-09-24 | DOI: 10.1177/10731911251369977
Authors: Soo Youn Kim, Eric A Youngstrom, Megan Norris, Ann Levine, Eric M Butter, Kevin G Stephenson
There is limited information concerning the presence of empirically derived, person-centered latent cognitive profiles in youth with autism spectrum disorder (ASD) or attention-deficit/hyperactivity disorder (ADHD) and whether these profiles are diagnostically useful. The aim of this study was to identify empirically driven cognitive subgroups in youth with ASD or ADHD and examine predictors of those profiles. A retrospective chart review was conducted with patients aged 2 to 16 years seen at a developmental assessment clinic who were identified with ASD or ADHD (n = 1,679; mean age = 8.4 years, SD = 3.1). A latent profile analysis with Stanford-Binet-Fifth Edition composites resulted in 14 profiles, which were roughly parallel to each other across various levels of cognitive functioning. Several profiles were characterized by a relatively large discrepancy between Nonverbal IQ and Verbal IQ. Younger age and higher IQ were significant predictors of membership in the scattered profiles, whereas diagnosis (i.e., ASD or ADHD), sex, and emotional-behavioral functioning were not.
Title: The GRoNC: Guidelines for Reporting on Norm-Referenced and Criterion-Referenced Scores
Pub Date: 2025-09-24 | DOI: 10.1177/10731911251371395
Authors: Marieke E Timmerman, Annelies De Bildt, Julian Urban
Psychological test manuals vary widely in their reporting of the construction and interpretation of standardized scores. Consequently, the critical evaluation of norm quality and meaning is difficult for test users and reviewers. Because a specific standard for reporting on standardized scores is lacking, we developed the Guidelines for Reporting on Norm-referenced and Criterion-referenced Scores (GRoNC), following a systematic approach for creating reporting guidelines (EQUATOR). The development took place in two stages: Stage 1, developing a preliminary version of the GRoNC based on a literature review; Stage 2, a Delphi process in two rounds, involving both theoretical experts (n = 11) and test developers (n = 14). The GRoNC includes a series of questions and associated explanations. It supports test developers in constructing and reporting on their standardized scores, and reviewers in evaluating a psychological test on its standardized scores. We provide recommendations on using the GRoNC and conclude by describing our expectations and plans to increase the impact of the GRoNC on reporting practice.
Title: Natural Language Response Formats for Assessing Depression and Worry With Large Language Models: A Sequential Evaluation With Model Pre-Registration
Pub Date: 2025-09-20 | DOI: 10.1177/10731911251364022
Authors: Zhuojun Gu, Katarina Kjell, H Andrew Schwartz, Oscar Kjell
Large language models can transform individuals' mental health descriptions into scores that correlate with rating scales at levels approaching theoretical upper limits. However, prior analyses have combined word- and text-based responses, and little is known about their differences. We developed response formats ranging from closed-ended to open-ended: (a) selecting words from lists, or writing (b) descriptive words, (c) phrases, or (d) full texts. Participants answered questions about their depression/worry using these response formats and related rating scales. Language responses were transformed into word embeddings, which were used to train models predicting rating-scale scores. We compared the validity (concurrent, incremental, face, discriminant, and external validity) and reliability (prospective-sample and test-retest reliability) of the response formats. Using the Sequential Evaluation with Model Pre-Registration design, machine-learning models were trained on a development dataset (N = 963) and then pre-registered before being tested on a prospective sample (N = 145). The pre-registered models demonstrated strong validity and reliability, yielding high accuracy in the prospective sample (r = .60-.79). Additionally, the models demonstrated external validity against self-reported sick leave/healthcare visits, where the text format yielded the strongest correlations (higher than or equal to rating scales in 9 of 12 cases). The overall high validity and reliability across formats suggest the possibility of choosing formats according to clinical needs.
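The pipeline described above (embed language responses, train a model against rating-scale scores, evaluate on held-out respondents) can be sketched with scikit-learn. The embeddings below are synthetic stand-ins, not the authors' LLM features, and ridge regression is one common choice for this mapping rather than the study's exact model:

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(0)

# Stand-ins for embedding features (n respondents x d dimensions)
# and their rating-scale totals; real pipelines use LLM embeddings.
X = rng.normal(size=(200, 32))
true_w = rng.normal(size=32)
y = X @ true_w + rng.normal(scale=0.5, size=200)  # noisy scale scores

# Cross-validated predictions approximate accuracy on unseen
# respondents, analogous to evaluating on a prospective sample.
pred = cross_val_predict(Ridge(alpha=1.0), X, y, cv=5)
r = np.corrcoef(y, pred)[0, 1]
print(round(r, 2))
```

The pre-registration step in the study goes further: the trained model is frozen and publicly registered before the prospective sample is collected, so the reported accuracy cannot benefit from analytic flexibility.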
Title: Personal Recovery in the General Population: Comparison of Psychometric Properties of the Brief INSPIRE-O in Those With and Without Common Mental Disorders
Pub Date: 2025-09-16 | DOI: 10.1177/10731911251367122
Authors: Xingjian Ruan, Femke Vergeer-Hagoort, Margreet Ten Have, Annemarie I Luik, Marlous Tuithof, Ernst Bohlmeijer, Peter M Ten Klooster
The 5-item Brief INSPIRE-O instrument, based on the Connectedness, Hope, Identity, Meaning in Life, and Empowerment framework, is a novel tool to assess personal recovery. Although initially developed for clinical populations, its conceptual alignment with core dimensions of psychological well-being suggests potential applicability to a broader audience. The current study aimed to examine its validity, reliability, and measurement invariance across people with and without common mental disorders (CMDs). The scale was administered in a Dutch general population sample (n = 5,451). Confirmatory factor analyses supported a unidimensional structure with robust factor loadings and scalar invariance across individuals with and without CMDs in the past year. In addition, the Brief INSPIRE-O showed acceptable reliability (ω = .71-.78), and the expected pattern of correlations with other health indicators supported its construct validity. In conclusion, the Brief INSPIRE-O appears to be a psychometrically sound measure of positive psychological functioning that can be validly used and compared across people with and without CMDs.
Title: Indicators of Impulsivity in Routine Clinical Assessment of Adult ADHD
Pub Date: 2025-09-16 | DOI: 10.1177/10731911251365744
Authors: Hui Dong, Anselm B M Fuermaier, Janneke Koerts, Gerdina H M Pijnenborg, Nana Guo, Ragnar Schwierczok, Norbert Scherbaum, Bernhard W Müller
Impulsivity in adult attention-deficit/hyperactivity disorder (ADHD) represents a multidimensional construct rather than a unitary trait. This study examined a proposed three-factor model of impulsivity comprising (a) self-reported impulsive behavior (Barratt Impulsiveness Scale), (b) commission errors, and (c) reaction time measures from neuropsychological tests in 654 adults undergoing routine clinical assessment of adult ADHD. Using confirmatory factor analyses on split subsamples, we found consistent support for the proposed three-factor structure, whereas network analysis favored a two-group conceptualization that separates performance-based from self-report-based measures. Self-reported impulsivity demonstrated the highest severity levels, followed by commission errors, with reaction times being least affected. Demographic and clinical characteristics significantly predicted self-report and commission-error measures but not reaction times. The results emphasize the importance of interpreting self-reports independently of performance-based tests. The coherence between commission errors and reaction time variables across tasks of related constructs suggests that administering multiple tasks may yield redundant information in the clinical assessment of impulsivity.
Title: An Analysis of Measurement Invariance of the Patient Health Questionnaire-9 Between Indonesia, Germany, and the USA
Pub Date: 2025-09-10 | DOI: 10.1177/10731911251361230
Authors: Eric Sucitra, Riangga Novrianto, Yolanda T Pasaribu, Tania M Lincoln, Edo S Jaya
The Patient Health Questionnaire-9 (PHQ-9) is a widely used screening tool for assessing depressive symptomatology. However, there is a scarcity of research on whether the instrument measures the same construct in high-income countries (HICs) and low- and middle-income countries (LMICs). Online surveys were used to assess samples from Indonesia, Germany, and the USA (N = 2,350). Measurement invariance (MI) was evaluated using multi-group confirmatory factor analyses. We found that the general factor model had a good fit and showed configural, metric, scalar, and residual MI across the three countries. There were no significant differences in mean scores (Indonesia, M = 1.87, SD = 0.56; Germany, M = 1.90, SD = 0.65; USA, M = 1.90, SD = 0.75). These results suggest that depressive symptomatology presents similarly across these distinct geographical regions, regardless of the population's income level. Hence, this study further emphasizes the urgency of developing universal, accessible assessment and treatment for depression.
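For context on the scale itself: the PHQ-9 sums nine items rated 0-3 into a 0-27 total, and a total of 10 or higher is the conventional screening cutoff for further evaluation (a general convention for the instrument, not a cutoff reported in this study). A minimal scoring sketch:

```python
def score_phq9(item_responses):
    """Sum nine PHQ-9 items (each rated 0-3) into a 0-27 total;
    the conventional screening cutoff is a total >= 10."""
    if len(item_responses) != 9:
        raise ValueError("The PHQ-9 has exactly nine items")
    if any(not 0 <= r <= 3 for r in item_responses):
        raise ValueError("PHQ-9 items are rated on a 0-3 scale")
    total = sum(item_responses)
    mean_item = total / 9  # per-item average on the 0-3 metric
    return total, mean_item, total >= 10

total, mean_item, flag = score_phq9([2, 1, 3, 2, 1, 0, 2, 1, 1])
print(total, round(mean_item, 2), flag)  # 13 1.44 True
```

Establishing scalar invariance, as this study does, is what licenses comparing such scores and cutoff rates across countries.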
Title: The Big Five Inventory-2 in Korea: Validation and Cross-Cultural Comparisons with the U.S. and Chinese Versions
Pub Date: 2025-09-10 | DOI: 10.1177/10731911251357466
Authors: Jinsoo Choi, Nanhee Kim, Bo Zhang, Sang Woo Park, Seonghee Cho, Young Woo Sohn, Christopher J Soto, Oliver P John
The Big Five Inventory-2 (BFI-2) has enjoyed global popularity due to its good balance of content coverage and brevity, and it has been officially translated into 12 languages in addition to the original English version. The current study aimed to further enhance the cultural accessibility of the BFI-2 by translating it into Korean and comprehensively validating the Korean version in two South Korean samples: working adults and college students. Across the two samples, the Korean BFI-2 demonstrated good reliability (e.g., test-retest reliability, Cronbach's alpha), construct validity (e.g., convergent/discriminant validity), and criterion-related validity with a wide range of outcome measures. Additionally, we compared the psychometric properties of the Korean BFI-2 to those of the original English and Chinese versions to further establish its comparability with other language versions. Overall, our results demonstrate that the Korean BFI-2 is a reliable and valid personality measure that can be confidently used by other researchers. Implications, limitations, and future directions are discussed.
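Cronbach's alpha, one of the reliability indices reported for the Korean BFI-2, can be computed directly from a persons-by-items response matrix. A self-contained sketch with illustrative Likert data (not BFI-2 responses):

```python
import statistics

def cronbach_alpha(items):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances /
    variance of total scores). `items` is a persons x items matrix."""
    k = len(items[0])
    item_vars = [statistics.variance(col) for col in zip(*items)]
    totals = [sum(row) for row in items]
    return k / (k - 1) * (1 - sum(item_vars) / statistics.variance(totals))

# Hypothetical 1-5 Likert responses: 6 respondents x 4 items
data = [
    [4, 4, 5, 4],
    [2, 3, 2, 2],
    [5, 4, 5, 5],
    [3, 3, 3, 4],
    [1, 2, 1, 2],
    [4, 5, 4, 4],
]
print(round(cronbach_alpha(data), 2))
```

Alpha rises when items covary strongly relative to their individual variances; values in the .70s-.90s are typically read as acceptable to good for trait scales.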