Latent structure and measurement invariance of the Depression Self-Rating Scale for Children across sex and age.
Pub Date: 2024-09-01 | Epub Date: 2024-07-18 | DOI: 10.1037/pas0001327 | Psychological Assessment, pp. 552-561
Haley E Green, Lindsay N Gabel, Emma K Stewart, Yuliya Kotelnikova, Elizabeth P Hayden
Measurement tools from which valid interpretations can be made are critical for assessing early emerging depressive symptoms, as depressive symptoms in childhood are associated with increased risk for early-onset depressive disorder, recurrence, suicidality, and other psychopathology. The Depression Self-Rating Scale for Children (DSRS) is a widely used self-report scale assessing youth depressive symptoms. The relatively few studies investigating the DSRS' latent structure have yielded mixed results, and measurement invariance (MI) based on sex and age has not been examined. We examined the factor structure and MI of the DSRS across sex and age in a community sample of 6-9-year-olds (N = 352; Mage = 7.57 years, SD = .70). Consistent with the largest prior structural study of the DSRS, a two-factor structure, with factors reflecting elevated negative affect (NA) and low positive affect (PA), showed strong model fit. Although this structure was consistent across sex and age (i.e., configural invariance), loadings of DSRS items varied across sex and age (i.e., metric noninvariance). Allowing the loadings of items contributing to noninvariance to vary across groups improved model fit. Implications for the clinical and research utility of the DSRS and suggestions for future research are discussed. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
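The multigroup confirmatory analysis described above requires dedicated SEM software, but the underlying question (does the same two-factor NA/low-PA structure, with similar loadings, hold in each group?) can be sketched in Python. The sketch below assumes the factor_analyzer package and uses simulated item responses with hypothetical column names; it fits a two-factor solution separately by sex and compares loading patterns with Tucker's congruence coefficient, as a rough stand-in for configural/metric invariance testing rather than the authors' procedure.

```python
# Rough stand-in for the multigroup analysis (not the authors' procedure),
# assuming the factor_analyzer package. Data and column names are simulated.
import numpy as np
import pandas as pd
from factor_analyzer import FactorAnalyzer

rng = np.random.default_rng(1)
latent = rng.normal(size=(352, 2))  # two simulated factors (NA, low PA)
items = latent @ rng.normal(size=(2, 18)) + rng.normal(scale=0.8, size=(352, 18))
df = pd.DataFrame(items, columns=[f"dsrs{i}" for i in range(1, 19)])
df["sex"] = rng.choice(["female", "male"], size=352)

def two_factor_loadings(item_data: pd.DataFrame) -> np.ndarray:
    """Fit an oblique two-factor solution and return the item-by-factor loadings."""
    fa = FactorAnalyzer(n_factors=2, rotation="oblimin")
    fa.fit(item_data)
    return fa.loadings_

def congruence(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Tucker's congruence coefficients between two loading matrices (columnwise)."""
    a_norm = a / np.linalg.norm(a, axis=0)
    b_norm = b / np.linalg.norm(b, axis=0)
    return a_norm.T @ b_norm

girls = two_factor_loadings(df[df["sex"] == "female"].drop(columns="sex"))
boys = two_factor_loadings(df[df["sex"] == "male"].drop(columns="sex"))
print(np.round(congruence(girls, boys), 2))  # values near |1| suggest similar loading patterns
```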
{"title":"Latent structure and measurement invariance of the Depression Self-Rating Scale for Children across sex and age.","authors":"Haley E Green, Lindsay N Gabel, Emma K Stewart, Yuliya Kotelnikova, Elizabeth P Hayden","doi":"10.1037/pas0001327","DOIUrl":"10.1037/pas0001327","url":null,"abstract":"<p><p>Measurement tools from which valid interpretations can be made are critical for assessing early emerging depressive symptoms, as depressive symptoms in childhood are associated with increased risk for early-onset depressive disorder, recurrence, suicidality, and other psychopathology. The Depression Self-Rating Scale for Children (DSRS) is a widely used self-report scale assessing youth depressive symptoms. The relatively few studies investigating the DSRS' latent structure have yielded mixed results, and measurement invariance (MI) based on sex and age has not been examined. We examined the factor structure and MI of the DSRS across sex and age in a community sample of 6-9-year-olds (<i>N</i> = 352; <i>M</i><sub>age</sub> = 7.57 years, <i>SD</i> = .70). Consistent with the largest prior structural study of the DSRS, a two-factor structure, with factors reflecting elevated negative affect (NA) and low positive affect (PA), showed strong model fit. Although this structure was consistent across sex and age (i.e., configural invariance), loadings of DSRS items varied across sex and age (i.e., metric noninvariance). Allowing the loadings of items contributing to noninvariance to vary across groups improved model fit. Implications for the clinical and research utility of the DSRS and suggestions for future research are discussed. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"552-561"},"PeriodicalIF":3.3,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141634303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Identifying analogue samples of individuals with clinically significant social anxiety: Updating and combining cutoff scores on the Social Phobia Inventory and Sheehan Disability Scale.
Pub Date: 2024-09-01 | Epub Date: 2024-06-20 | DOI: 10.1037/pas0001328 | Psychological Assessment, pp. 513-525
Sophie M Kudryk, Jolie T K Ho, Joshua R C Budge, David A Moscovitch
The use of analogue samples, as opposed to clinical groups, is common in mental health research, including research on social anxiety disorder (SAD). Recent observational and statistical evidence has raised doubts about the validity of current methods for establishing analogue samples of individuals with clinically significant social anxiety. Here, we used data from large community samples of clinical and nonclinical participants to determine new cutoff scores on self-report measures of social anxiety symptoms and symptom-related impairment. We then examined whether using these newly determined cutoff scores alone or in combination improves the discrimination of individuals who have SAD from those who do not, revealing the optimal cutoff combination to be 34 or above on the Social Phobia Inventory and 11 or above on the Sheehan Disability Scale. Finally, we compared the effects of our new cutoff scores with old cutoff scores by extracting analogue samples of participants with high social anxiety from historical data on seven large groups of undergraduate psychology research participants from the authors' institution spanning the past 5 years (2018-2023). We observed that the new combined cutoff scores identified markedly fewer students as having high social anxiety, lending credibility to their utility. We also observed a striking increase in levels of social anxiety symptoms in the undergraduate population from before to after the COVID-19 pandemic. Of note, most participants were under 30 and identified as Caucasian or Asian women, indicating that future research is needed to examine whether our findings generalize to diverse populations. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
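The combined rule reported above is straightforward to operationalize. A minimal sketch, assuming a pandas DataFrame with hypothetical column names spin_total and sds_total, flags analogue participants who meet both cutoffs:

```python
# Minimal sketch: flag "analogue" high-social-anxiety participants using the
# combined cutoffs reported in the abstract (SPIN >= 34 and SDS >= 11).
# Column names ('spin_total', 'sds_total') are hypothetical.
import pandas as pd

def flag_analogue(df: pd.DataFrame,
                  spin_cut: int = 34,
                  sds_cut: int = 11) -> pd.Series:
    """Return a boolean Series marking participants who meet both cutoffs."""
    return (df["spin_total"] >= spin_cut) & (df["sds_total"] >= sds_cut)

data = pd.DataFrame({"spin_total": [40, 28, 36], "sds_total": [12, 15, 9]})
data["analogue"] = flag_analogue(data)
print(data)  # only the first row meets both cutoffs
```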
{"title":"Identifying analogue samples of individuals with clinically significant social anxiety: Updating and combining cutoff scores on the Social Phobia Inventory and Sheehan Disability Scale.","authors":"Sophie M Kudryk, Jolie T K Ho, Joshua R C Budge, David A Moscovitch","doi":"10.1037/pas0001328","DOIUrl":"10.1037/pas0001328","url":null,"abstract":"<p><p>The use of analogue samples, as opposed to clinical groups, is common in mental health research, including research on social anxiety disorder (SAD). Recent observational and statistical evidence has raised doubts about the validity of current methods for establishing analogue samples of individuals with clinically significant social anxiety. Here, we used data from large community samples of clinical and nonclinical participants to determine new cutoff scores on self-report measures of social anxiety symptoms and symptom-related impairment. We then examined whether using these newly determined cutoff scores alone or in combination improves the identification of individuals who have SAD from those who do not, revealing the most ideal cutoff combination to be 34 or above on the Social Phobia Inventory and 11 or above on the Sheehan Disability Scale. Finally, we compared the effects of our new cutoff scores with old cutoff scores by extracting analogue samples of participants with high social anxiety from historical data on seven large groups of undergraduate Psychology research participants from the authors' institution spanning the past 5 years (2018-2023). We observed that the new combined cutoff scores identified markedly fewer students as having high social anxiety, lending credibility to their utility. We also observed a striking increase in levels of social anxiety symptoms in the undergraduate population from before to after the COVID-19 pandemic. Of note, most participants were under 30 and identified as Caucasian or Asian women, indicating that future research is needed to examine whether our findings generalize to diverse populations. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"513-525"},"PeriodicalIF":3.3,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141427456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Psychometric properties of the German versions of the Problem Areas in Diabetes Scale for Children (PAID-C) with Type 1 Diabetes and Their Parents (P-PAID-C).
Su-Jong Kim-Dorner, Heike Saßmann, Juliane R Framme, Bettina Heidtmann, Thomas M Kapellen, Olga Kordonouri, Karolin M E Nettelrodt, Nicole Pisarek, Roland Schweizer, Simone von Sengbusch, Karin Lange
Pub Date: 2024-09-01 | DOI: 10.1037/pas0001338 | Psychological Assessment, pp. e38-e50
Children with Type 1 diabetes (T1D) and their parent-caregivers often experience diabetes distress due to the daily demands of diabetes management. Regular screening for diabetes distress is needed to prevent the deterioration of metabolic control and the development of mental health disorders. The aim of this analysis was to examine the psychometric properties of the German versions of the Problem Areas in Diabetes Scale for Children (PAID-C) and for caregiver burden in Parents (P-PAID-C). Data were collected from 136 children aged 7-12 years (46.7% female) and 304 parents (Mage = 42.9 years, SD = 6.1; 78% mothers) using linguistically translated questionnaires in a multicenter study. Confirmatory factor analysis and correlational analyses were conducted. Results confirmed the two-factor model for the PAID-C and the four-factor model for the P-PAID-C with a slight modification. Cronbach's αs for children and parents were 0.88 and 0.92, respectively. The PAID-C and P-PAID-C scores had small positive associations with HbA1c (rs = .220 and .139, respectively, all p < .05) and strong inverse associations with the KIDSCREEN-10 index (r = -.643 and -.520, respectively, all p < .001). P-PAID-C scores increased with increasing parental depressive symptoms measured with the nine-item Patient Health Questionnaire (rs = .534, p < .001). The scores produced by the German PAID-C and P-PAID-C were reliable and valid in measuring diabetes burden. These German versions of the PAID can be used to assess diabetes-specific distress and to design interventions for children and their parents experiencing high levels of diabetes distress. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
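Two of the statistics reported here, Cronbach's α for internal consistency and Spearman correlations with criterion variables such as HbA1c, are simple to compute directly. The sketch below uses simulated data and hypothetical item names; it illustrates the formulas rather than the authors' analysis code.

```python
# Cronbach's alpha and a Spearman correlation with a criterion variable,
# using simulated responses and hypothetical PAID-C item names.
import numpy as np
import pandas as pd
from scipy.stats import spearmanr

def cronbach_alpha(items: pd.DataFrame) -> float:
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / variance of total score)."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)
    total_var = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

rng = np.random.default_rng(0)
items = pd.DataFrame(rng.integers(0, 5, size=(136, 11)),       # 11 hypothetical PAID-C items
                     columns=[f"paid_c_{i}" for i in range(1, 12)])
hba1c = rng.normal(7.5, 1.0, size=136)                          # simulated HbA1c values

print("alpha:", round(cronbach_alpha(items), 3))
rho, p = spearmanr(items.sum(axis=1), hba1c)
print("Spearman rho with HbA1c:", round(rho, 3), "p =", round(p, 3))
```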
{"title":"Psychometric properties of the German versions of the Problem Areas in Diabetes Scale for Children (PAID-C) with Type 1 Diabetes and Their Parents (P-PAID-C).","authors":"Su-Jong Kim-Dorner,Heike Saßmann,Juliane R Framme,Bettina Heidtmann,Thomas M Kapellen,Olga Kordonouri,Karolin M E Nettelrodt,Nicole Pisarek,Roland Schweizer,Simone von Sengbusch,Karin Lange","doi":"10.1037/pas0001338","DOIUrl":"https://doi.org/10.1037/pas0001338","url":null,"abstract":"Children with Type 1 diabetes (T1D) and their parent-caregivers often experience diabetes distress due to the daily demands of diabetes management. Regular screening for diabetes distress is needed to prevent the deterioration of metabolic control and the development of mental health disorders. The aim of this analysis was to examine the psychometric properties of the German versions of the Problem Areas in Diabetes Scale for Children (PAID-C) and for caregiver burden in Parents (P-PAID-C). Data were collected from 136 children aged 7-12 years (46.7% females) and 304 parents (Mage = 42.9 (SD 6.1) years; 78% mothers) by using linguistically translated questionnaires in a multicenter study. Confirmatory factor analysis and correlational analyses were conducted. Results confirmed the two-factor model for the PAID-C and the four-factor model for the P-PAID-C with a slight modification. Cronbach's αs for children and parents were 0.88 and 0.92, respectively. The PAID-C and P-PAID-C scores had small positive associations with HbA1c (rs = .220 and .139, respectively, all p < .05) and strong inverse association with the KIDSCREEN-10 index (r = -.643 and -.520, respectively, all p < .001). P-PAID-C scores increased with increasing depressive symptoms measured in nine-item Patient Health Questionnaire among parents (rs = .534, p < .001). The scores produced by the German PAID-C and P-PAID-C were reliable and valid in measuring diabetes burdens. These German versions of PAID can be utilized to assess diabetes-specific distress and to design interventions for children and their parents experiencing high levels of diabetes distress. (PsycInfo Database Record (c) 2024 APA, all rights reserved).","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":"19 1","pages":"e38-e50"},"PeriodicalIF":3.6,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Inventory of Callous-Unemotional Traits (ICU) self-report version: Factor structure, measurement invariance, and predictive validity in justice-involved male adolescents.
Pub Date: 2024-09-01 | Epub Date: 2024-06-20 | DOI: 10.1037/pas0001322 | Psychological Assessment, pp. 562-571
Emily C Kemp, James V Ray, Paul J Frick, Laura C Thornton, Tina D Wall Myers, Emily L Robertson, Laurence Steinberg, Elizabeth Cauffman
The Inventory of Callous-Unemotional Traits (ICU) is a widely used measure of callous-unemotional (CU) traits that may aid in the assessment of the diagnostic specifier "with limited prosocial emotions," which has been added to diagnostic criteria for conduct disorder. Though there is substantial support for use of the ICU total score, the scale's factor structure has been highly debated. Inconsistencies in past factor analyses may be largely attributed to failure to control for method variance due to item wording (i.e., half of the items being worded in the callous direction and half worded in the prosocial direction). Thus, the present study used a multitrait-multimethod confirmatory factor analytic approach that models both trait and method variance to test the factor structure of the ICU self-report in a clinically relevant, high-risk sample of justice-involved male adolescents (N = 1,216). When comparing the fit of empirical and theoretical models, goodness of fit indices (χ² = 1105.877, df = 190, root-mean-square error of approximation = .063, comparative fit index = .916, Tucker-Lewis index = .878, standardized root-mean-square residual = .051) provided support for a hierarchical four-factor model (i.e., one overarching callous-unemotional factor, four latent trait factors) when accounting for method variance (i.e., covarying positively worded items). This factor structure is consistent with the way the ICU was constructed and with criteria for the limited prosocial emotions specifier. In addition, measurement invariance of this factor structure across age, race, and ethnicity was supported, and the predictive validity of the ICU was supported across these demographic groups in predicting self-reported antisocial behavior and rearrests over a 5-year period following an adolescent's first arrest. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
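The approximate fit indices reported above are functions of the model and baseline chi-square statistics. The helper below implements the standard textbook formulas (SEM software may differ in minor details such as using N versus N − 1); plugging in the reported χ² = 1105.877, df = 190, and N = 1,216 reproduces the RMSEA of about .063, while the baseline values needed for CFI and TLI are not given in the abstract and would have to come from the fitted models.

```python
# Standard formulas for RMSEA, CFI, and TLI from chi-square statistics.
# SEM packages may differ slightly (e.g., N vs. N - 1 in the RMSEA denominator).
import math

def rmsea(chi2: float, df: int, n: int) -> float:
    """Root-mean-square error of approximation."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))

def cfi(chi2: float, df: int, chi2_base: float, df_base: int) -> float:
    """Comparative fit index relative to the baseline (independence) model."""
    d_model = max(chi2 - df, 0.0)
    d_base = max(chi2_base - df_base, d_model)
    return 1.0 - d_model / d_base if d_base > 0 else 1.0

def tli(chi2: float, df: int, chi2_base: float, df_base: int) -> float:
    """Tucker-Lewis index (non-normed fit index)."""
    return ((chi2_base / df_base) - (chi2 / df)) / ((chi2_base / df_base) - 1.0)

print(round(rmsea(1105.877, 190, 1216), 3))  # ~0.063, matching the reported value
```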
{"title":"The Inventory of Callous-Unemotional Traits (ICU) self-report version: Factor structure, measurement invariance, and predictive validity in justice-involved male adolescents.","authors":"Emily C Kemp, James V Ray, Paul J Frick, Laura C Thornton, Tina D Wall Myers, Emily L Robertson, Laurence Steinberg, Elizabeth Cauffman","doi":"10.1037/pas0001322","DOIUrl":"10.1037/pas0001322","url":null,"abstract":"<p><p>The Inventory of Callous-Unemotional Traits (ICU) is a widely used measure of callous-unemotional (CU) traits that may aid in the assessment of the diagnostic specifier \"with limited prosocial emotions,\" which has been added to diagnostic criteria for conduct disorder. Though there is substantial support for use of the ICU total score, the scale's factor structure has been highly debated. Inconsistencies in past factor analyses may be largely attributed to failure to control for method variance due to item wording (i.e., half of the items being worded in the callous direction and half worded in the prosocial direction). Thus, the present study used a multitrait-multimethod confirmatory factor analytic approach that models both trait and method variance to test the factor structure of the ICU self-report in a clinically relevant, high-risk sample of justice-involved male adolescents (<i>N</i> = 1,216). When comparing the fit of empirical and theoretical models, goodness of fit indices (χ² = 1105.877, <i>df</i> = 190, root-mean-square error of approximation = .063, comparative fit index = .916, Tucker-Lewis index = .878, standardized root-mean-square residual = .051) provided support for a hierarchical four-factor model (i.e., one overarching callous-unemotional factor, four latent trait factors) when accounting for method variance (i.e., covarying positively worded items). This factor structure is consistent with the way the ICU was constructed and with criteria for the limited prosocial emotions specifier. In addition, measurement invariance of this factor structure across age, race, and ethnicity was supported, and the predictive validity of the ICU was supported across these demographic groups in predicting self-reported antisocial behavior and rearrests over a 5-year period following an adolescent's first arrest. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"562-571"},"PeriodicalIF":3.3,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141427460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Measurement invariance of the Child Behavior Checklist (CBCL) across race/ethnicity and sex in the Adolescent Brain and Cognitive Development (ABCD) study.
Pub Date: 2024-08-01 | Epub Date: 2024-05-23 | DOI: 10.1037/pas0001319 | Psychological Assessment, pp. 441-451
Lindsey C Stewart, Shayan Asadi, Craig Rodriguez-Seijas, Sylia Wilson, Giorgia Michelini, Roman Kotov, David C Cicero, Thomas M Olino
There are numerous studies examining differences in the experience of disorders and symptoms of psychopathology in adolescents across racial or ethnic groups and sex. Though there is substantial research exploring potential factors that may influence these differences, few studies have considered the potential contribution of measurement properties to these differences. Therefore, this study examined whether there are differences across racial or ethnic groups and sex in the measurement of psychopathology, assessed via mother-reported behavior of 9- to 11-year-old youth from the Adolescent Brain Cognitive Development study sample using updated Child Behavior Checklist scales (CBCL; Achenbach & Rescorla, 2001). Tests of measurement invariance of the CBCL utilized the higher order factor structure identified by Michelini et al. (2019) using this same Adolescent Brain Cognitive Development cohort. The dimensions include internalizing, somatoform, detachment, externalizing, and neurodevelopmental problems. The configural model had a good-to-excellent fit on all subscales of the CBCL across racial or ethnic groups and sex. The metric and scalar models fit just as well as the configural models, indicating that the scales measure the same constructs across racial or ethnic groups and sex and are not influenced by measurement properties of items on the CBCL, although some high-severity response options were not endorsed for youth in all racial or ethnic groups. These findings support the use of the CBCL in research examining psychopathology in racially or ethnically diverse samples of youth. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
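Configural, metric, and scalar models form a nested sequence, so each step can be compared with a chi-square difference (likelihood-ratio) test and with the widely used ΔCFI ≤ .01 heuristic. The sketch below shows that comparison with hypothetical fit values; it is not the ABCD analysis itself.

```python
# Nested-model comparison behind configural -> metric -> scalar invariance testing:
# a chi-square difference test plus the common Delta-CFI <= .01 heuristic.
# The fit values below are hypothetical.
from dataclasses import dataclass
from scipy.stats import chi2 as chi2_dist

@dataclass
class Fit:
    chi2: float
    df: int
    cfi: float

def compare(restricted: Fit, free: Fit, delta_cfi_cut: float = 0.01) -> dict:
    """Compare a more-constrained model against a less-constrained nested model."""
    d_chi2 = restricted.chi2 - free.chi2
    d_df = restricted.df - free.df
    p = chi2_dist.sf(d_chi2, d_df)
    d_cfi = free.cfi - restricted.cfi
    return {"delta_chi2": d_chi2, "delta_df": d_df, "p": p,
            "delta_cfi": d_cfi, "invariance_supported": d_cfi <= delta_cfi_cut}

configural = Fit(chi2=812.4, df=400, cfi=0.952)   # hypothetical values
metric = Fit(chi2=838.9, df=420, cfi=0.949)
print(compare(metric, configural))
```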
{"title":"Measurement invariance of the Child Behavior Checklist (CBCL) across race/ethnicity and sex in the Adolescent Brain and Cognitive Development (ABCD) study.","authors":"Lindsey C Stewart, Shayan Asadi, Craig Rodriguez-Seijas, Sylia Wilson, Giorgia Michelini, Roman Kotov, David C Cicero, Thomas M Olino","doi":"10.1037/pas0001319","DOIUrl":"10.1037/pas0001319","url":null,"abstract":"<p><p>There are numerous studies examining differences in the experience of disorders and symptoms of psychopathology in adolescents across racial or ethnic groups and sex. Though there is substantial research exploring potential factors that may influence these differences, few studies have considered the potential contribution of measurement properties to these differences. Therefore, this study examined whether there are differences across racial or ethnic groups and sex in the measurement of psychopathology, assessed in mother-reported behavior of 9-11 year old youth from the Adolescent Brain Cognitive Development study sample using updated Child Behavior Checklist scales (CBCL; Achenbach & Rescorla, 2001). Tests of measurement invariance of the CBCL utilized the higher order factor structure identified by Michelini et al. (2019) using this same Adolescent Brain Cognitive Development cohort. The dimensions include internalizing, somatoform, detachment, externalizing, and neurodevelopmental problems. The configural model had a good-to-excellent fit on all subscales of the CBCL across racial or ethnic groups and sex. The metric and scalar models fit just as well as the configural models, indicating that the scales are measuring the same constructs across racial or ethnic groups and sex and are not influenced by measurement properties of items on the CBCL, although some high-severity response options were not endorsed for youth in all racial or ethnic groups. These findings support the use of the CBCL in research examining psychopathology in racially or ethnically diverse samples of youth. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"441-451"},"PeriodicalIF":3.3,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11801408/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141082122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Clinical Assessment of Prosocial Emotions (CAPE): Initial tests of reliability and validity in a clinic-referred sample of children and adolescents.
Pub Date: 2024-08-01 | Epub Date: 2024-05-06 | DOI: 10.1037/pas0001320 | Psychological Assessment, pp. 452-461
Courtney M Goetz, Taylor A Miller, Paul J Frick
Recent changes to diagnostic criteria for serious conduct problems in children and adolescents have included the presence of elevated callous-unemotional traits to define etiologically and clinically important subgroups of youth with a conduct problem diagnosis. The Clinical Assessment of Prosocial Emotions (CAPE) is an intensive assessment of the symptoms of this limited prosocial emotions specifier that uses a structured professional judgment method of scoring, which may make it useful in clinical settings when diagnoses may require more information than that provided by behavior rating scales. The present study adds to the limited tests of the CAPE's reliability and validity, using a sample of clinic-referred children aged 6-17 years, all of whom were administered the CAPE by trained clinicians. The mean age of the sample was 10.13 years (SD = 2.64); 54% of the sample identified as male and 46% identified as female; and 67% of participants identified as White, 29% identified as Black, and 52% identified as another race/ethnicity (i.e., Asian, Hispanic/Latinx, or other). The findings indicated that CAPE scores demonstrated strong interrater reliability. The scores also were associated with measures of conduct problems and aggression, even when controlling for behavior ratings of callous-unemotional traits. Further, when children with conduct problem diagnoses were divided into groups based on the presence of the limited prosocial emotions specifier from the CAPE, the subgroup with the specifier showed more severe conduct problems and aggression. The results support cautious clinical use of the CAPE, its further development and testing, and research into ways to make its use feasible in many clinical settings. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
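The abstract reports strong interrater reliability without naming the coefficient. As one plausible illustration (an assumption, not the authors' choice of statistic), Cohen's κ for two raters' binary limited-prosocial-emotions decisions can be computed with scikit-learn:

```python
# Cohen's kappa for two raters' limited-prosocial-emotions specifier decisions.
# The 0/1 ratings below are hypothetical; an ICC would be used for dimensional scores.
from sklearn.metrics import cohen_kappa_score

rater_a = [1, 0, 0, 1, 1, 0, 1, 0, 0, 1]
rater_b = [1, 0, 1, 1, 1, 0, 1, 0, 0, 0]
print("Cohen's kappa:", round(cohen_kappa_score(rater_a, rater_b), 2))
```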
{"title":"The Clinical Assessment of Prosocial Emotions (CAPE): Initial tests of reliability and validity in a clinic-referred sample of children and adolescents.","authors":"Courtney M Goetz, Taylor A Miller, Paul J Frick","doi":"10.1037/pas0001320","DOIUrl":"10.1037/pas0001320","url":null,"abstract":"<p><p>Recent changes to diagnostic criteria for serious conduct problems in children and adolescents have included the presence of elevated callous-unemotional traits to define etiologically and clinically important subgroups of youth with a conduct problem diagnosis. The Clinical Assessment of Prosocial Emotions (CAPE) is an intensive assessment of the symptoms of this limited prosocial emotions specifier that uses a structured professional judgment method of scoring, which may make it useful in clinical settings when diagnoses may require more information than that provided by behavior rating scales. The present study adds to the limited tests of the CAPE's reliability and validity, using a sample of clinic-referred children ages 6-17 years of age, who were all administered the CAPE by trained clinicians. The mean age of the sample was 10.13 years (<i>SD</i> = 2.64); 54% of the sample identified as male and 46% identified as female; and 67% of participants identified as White, 29% identified as Black, and 52% identified as another race/ethnicity (i.e., Asian, Hispanic/Latinx, or other). The findings indicated that CAPE scores demonstrated strong interrater reliability. The scores also were associated with measures of conduct problems and aggression, even when controlling for behavior ratings of callous-unemotional traits. Further, when children with conduct problem diagnoses were divided into groups based on the presence of the limited prosocial emotions specifier from the CAPE, the subgroup with the specifier showed more severe conduct problems and aggression. The results support cautious clinical use of the CAPE, its further development and testing, and research into ways to make its use feasible in many clinical settings. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"452-461"},"PeriodicalIF":3.3,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140851157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Longitudinal invariance of the Patient Health Questionnaire-9 among patients receiving pharmacotherapy for major depressive disorder: A secondary analysis of clinical trial data.
Pub Date: 2024-08-01 | Epub Date: 2024-05-16 | DOI: 10.1037/pas0001317 | Psychological Assessment, pp. 462-471
Daniel J Reis, Adam R Kinney, Jeri E Forster, Kelly A Stearns-Yoder, Julie A Kittel, Amanda E Wood, David W Oslin, Lisa A Brenner, Joseph A Simonetti
Comparing self-reported symptom scores across time requires longitudinal measurement invariance (LMI), a psychometric property that means the measure is functioning identically across all time points. Despite its prominence as a measure of depression symptom severity in both research and health care, LMI has yet to be firmly established for the Patient Health Questionnaire-9 depression module (PHQ-9), particularly over the course of antidepressant pharmacotherapy. Accordingly, the objective of this study was to assess LMI of the PHQ-9 during pharmacotherapy for major depressive disorder. This was a secondary analysis of data collected during a randomized controlled trial. A total of 1,944 veterans began antidepressant monotherapy and completed the PHQ-9 six times over 24 weeks of treatment. LMI was assessed using a series of four confirmatory factor analysis models that included all six time points, with estimated parameters increasingly constrained across models to test for different aspects of invariance. Values of the root-mean-square error of approximation for the chi-square difference test below 0.06 indicated the presence of LMI. Exploratory LMI analyses were also performed for separate sex, age, and race subgroups. The root-mean-square error of approximation for the chi-square difference test showed minimal change in model fit during invariance testing (≤ 0.06 for all steps), supporting full LMI for the PHQ-9. LMI was also supported for all tested veteran subgroups. As such, PHQ-9 sum scores can be compared across extended pharmacotherapy treatment durations. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
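One common formulation of the RMSEA for a chi-square difference test divides the excess of Δχ² over Δdf by Δdf × (N − 1); the exact variant used in the article may differ. A minimal helper, with hypothetical Δχ² and Δdf and the reported N = 1,944, is shown below.

```python
# RMSEA computed on the chi-square difference between nested models; one common
# formulation, which may differ from the exact variant used in the article.
import math

def rmsea_d(delta_chi2: float, delta_df: int, n: int) -> float:
    """RMSEA of the chi-square difference test for nested models."""
    return math.sqrt(max(delta_chi2 - delta_df, 0.0) / (delta_df * (n - 1)))

# Hypothetical metric-vs.-configural comparison for N = 1,944 (the reported sample size).
print(round(rmsea_d(delta_chi2=61.0, delta_df=40, n=1944), 3))  # well below the 0.06 criterion
```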
{"title":"Longitudinal invariance of the Patient Health Questionnaire-9 among patients receiving pharmacotherapy for major depressive disorder: A secondary analysis of clinical trial data.","authors":"Daniel J Reis, Adam R Kinney, Jeri E Forster, Kelly A Stearns-Yoder, Julie A Kittel, Amanda E Wood, David W Oslin, Lisa A Brenner, Joseph A Simonetti","doi":"10.1037/pas0001317","DOIUrl":"10.1037/pas0001317","url":null,"abstract":"<p><p>Comparing self-reported symptom scores across time requires longitudinal measurement invariance (LMI), a psychometric property that means the measure is functioning identically across all time points. Despite its prominence as a measure of depression symptom severity in both research and health care, LMI has yet to be firmly established for the Patient Health Questionnaire-9 depression module (PHQ-9), particularly over the course of antidepressant pharmacotherapy. Accordingly, the objective of this study was to assess for LMI of the PHQ-9 during pharmacotherapy for major depressive disorder. This was a secondary analysis of data collected during a randomized controlled trial. A total of 1,944 veterans began antidepressant monotherapy and completed the PHQ-9 six times over 24 weeks of treatment. LMI was assessed using a series of four confirmatory factor analysis models that included all six time points, with estimated parameters increasingly constrained across models to test for different aspects of invariance. Root-mean-square error of approximation of the chi-square difference test values below 0.06 indicated the presence of LMI. Exploratory LMI analyses were also performed for separate sex, age, and race subgroups. Root-mean-square error of approximation of the chi-square difference test showed minimal change in model fits during invariance testing (≤ 0.06 for all steps), supporting full LMI for the PHQ-9. LMI was also supported for all tested veteran subgroups. As such, PHQ-9 sum scores can be compared across extended pharmacotherapy treatment durations. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"462-471"},"PeriodicalIF":3.3,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140945720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Locating triarchic model constructs in the hierarchical structure of a comprehensive trait-based psychopathy measure: Implications for research and clinical assessment.
Pub Date: 2024-08-01 | Epub Date: 2024-06-20 | DOI: 10.1037/pas0001321 | Psychological Assessment, pp. 472-487
Keanan J Joyner, Keenan Roberts, Ashley L Watts, Kelsey L Lowman, Robert D Latzman, Scott O Lilienfeld, Christopher J Patrick
The triarchic model posits that distinct trait constructs of boldness, meanness, and disinhibition underlie psychopathy. The triarchic model traits are conceptualized as biobehavioral dimensions that can be assessed using different sets of indicators from alternative measurement modalities; as such, the triarchic model would hypothesize that these traits are not confined to any one item set. The present study tested whether the triarchic model dimensions would emerge from a hierarchical-structural analysis of the facet scales of the Elemental Psychopathy Assessment (EPA), an inventory designed to comprehensively index psychopathy according to the five-factor personality model. Study participants (Ns = 811, 170) completed the EPA and three different scale sets assessing the triarchic traits along with criterion measures of antisocial/externalizing behaviors. Bass-ackwards modeling of the EPA facet scales revealed a four-level structure, with factors at the third level appearing similar to the triarchic trait dimensions. An analysis in which scores for the Level-3 EPA factors were regressed onto corresponding latent-trait dimensions defined using the different triarchic scale sets revealed extremely high convergence (βs = .84-.91). The Level-3 EPA factors also evidenced validity in relation to relevant criteria, approximating and sometimes exceeding that evident for the Level-4 EPA factors. Together, these results indicate that the triarchic trait constructs are embedded in a psychopathy inventory designed to align with a general personality model and effectively predict pertinent external criteria. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
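Bass-ackwards modeling extracts component solutions of increasing size from the same facet-score matrix and traces the hierarchy by correlating scores across adjacent levels. The sketch below illustrates the idea with simulated data and plain (unrotated) principal components; published applications typically use rotated components, so this is a simplification rather than the authors' implementation.

```python
# Simplified bass-ackwards sketch: score 1- to 4-component solutions on the same
# data and correlate scores across adjacent levels to trace the hierarchy.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = StandardScaler().fit_transform(rng.normal(size=(811, 18)))  # simulated facet scores

levels = {k: PCA(n_components=k).fit_transform(X) for k in range(1, 5)}

for k in range(1, 4):
    # Correlations between level-k and level-(k+1) component scores
    lower, upper = levels[k], levels[k + 1]
    corr = np.corrcoef(lower.T, upper.T)[:k, k:]
    print(f"level {k} -> level {k + 1}:\n{np.round(corr, 2)}")
```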
{"title":"Locating triarchic model constructs in the hierarchical structure of a comprehensive trait-based psychopathy measure: Implications for research and clinical assessment.","authors":"Keanan J Joyner, Keenan Roberts, Ashley L Watts, Kelsey L Lowman, Robert D Latzman, Scott O Lilienfeld, Christopher J Patrick","doi":"10.1037/pas0001321","DOIUrl":"10.1037/pas0001321","url":null,"abstract":"<p><p>The triarchic model posits that distinct trait constructs of boldness, meanness, and disinhibition underlie psychopathy. The triarchic model traits are conceptualized as biobehavioral dimensions that can be assessed using different sets of indicators from alternative measurement modalities; as such, the triarchic model would hypothesize that these traits are not confined to any one item set. The present study tested whether the triarchic model dimensions would emerge from a hierarchical-structural analysis of the facet scales of the Elemental Psychopathy Assessment (EPA), an inventory designed to comprehensively index psychopathy according to the five-factor personality model. Study participants (<i>N</i>s = 811, 170) completed the EPA and three different scale sets assessing the triarchic traits along with criterion measures of antisocial/externalizing behaviors. Bass-ackwards modeling of the EPA facet scales revealed a four-level structure, with factors at the third level appearing similar to the triarchic trait dimensions. An analysis in which scores for the Level-3 EPA factors were regressed onto corresponding latent-trait dimensions defined using the different triarchic scale sets revealed extremely high convergence (βs = .84-.91). The Level-3 EPA factors also evidenced validity in relation to relevant criteria, approximating and sometimes exceeding that evident for the Level-4 EPA factors. Together, these results indicate that the triarchic trait constructs are embedded in a psychopathy inventory designed to align with a general personality model and effectively predict pertinent external criteria. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"472-487"},"PeriodicalIF":3.3,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11879147/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141427457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Beyond frequency: Evaluating the validity of assessing the context, duration, ability, and botherment of depression and anxiety symptoms in South Brazil.
Pub Date: 2024-08-01 | Epub Date: 2024-06-27 | DOI: 10.1037/pas0001323 | Psychological Assessment, pp. 488-504
Reza de Souza Brümmer, Karolin Rose Krause, Giovanni Abrahão Salum, Marcelo Pio de Almeida Fleck, Ighor Miron Porto, João Villanova do Amaral, João Pedro Gonçalves Pacheco, Bettina Moltrecht, Eoin McElroy, Mauricio Scopel Hoffmann
Assessment tools for depression and anxiety usually inquire about the frequency of symptoms. However, evidence suggests that different question framings might trigger different responses. Our aim was to test whether asking about symptoms' context, ability, duration, and botherment adds validity to the Patient Health Questionnaire-9, the Generalized Anxiety Disorder-7, and the Patient-Reported Outcomes Measurement Information System depression and anxiety measures. Participants came from two cross-sectional convenience-sampled surveys (N = 1,871) of adults (66% female; mean age 33.4 ± 13.2 years), weighted to approximate the state-level population. We examined measurement invariance across the different question frames, estimated whether framing affected mean scores, and tested their independent validity using covariate-adjusted and sample-weighted structural equation models. Validity was tested using tools assessing general disability, alcohol use, loneliness, well-being, grit, and frequency-based questions from depression and anxiety questionnaires. A bifactor model was applied to test the internal consistency of the question frames under the presence of a general factor (i.e., depression or anxiety). Measurement invariance was supported across the different frames. Framing questions as ability (i.e., "How easily …") produced a higher score, compared with framing by context (i.e., "In which daily situations …"). Construct and criterion validity analyses demonstrated that the variance explained using multiple question frames was similar to that explained using only one. We detected a strong overarching factor for each instrument, with little variance left to be explained by the question frame. Therefore, it is unlikely that using different adverbial phrasings can help clinicians and researchers improve their ability to detect depression or anxiety. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
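The conclusion that a strong overarching factor leaves little variance for the question-frame factors can be quantified with indices such as the explained common variance (ECV) of the general factor in a bifactor model. The abstract does not report ECV, so the standardized loadings below are purely hypothetical and the code only illustrates how the index is computed.

```python
# Explained common variance (ECV) of the general factor in a bifactor model,
# computed from hypothetical standardized loadings.
import numpy as np

general_loadings = np.array([0.72, 0.68, 0.75, 0.70, 0.66, 0.71])
specific_loadings = np.array([0.18, 0.22, 0.15, 0.20, 0.25, 0.17])  # frame-specific factors

ecv = (general_loadings**2).sum() / ((general_loadings**2).sum() + (specific_loadings**2).sum())
print(f"ECV of the general factor: {ecv:.2f}")  # values near 1 indicate a dominant general factor
```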
{"title":"Beyond frequency: Evaluating the validity of assessing the context, duration, ability, and botherment of depression and anxiety symptoms in South Brazil.","authors":"Reza de Souza Brümmer, Karolin Rose Krause, Giovanni Abrahão Salum, Marcelo Pio de Almeida Fleck, Ighor Miron Porto, João Villanova do Amaral, João Pedro Gonçalves Pacheco, Bettina Moltrecht, Eoin McElroy, Mauricio Scopel Hoffmann","doi":"10.1037/pas0001323","DOIUrl":"10.1037/pas0001323","url":null,"abstract":"<p><p>Assessment tools for depression and anxiety usually inquire about the frequency of symptoms. However, evidence suggests that different question framings might trigger different responses. Our aim is to test if asking about symptom's context, ability, duration, and botherment adds validity to Patient Health Questionnaire-9, General Anxiety Disorder-7, and Patient-Related Outcome Measurement Information Systems depression and anxiety. Participants came from two cross-sectional convenience-sampled surveys (<i>N</i> = 1,871) of adults (66% females, aged 33.4 ± 13.2), weighted to approximate with the state-level population. We examined measurement invariance across the different question frames, estimated whether framing affected mean scores, and tested their independent validity using covariate-adjusted and sample-weighted structural equation models. Validity was tested using tools assessing general disability, alcohol use, loneliness, well-being, grit, and frequency-based questions from depression and anxiety questionnaires. A bifactor model was applied to test the internal consistency of the question frames under the presence of a general factor (i.e., depression or anxiety). Measurement invariance was supported across the different frames. Framing questions as ability (i.e., \"How easily …\") produced a higher score, compared with framing by context (i.e., \"In which daily situations …\"). Construct and criterion validity analysis demonstrate that variance explained using multiple question frames was similar to using only one. We detected a strong overarching factor for each instrument, with little variances left to be explained by the question frame. Therefore, it is unlikely that using different adverbial phrasings can help clinicians and researchers to improve their ability to detect depression or anxiety. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":" ","pages":"488-504"},"PeriodicalIF":3.3,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141458963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Estimating classification consistency of machine learning models for screening measures.
Oscar Gonzalez, A R Georgeson, William E Pelham
Pub Date: 2024-06-01 | DOI: 10.1037/pas0001313 | Psychological Assessment, pp. 395-406
This article illustrates novel quantitative methods to estimate classification consistency in machine learning models used for screening measures. Screening measures are used in psychology and medicine to classify individuals into diagnostic classifications. In addition to achieving high accuracy, it is ideal for the screening process to have high classification consistency, which means that respondents would be classified into the same group every time if the assessment was repeated. Although machine learning models are increasingly being used to predict a screening classification based on individual item responses, methods to describe the classification consistency of machine learning models have not yet been developed. This article addresses this gap by describing methods to estimate classification inconsistency in machine learning models arising from two different sources: sampling error during model fitting and measurement error in the item responses. These methods use data resampling techniques such as the bootstrap and Monte Carlo sampling. These methods are illustrated using three empirical examples predicting a health condition/diagnosis from item responses. R code is provided to facilitate the implementation of the methods. This article highlights the importance of considering classification consistency alongside accuracy when studying screening measures and provides the tools and guidance necessary for applied researchers to obtain classification consistency indices in their machine learning research on diagnostic assessments. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
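The bootstrap approach to the sampling-error source of inconsistency can be sketched briefly: refit the screening classifier on resampled training sets and record, for each respondent, how often the predicted class agrees with that respondent's modal class. The code below is a simplified illustration with simulated data and a logistic-regression screener, not the authors' released R implementation.

```python
# Simplified sketch of bootstrap-based classification consistency: refit the
# screening model on resampled cases and record per-respondent agreement with
# the modal predicted class. Data and model choice are illustrative only.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n, k = 500, 10
X = rng.integers(0, 4, size=(n, k)).astype(float)           # simulated item responses
y = (X.sum(axis=1) + rng.normal(0, 3, n) > 15).astype(int)   # simulated diagnosis

B = 200
preds = np.empty((B, n), dtype=int)
for b in range(B):
    idx = rng.integers(0, n, n)                   # bootstrap resample of cases
    clf = LogisticRegression(max_iter=1000).fit(X[idx], y[idx])
    preds[b] = clf.predict(X)                     # classify every respondent

# Per-respondent consistency: proportion of bootstrap models agreeing with the modal class
modal = (preds.mean(axis=0) >= 0.5).astype(int)
consistency = (preds == modal).mean(axis=0)
print("mean classification consistency:", round(consistency.mean(), 3))
```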
{"title":"Estimating classification consistency of machine learning models for screening measures.","authors":"Oscar Gonzalez, A R Georgeson, William E Pelham","doi":"10.1037/pas0001313","DOIUrl":"https://doi.org/10.1037/pas0001313","url":null,"abstract":"<p><p>This article illustrates novel quantitative methods to estimate classification consistency in machine learning models used for screening measures. Screening measures are used in psychology and medicine to classify individuals into diagnostic classifications. In addition to achieving high accuracy, it is ideal for the screening process to have high classification consistency, which means that respondents would be classified into the same group every time if the assessment was repeated. Although machine learning models are increasingly being used to predict a screening classification based on individual item responses, methods to describe the classification consistency of machine learning models have not yet been developed. This article addresses this gap by describing methods to estimate classification inconsistency in machine learning models arising from two different sources: sampling error during model fitting and measurement error in the item responses. These methods use data resampling techniques such as the bootstrap and Monte Carlo sampling. These methods are illustrated using three empirical examples predicting a health condition/diagnosis from item responses. R code is provided to facilitate the implementation of the methods. This article highlights the importance of considering classification consistency alongside accuracy when studying screening measures and provides the tools and guidance necessary for applied researchers to obtain classification consistency indices in their machine learning research on diagnostic assessments. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20770,"journal":{"name":"Psychological Assessment","volume":"36 6-7","pages":"395-406"},"PeriodicalIF":3.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141200498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}