Commentary on Sun and Tang: Measurement assessment and validity in problematic smartphone use

IF 5.3 1区 医学 Q1 PSYCHIATRY Addiction Pub Date : 2025-01-11 DOI:10.1111/add.16764
Richard J. E. James, Lucy Hitcham
{"title":"Commentary on Sun and Tang: Measurement assessment and validity in problematic smartphone use","authors":"Richard J. E. James,&nbsp;Lucy Hitcham","doi":"10.1111/add.16764","DOIUrl":null,"url":null,"abstract":"<p>The thoughtful choice of estimation procedures for the confirmatory factor analysis (CFA) and invariance testing is worth particular attention. Many assessment studies use maximum likelihood (ML or MLR with robust standard errors) for CFA despite well-known limitations when applied to ordinal data [<span>7</span>]. A popular alternative is to use limited information estimation, for example, weighted least squares (WLSMV) to overcome these. However, doing so comes with major drawbacks, most notably when assessing measurement invariance [<span>8, 9</span>]. Sun and Tang [<span>5</span>] carefully balance the strengths of both MLR and WLSMV to validate the Problematic Smartphone Use Scale among Chinese college students (PSUS-C). These considerations are valuable across the entirety of addiction research, especially in domains or populations where endorsement of indicators might be skewed (e.g. gambling, certain forms of substance use and general population samples). To illustrate why these problems matter, CFA studies have repeatedly shown inconsistent evidence of structural validity in prominent scales such as the Problem Gambling Severity Index [<span>10, 11</span>]. However, closer examination suggests that most of this inconsistency is an artifact of using ML on ordinal questionnaire items in general population samples where the distribution of responses is often skewed. When analyzed using an approach that balances the strengths of both ML and WLSMV, these inconsistencies disappear [<span>11, 12</span>].</p><p>The findings also highlight an important tension between identifying the best-fitting factor structure and deciding how a scale should be used. Both exploratory factor analysis (EFA) and CFA rejected a single-factor model in this study, yet a sum score was used to assess criterion validity. We raise this to promote the benefits of testing models specifying either a second-order or a bifactor structure because these can assess whether a single score is appropriate [<span>13</span>]. This is an issue across the PSU field, where many scales have been validated as multi-dimensional. but are used as a single score. This tension is a source of analytic flexibility and a potential threat to the validity of many findings, especially when methods such as structural equation modelling are used.</p><p>Our final reflection underscores the importance of invariance testing. Despite concluding in favor of strict invariance, there does not appear to be a comparison of latent mean differences that would allow a stronger test of group differences. Our examination of the descriptive data suggests the absence of a substantial sex difference in PSUS-C scores in this large, externally representative sample. We calculated the standardized effect size (d) using the mean (M) and SD statistics reported in table 1 (men: M = 58.05, SD = 18.09; women: M = 57.52, SD = 16.12). The difference observed in this study does not appear to practically differ from zero (d = 0.03). This finding contrasts with a large, disparate literature that has inconsistently found sex differences in the severity and prevalence of problematic smartphone behaviors (e.g. Cohen's <i>d</i> for women &gt; men = 0.16 [<span>14</span>], 0.39 [<span>15</span>], 0.22 [<span>16</span>], 0.10 [<span>17</span>] and 0.21 [<span>17</span>]). This is further complicated by a fixation on creating novel instruments or adapting scales from other behavioral addictions instead of improving and refining existing measures [<span>18</span>]. Ultimately, the absence of appropriate psychometric validation found in many PSU and behavioral addiction measures makes it impossible to determine whether the group differences observed elsewhere reflect genuine differences or bias caused by sampling, specific measurement scales or specific questionnaire items. Sun and Tang's study [<span>5</span>] offers insights on how to move forward with the assessment and validation of behavioral addiction measurement scales. The use of rigorous testing is essential to establish whether addiction constructs are equivalent across diverse groups of people to make valid group comparisons and inferences [<span>19</span>].</p><p><b>Richard J. E. James:</b> Writing—original draft (equal). <b>Lucy Hitcham:</b> Writing—original draft (equal).</p><p>R.J. has received funding for gambling research projects in the last 3 years from GREO Evidence Insights and the Academic Forum for the Study of Gambling. These funds are sourced from regulatory settlements levied by the Gambling Commission in lieu of penalties. L.H. is funded by the Engineering and Physical Sciences Research Council (EPSRC) on a PhD Studentship scholarship (EP/S023305/1).</p>","PeriodicalId":109,"journal":{"name":"Addiction","volume":"120 4","pages":"642-644"},"PeriodicalIF":5.3000,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/add.16764","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Addiction","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/add.16764","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

Abstract

The thoughtful choice of estimation procedures for the confirmatory factor analysis (CFA) and invariance testing is worth particular attention. Many assessment studies use maximum likelihood (ML or MLR with robust standard errors) for CFA despite well-known limitations when applied to ordinal data [7]. A popular alternative is to use limited information estimation, for example, weighted least squares (WLSMV) to overcome these. However, doing so comes with major drawbacks, most notably when assessing measurement invariance [8, 9]. Sun and Tang [5] carefully balance the strengths of both MLR and WLSMV to validate the Problematic Smartphone Use Scale among Chinese college students (PSUS-C). These considerations are valuable across the entirety of addiction research, especially in domains or populations where endorsement of indicators might be skewed (e.g. gambling, certain forms of substance use and general population samples). To illustrate why these problems matter, CFA studies have repeatedly shown inconsistent evidence of structural validity in prominent scales such as the Problem Gambling Severity Index [10, 11]. However, closer examination suggests that most of this inconsistency is an artifact of using ML on ordinal questionnaire items in general population samples where the distribution of responses is often skewed. When analyzed using an approach that balances the strengths of both ML and WLSMV, these inconsistencies disappear [11, 12].

The findings also highlight an important tension between identifying the best-fitting factor structure and deciding how a scale should be used. Both exploratory factor analysis (EFA) and CFA rejected a single-factor model in this study, yet a sum score was used to assess criterion validity. We raise this to promote the benefits of testing models specifying either a second-order or a bifactor structure because these can assess whether a single score is appropriate [13]. This is an issue across the PSU field, where many scales have been validated as multi-dimensional. but are used as a single score. This tension is a source of analytic flexibility and a potential threat to the validity of many findings, especially when methods such as structural equation modelling are used.

Our final reflection underscores the importance of invariance testing. Despite concluding in favor of strict invariance, there does not appear to be a comparison of latent mean differences that would allow a stronger test of group differences. Our examination of the descriptive data suggests the absence of a substantial sex difference in PSUS-C scores in this large, externally representative sample. We calculated the standardized effect size (d) using the mean (M) and SD statistics reported in table 1 (men: M = 58.05, SD = 18.09; women: M = 57.52, SD = 16.12). The difference observed in this study does not appear to practically differ from zero (d = 0.03). This finding contrasts with a large, disparate literature that has inconsistently found sex differences in the severity and prevalence of problematic smartphone behaviors (e.g. Cohen's d for women > men = 0.16 [14], 0.39 [15], 0.22 [16], 0.10 [17] and 0.21 [17]). This is further complicated by a fixation on creating novel instruments or adapting scales from other behavioral addictions instead of improving and refining existing measures [18]. Ultimately, the absence of appropriate psychometric validation found in many PSU and behavioral addiction measures makes it impossible to determine whether the group differences observed elsewhere reflect genuine differences or bias caused by sampling, specific measurement scales or specific questionnaire items. Sun and Tang's study [5] offers insights on how to move forward with the assessment and validation of behavioral addiction measurement scales. The use of rigorous testing is essential to establish whether addiction constructs are equivalent across diverse groups of people to make valid group comparisons and inferences [19].

Richard J. E. James: Writing—original draft (equal). Lucy Hitcham: Writing—original draft (equal).

R.J. has received funding for gambling research projects in the last 3 years from GREO Evidence Insights and the Academic Forum for the Study of Gambling. These funds are sourced from regulatory settlements levied by the Gambling Commission in lieu of penalties. L.H. is funded by the Engineering and Physical Sciences Research Council (EPSRC) on a PhD Studentship scholarship (EP/S023305/1).

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
孙和唐评论:问题智能手机使用的测量、评估和效度。
验证性因子分析(CFA)和不变性检验的估计程序的深思熟虑的选择值得特别注意。许多评估研究使用最大似然(ML或具有鲁棒标准误差的MLR)对CFA进行评估,尽管在应用于有序数据[7]时存在众所周知的局限性。一种流行的替代方法是使用有限信息估计,例如加权最小二乘(WLSMV)来克服这些问题。然而,这样做有很大的缺点,最明显的是在评估测量不变性时[8,9]。Sun和Tang[5]仔细地平衡了MLR和WLSMV的优势,以验证中国大学生智能手机问题使用量表(psu - c)。这些考虑在整个成瘾研究中都是有价值的,特别是在对指标的认可可能有偏差的领域或人群中(例如赌博、某些形式的物质使用和一般人群样本)。为了说明为什么这些问题很重要,CFA研究一再表明,在诸如问题赌博严重性指数(Problem Gambling Severity Index)等重要量表中存在不一致的结构效度证据[10,11]。然而,更仔细的检查表明,这种不一致的大部分是在一般人群样本中使用ML对顺序问卷项目的工件,其中响应的分布通常是倾斜的。当使用平衡ML和WLSMV优势的方法进行分析时,这些不一致就消失了[11,12]。研究结果还强调了确定最合适的因素结构和决定如何使用量表之间的重要紧张关系。探索性因素分析(EFA)和CFA均拒绝了本研究中的单因素模型,但使用总和评分来评估标准效度。我们提出这个问题是为了促进指定二阶或双因素结构的测试模型的好处,因为这些模型可以评估单个分数是否合适。这是PSU领域的一个问题,许多尺度已经被验证为多维的。而是作为一个单一的分数。这种张力是分析灵活性的来源,也是对许多发现有效性的潜在威胁,特别是当使用结构方程模型等方法时。我们最后的反思强调了不变性测试的重要性。尽管结论支持严格的不变性,但似乎没有潜在平均差异的比较,这将允许对群体差异进行更强的检验。我们对描述性数据的检查表明,在这个庞大的、具有外部代表性的样本中,pss - c分数没有实质性的性别差异。我们使用表1中报告的均值(M)和SD统计量计算标准化效应量(d)(男性:M = 58.05, SD = 18.09;女性:M = 57.52, SD = 16.12)。在本研究中观察到的差异实际上似乎与零无关(d = 0.03)。这一发现与大量不同的文献形成了对比,这些文献不一致地发现了智能手机问题行为的严重程度和流行程度的性别差异(例如,女性和男性的科恩d = 0.16[14], 0.39[15], 0.22[16], 0.10[17]和0.21[17])。由于执着于创造新的工具或从其他行为成瘾中改编量表,而不是改进和完善现有的测量方法,这使情况变得更加复杂。最终,在许多PSU和行为成瘾测量中缺乏适当的心理测量验证,因此无法确定其他地方观察到的群体差异是否反映了真正的差异,还是由抽样、特定测量量表或特定问卷项目引起的偏见。孙和唐的研究为如何进一步评估和验证行为成瘾测量量表提供了见解。为了进行有效的群体比较和推论,必须使用严格的测试来确定成瘾结构在不同人群中是否相同。理查德·j·e·詹姆斯:原稿(同等)。露西·希区姆:原稿(同等)。r.j.。在过去的三年里,他获得了来自GREO证据洞察和赌博研究学术论坛的赌博研究项目的资助。这些资金来自赌博委员会征收的监管和解金,以代替罚款。L.H.是由工程和物理科学研究委员会(EPSRC)资助的博士奖学金(EP/S023305/1)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Addiction
Addiction 医学-精神病学
CiteScore
10.80
自引率
6.70%
发文量
319
审稿时长
3 months
期刊介绍: Addiction publishes peer-reviewed research reports on pharmacological and behavioural addictions, bringing together research conducted within many different disciplines. Its goal is to serve international and interdisciplinary scientific and clinical communication, to strengthen links between science and policy, and to stimulate and enhance the quality of debate. We seek submissions that are not only technically competent but are also original and contain information or ideas of fresh interest to our international readership. We seek to serve low- and middle-income (LAMI) countries as well as more economically developed countries. Addiction’s scope spans human experimental, epidemiological, social science, historical, clinical and policy research relating to addiction, primarily but not exclusively in the areas of psychoactive substance use and/or gambling. In addition to original research, the journal features editorials, commentaries, reviews, letters, and book reviews.
期刊最新文献
Quitting trajectories of Hong Kong Chinese smokers receiving behavioral smoking cessation interventions: A post hoc analysis of eight randomized controlled trials. Trends in fentanyl-containing drug samples seized by law enforcement agencies across Canada. Target trial emulation for addiction research-A useful method but not a panacea. Commentary on Dobbie et al.: Preventing adolescent gambling-related harm-Is a universal approach sufficient? Varenicline for cannabis use disorder: A randomized controlled trial.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1