Extracting relevant predictors of the severity of mental illnesses from clinical information using regularisation regression models

Q4 Mathematics Statistics in Transition Pub Date : 2022-06-01 DOI:10.2478/stattrans-2022-0020
Sakshi Kaushik, A. Sabharwal, G. Grover
{"title":"Extracting relevant predictors of the severity of mental illnesses from clinical information using regularisation regression models","authors":"Sakshi Kaushik, A. Sabharwal, G. Grover","doi":"10.2478/stattrans-2022-0020","DOIUrl":null,"url":null,"abstract":"Abstract Mental disorders are common non-communicable diseases whose occurrence rises at epidemic rates globally. The determination of the severity of a mental illness has important clinical implications and it serves as a prognostic factor for effective intervention planning and management. This paper aims to identify the relevant predictors of the severity of mental illnesses (measured by psychiatric rating scales) from a wide range of clinical variables consisting of information on both laboratory test results and psychiatric factors. The laboratory test results collectively indicate the measurements of 23 components derived from vital signs and blood tests results for the evaluation of the complete blood count. The 8 psychiatric factors known to affect the severity of mental illnesses are considered, viz. the family history, course and onset of an illness, etc. Retrospective data of 78 patients diagnosed with mental and behavioural disorders were collected from the Lady Hardinge Medical College & Smt. S.K, Hospital in New Delhi, India. The observations missing in the data are imputed using the non-parametric random forest algorithm. The multicollinearity is detected based on the variance inflation factor. Owing to the presence of multicollinearity, regularisation techniques such as ridge regression and extensions of the least absolute shrinkage and selection operator (LASSO), viz. adaptive and group LASSO are used for fitting the regression model. Optimal tuning parameter λ is obtained through 13-fold cross-validation. It was observed that the coefficients of the quantitative predictors extracted by the adaptive LASSO and the group of predictors extracted by the group LASSO were comparable to the coefficients obtained through ridge regression.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"129 - 152"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Transition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/stattrans-2022-0020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Mental disorders are common non-communicable diseases whose occurrence rises at epidemic rates globally. The determination of the severity of a mental illness has important clinical implications and it serves as a prognostic factor for effective intervention planning and management. This paper aims to identify the relevant predictors of the severity of mental illnesses (measured by psychiatric rating scales) from a wide range of clinical variables consisting of information on both laboratory test results and psychiatric factors. The laboratory test results collectively indicate the measurements of 23 components derived from vital signs and blood tests results for the evaluation of the complete blood count. The 8 psychiatric factors known to affect the severity of mental illnesses are considered, viz. the family history, course and onset of an illness, etc. Retrospective data of 78 patients diagnosed with mental and behavioural disorders were collected from the Lady Hardinge Medical College & Smt. S.K, Hospital in New Delhi, India. The observations missing in the data are imputed using the non-parametric random forest algorithm. The multicollinearity is detected based on the variance inflation factor. Owing to the presence of multicollinearity, regularisation techniques such as ridge regression and extensions of the least absolute shrinkage and selection operator (LASSO), viz. adaptive and group LASSO are used for fitting the regression model. Optimal tuning parameter λ is obtained through 13-fold cross-validation. It was observed that the coefficients of the quantitative predictors extracted by the adaptive LASSO and the group of predictors extracted by the group LASSO were comparable to the coefficients obtained through ridge regression.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用正则化回归模型从临床信息中提取精神疾病严重程度的相关预测因子
摘要精神障碍是一种常见的非传染性疾病,其发病率在全球范围内呈上升趋势。精神疾病严重程度的确定具有重要的临床意义,它是有效干预计划和管理的预后因素。本文旨在从一系列临床变量中确定精神疾病严重程度的相关预测因素(通过精神评定量表测量),这些变量包括实验室测试结果和精神因素的信息。实验室测试结果共同指示了来自生命体征的23个成分的测量值和用于评估全血细胞计数的血液测试结果。考虑了已知影响精神疾病严重程度的8个精神因素,即家族史、病程和发病等。从哈丁夫人医学院和史密斯医学院收集了78名被诊断为精神和行为障碍的患者的回顾性数据。S.K,印度新德里医院。使用非参数随机森林算法估算数据中缺失的观测值。基于方差膨胀因子来检测多重共线性。由于多重共线性的存在,使用正则化技术,如岭回归和最小绝对收缩和选择算子(LASSO)的扩展,即自适应和群LASSO来拟合回归模型。通过13次交叉验证得到了最佳调谐参数λ。观察到由自适应LASSO提取的定量预测因子的系数和由LASSO组提取的预测因子组的系数与通过岭回归获得的系数相当。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Statistics in Transition
Statistics in Transition Decision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.00
自引率
0.00%
发文量
0
审稿时长
9 weeks
期刊介绍: Statistics in Transition (SiT) is an international journal published jointly by the Polish Statistical Association (PTS) and the Central Statistical Office of Poland (CSO/GUS), which sponsors this publication. Launched in 1993, it was issued twice a year until 2006; since then it appears - under a slightly changed title, Statistics in Transition new series - three times a year; and after 2013 as a regular quarterly journal." The journal provides a forum for exchange of ideas and experience amongst members of international community of statisticians, data producers and users, including researchers, teachers, policy makers and the general public. Its initially dominating focus on statistical issues pertinent to transition from centrally planned to a market-oriented economy has gradually been extended to embracing statistical problems related to development and modernization of the system of public (official) statistics, in general.
期刊最新文献
Estimating the probability of leaving unemployment for older people in Poland using survival models with censored data Does economic freedom promote financial development? Evidence from EU countries Rotation schemes and Chebyshev polynomials A nonparametric analysis of discrete time competing risks data: a comparison of the cause-specific-hazards approach and the vertical approach Comments on „Probability vs. Nonprobability Sampling: From the Birth of Survey Sampling to the Present Day” by Graham Kalton
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1