使用基于基因签名的机器学习模型预测腰痛患者的治疗结果。

IF 4.1 2区 医学 Q1 CLINICAL NEUROLOGY Pain and Therapy Pub Date : 2025-02-01 Epub Date: 2024-12-25 DOI:10.1007/s40122-024-00700-8
Youzhi Lian, Yinyu Shi, Haibin Shang, Hongsheng Zhan
{"title":"使用基于基因签名的机器学习模型预测腰痛患者的治疗结果。","authors":"Youzhi Lian, Yinyu Shi, Haibin Shang, Hongsheng Zhan","doi":"10.1007/s40122-024-00700-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Low back pain (LBP) is a significant global health burden, with variable treatment outcomes and an unclear underlying molecular mechanism. Effective prediction of treatment responses remains a challenge. In this study, we aimed to develop gene signature-based machine learning models using transcriptomic data from peripheral immune cells to predict treatment outcomes in patients with LBP.</p><p><strong>Methods: </strong>The transcriptomic data of patients with LBP from peripheral immune cells were retrieved from the GEO database. Patients with LBP were recruited, and treatment outcomes were assessed after 3 months. Patients were classified into two groups: those with resolved pain and those with persistent pain. Differentially expressed genes (DEGs) between the two groups were identified through bioinformatic analysis. Key genes were selected using five machine learning models, including Lasso, Elastic Net, Random Forest, SVM, and GBM. These key genes were then used to train 45 machine learning models by combining nine different algorithms: Logistic Regression, K-Nearest Neighbors, Support Vector Machine, Decision Tree, Random Forest, Gradient Boosting Machine, Multilayer Perceptron, Naive Bayes, and Linear Discriminant Analysis. Five-fold cross-validation was employed to ensure robust model evaluation and minimize overfitting. In each fold, the dataset was split into training and validation sets, with model performance assessed using multiple metrics including accuracy, precision, recall, and F1 score. The final model performance was reported as the mean and standard deviation across all five folds, providing a more reliable estimate of the models' ability to predict LBP treatment outcomes using gene expression data from peripheral immune cells.</p><p><strong>Results: </strong>A total of 61 DEGs were identified between patients with resolved and persistent pain. From these genes, 45 machine learning models were constructed using different combinations of feature selection methods and classification algorithms. The Elastic Net with Logistic Regression achieved the highest accuracy of 88.7% ± 8.0% (mean ± standard deviation), followed closely by Elastic Net with Linear Discriminant Analysis (88.7% ± 7.5%) and Lasso with Multilayer Perceptron (87.7% ± 6.7%). Overall, 15 models demonstrated robust performance with accuracy > 80%, suggesting the reliability of our machine learning approach in predicting LBP treatment outcomes. The SHapley Additive exPlanations (SHAP) method was used to visualize the contribution of core genes to model performance, highlighting their roles in predicting treatment outcomes.</p><p><strong>Conclusion: </strong>The study demonstrates the potential of using transcriptomic data from peripheral immune cells and machine learning models to predict treatment outcomes in patients with LBP. The identification of key genes and the high accuracy of certain models provide a basis for future personalized treatment strategies in LBP management. Visualizing gene importance with SHAP adds interpretability to the predictive models, enhancing their clinical relevance.</p>","PeriodicalId":19908,"journal":{"name":"Pain and Therapy","volume":" ","pages":"359-373"},"PeriodicalIF":4.1000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751268/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting Treatment Outcomes in Patients with Low Back Pain Using Gene Signature-Based Machine Learning Models.\",\"authors\":\"Youzhi Lian, Yinyu Shi, Haibin Shang, Hongsheng Zhan\",\"doi\":\"10.1007/s40122-024-00700-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Low back pain (LBP) is a significant global health burden, with variable treatment outcomes and an unclear underlying molecular mechanism. Effective prediction of treatment responses remains a challenge. In this study, we aimed to develop gene signature-based machine learning models using transcriptomic data from peripheral immune cells to predict treatment outcomes in patients with LBP.</p><p><strong>Methods: </strong>The transcriptomic data of patients with LBP from peripheral immune cells were retrieved from the GEO database. Patients with LBP were recruited, and treatment outcomes were assessed after 3 months. Patients were classified into two groups: those with resolved pain and those with persistent pain. Differentially expressed genes (DEGs) between the two groups were identified through bioinformatic analysis. Key genes were selected using five machine learning models, including Lasso, Elastic Net, Random Forest, SVM, and GBM. These key genes were then used to train 45 machine learning models by combining nine different algorithms: Logistic Regression, K-Nearest Neighbors, Support Vector Machine, Decision Tree, Random Forest, Gradient Boosting Machine, Multilayer Perceptron, Naive Bayes, and Linear Discriminant Analysis. Five-fold cross-validation was employed to ensure robust model evaluation and minimize overfitting. In each fold, the dataset was split into training and validation sets, with model performance assessed using multiple metrics including accuracy, precision, recall, and F1 score. The final model performance was reported as the mean and standard deviation across all five folds, providing a more reliable estimate of the models' ability to predict LBP treatment outcomes using gene expression data from peripheral immune cells.</p><p><strong>Results: </strong>A total of 61 DEGs were identified between patients with resolved and persistent pain. From these genes, 45 machine learning models were constructed using different combinations of feature selection methods and classification algorithms. The Elastic Net with Logistic Regression achieved the highest accuracy of 88.7% ± 8.0% (mean ± standard deviation), followed closely by Elastic Net with Linear Discriminant Analysis (88.7% ± 7.5%) and Lasso with Multilayer Perceptron (87.7% ± 6.7%). Overall, 15 models demonstrated robust performance with accuracy > 80%, suggesting the reliability of our machine learning approach in predicting LBP treatment outcomes. The SHapley Additive exPlanations (SHAP) method was used to visualize the contribution of core genes to model performance, highlighting their roles in predicting treatment outcomes.</p><p><strong>Conclusion: </strong>The study demonstrates the potential of using transcriptomic data from peripheral immune cells and machine learning models to predict treatment outcomes in patients with LBP. The identification of key genes and the high accuracy of certain models provide a basis for future personalized treatment strategies in LBP management. Visualizing gene importance with SHAP adds interpretability to the predictive models, enhancing their clinical relevance.</p>\",\"PeriodicalId\":19908,\"journal\":{\"name\":\"Pain and Therapy\",\"volume\":\" \",\"pages\":\"359-373\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751268/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pain and Therapy\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s40122-024-00700-8\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pain and Therapy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40122-024-00700-8","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/25 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

摘要

摘要:腰痛(LBP)是一种重要的全球健康负担,治疗结果不一,潜在的分子机制尚不清楚。有效预测治疗反应仍然是一个挑战。在这项研究中,我们旨在开发基于基因签名的机器学习模型,利用来自外周免疫细胞的转录组学数据来预测LBP患者的治疗结果。方法:从GEO数据库中检索LBP患者外周血免疫细胞的转录组数据。招募LBP患者,并在3个月后评估治疗结果。患者分为两组:缓解疼痛组和持续疼痛组。通过生物信息学分析鉴定两组间的差异表达基因(DEGs)。使用Lasso、Elastic Net、Random Forest、SVM和GBM五种机器学习模型选择关键基因。然后,这些关键基因通过结合9种不同的算法来训练45个机器学习模型:逻辑回归、k近邻、支持向量机、决策树、随机森林、梯度增强机、多层感知器、朴素贝叶斯和线性判别分析。采用五重交叉验证来确保模型评估的稳健性和最小化过拟合。在每个折叠中,数据集被分成训练集和验证集,使用多个指标评估模型性能,包括准确性、精密度、召回率和F1分数。最终的模型性能报告为所有五倍的平均值和标准偏差,提供了更可靠的模型预测LBP治疗结果的能力,使用来自外周免疫细胞的基因表达数据。结果:在缓解性疼痛和持续性疼痛患者之间共鉴定出61个deg。从这些基因中,使用不同的特征选择方法和分类算法组合构建了45个机器学习模型。使用Logistic回归的Elastic Net的准确率最高,为88.7%±8.0%(均值±标准差),其次是使用线性判别分析的Elastic Net(88.7%±7.5%)和使用多层感知器的Lasso(87.7%±6.7%)。总体而言,15个模型表现出稳健的性能,准确率达到80%,表明我们的机器学习方法在预测LBP治疗结果方面是可靠的。SHapley加性解释(SHAP)方法用于可视化核心基因对模型表现的贡献,突出了它们在预测治疗结果中的作用。结论:该研究证明了使用外周免疫细胞转录组数据和机器学习模型预测LBP患者治疗结果的潜力。关键基因的识别和某些模型的高准确性为未来LBP治疗的个性化治疗策略提供了基础。用SHAP可视化基因的重要性增加了预测模型的可解释性,增强了它们的临床相关性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Predicting Treatment Outcomes in Patients with Low Back Pain Using Gene Signature-Based Machine Learning Models.

Introduction: Low back pain (LBP) is a significant global health burden, with variable treatment outcomes and an unclear underlying molecular mechanism. Effective prediction of treatment responses remains a challenge. In this study, we aimed to develop gene signature-based machine learning models using transcriptomic data from peripheral immune cells to predict treatment outcomes in patients with LBP.

Methods: The transcriptomic data of patients with LBP from peripheral immune cells were retrieved from the GEO database. Patients with LBP were recruited, and treatment outcomes were assessed after 3 months. Patients were classified into two groups: those with resolved pain and those with persistent pain. Differentially expressed genes (DEGs) between the two groups were identified through bioinformatic analysis. Key genes were selected using five machine learning models, including Lasso, Elastic Net, Random Forest, SVM, and GBM. These key genes were then used to train 45 machine learning models by combining nine different algorithms: Logistic Regression, K-Nearest Neighbors, Support Vector Machine, Decision Tree, Random Forest, Gradient Boosting Machine, Multilayer Perceptron, Naive Bayes, and Linear Discriminant Analysis. Five-fold cross-validation was employed to ensure robust model evaluation and minimize overfitting. In each fold, the dataset was split into training and validation sets, with model performance assessed using multiple metrics including accuracy, precision, recall, and F1 score. The final model performance was reported as the mean and standard deviation across all five folds, providing a more reliable estimate of the models' ability to predict LBP treatment outcomes using gene expression data from peripheral immune cells.

Results: A total of 61 DEGs were identified between patients with resolved and persistent pain. From these genes, 45 machine learning models were constructed using different combinations of feature selection methods and classification algorithms. The Elastic Net with Logistic Regression achieved the highest accuracy of 88.7% ± 8.0% (mean ± standard deviation), followed closely by Elastic Net with Linear Discriminant Analysis (88.7% ± 7.5%) and Lasso with Multilayer Perceptron (87.7% ± 6.7%). Overall, 15 models demonstrated robust performance with accuracy > 80%, suggesting the reliability of our machine learning approach in predicting LBP treatment outcomes. The SHapley Additive exPlanations (SHAP) method was used to visualize the contribution of core genes to model performance, highlighting their roles in predicting treatment outcomes.

Conclusion: The study demonstrates the potential of using transcriptomic data from peripheral immune cells and machine learning models to predict treatment outcomes in patients with LBP. The identification of key genes and the high accuracy of certain models provide a basis for future personalized treatment strategies in LBP management. Visualizing gene importance with SHAP adds interpretability to the predictive models, enhancing their clinical relevance.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pain and Therapy
Pain and Therapy CLINICAL NEUROLOGY-
CiteScore
6.60
自引率
5.00%
发文量
110
审稿时长
6 weeks
期刊介绍: Pain and Therapy is an international, open access, peer-reviewed, rapid publication journal dedicated to the publication of high-quality clinical (all phases), observational, real-world, and health outcomes research around the discovery, development, and use of pain therapies and pain-related devices. Studies relating to diagnosis, pharmacoeconomics, public health, quality of life, and patient care, management, and education are also encouraged. Areas of focus include, but are not limited to, acute pain, cancer pain, chronic pain, headache and migraine, neuropathic pain, opioids, palliative care and pain ethics, peri- and post-operative pain as well as rheumatic pain and fibromyalgia. The journal is of interest to a broad audience of pharmaceutical and healthcare professionals and publishes original research, reviews, case reports, trial protocols, short communications such as commentaries and editorials, and letters. The journal is read by a global audience and receives submissions from around the world. Pain and Therapy will consider all scientifically sound research be it positive, confirmatory or negative data. Submissions are welcomed whether they relate to an international and/or a country-specific audience, something that is crucially important when researchers are trying to target more specific patient populations. This inclusive approach allows the journal to assist in the dissemination of all scientifically and ethically sound research.
期刊最新文献
Correction to: Real‑World Evidence of the Safety and Effectiveness of Atogepant Added to OnabotulinumtoxinA for the Preventive Treatment of Chronic Migraine: A Retrospective Chart Review. Current and Novel Therapies for Cluster Headache: A Narrative Review. Transforaminal Endoscopic Lumbar Discectomy versus Coblation Nucleoplasty Combined with Collagenase Chemonucleolysis for Lumbar Disc Herniation with Grade I Degenerative Spondylolisthesis. Mechanisms and Therapeutic Potential of GPX4 in Pain Modulation. The Impact of ERAS and Multidisciplinary Teams on Perioperative Management in Colorectal Cancer.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1