使用scikit-learn建立和分析基于机器学习的华法林剂量预测模型。

IF 1.1 Q4 PHARMACOLOGY & PHARMACY Translational and Clinical Pharmacology Pub Date : 2022-12-01 DOI:10.12793/tcp.2022.30.e22

Sangzin Ahn

{"title":"使用scikit-learn建立和分析基于机器学习的华法林剂量预测模型。","authors":"Sangzin Ahn","doi":"10.12793/tcp.2022.30.e22","DOIUrl":null,"url":null,"abstract":"For personalized drug dosing, prediction models may be utilized to overcome the inter-individual variability. Multiple linear regression has been used as a conventional method to model the relationship between patient features and optimal drug dose. However, linear regression cannot capture non-linear relationships and may be adversely affected by non-normal distribution and collinearity of data. To overcome this hurdle, machine learning models have been extensively adapted in drug dose prediction. In this tutorial, random forest and neural network models will be trained in tandem with a multiple linear regression model on the International Warfarin Pharmacogenetics Consortium dataset using the scikit-learn python library. Subsequent model analyses including performance comparison, permutation feature importance computation and partial dependence plotting will be demonstrated. The basic methods of model training and analysis discussed in this article may be implemented in drug dose-related studies.","PeriodicalId":23288,"journal":{"name":"Translational and Clinical Pharmacology","volume":"30 4","pages":"172-181"},"PeriodicalIF":1.1000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/67/ea/tcp-30-172.PMC9810489.pdf","citationCount":"4","resultStr":"{\"title\":\"Building and analyzing machine learning-based warfarin dose prediction models using scikit-learn.\",\"authors\":\"Sangzin Ahn\",\"doi\":\"10.12793/tcp.2022.30.e22\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For personalized drug dosing, prediction models may be utilized to overcome the inter-individual variability. Multiple linear regression has been used as a conventional method to model the relationship between patient features and optimal drug dose. However, linear regression cannot capture non-linear relationships and may be adversely affected by non-normal distribution and collinearity of data. To overcome this hurdle, machine learning models have been extensively adapted in drug dose prediction. In this tutorial, random forest and neural network models will be trained in tandem with a multiple linear regression model on the International Warfarin Pharmacogenetics Consortium dataset using the scikit-learn python library. Subsequent model analyses including performance comparison, permutation feature importance computation and partial dependence plotting will be demonstrated. The basic methods of model training and analysis discussed in this article may be implemented in drug dose-related studies.\",\"PeriodicalId\":23288,\"journal\":{\"name\":\"Translational and Clinical Pharmacology\",\"volume\":\"30 4\",\"pages\":\"172-181\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/67/ea/tcp-30-172.PMC9810489.pdf\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational and Clinical Pharmacology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12793/tcp.2022.30.e22\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational and Clinical Pharmacology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12793/tcp.2022.30.e22","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 4

摘要

对于个体化给药，预测模型可以用来克服个体间的可变性。多元线性回归是建立患者特征与最佳药物剂量关系模型的传统方法。然而，线性回归不能捕捉非线性关系，并且可能受到数据的非正态分布和共线性的不利影响。为了克服这一障碍，机器学习模型已广泛应用于药物剂量预测。在本教程中，随机森林和神经网络模型将与使用scikit-learn python库在国际华法林药物遗传学联盟数据集上的多元线性回归模型一起训练。随后的模型分析包括性能比较，排列特征重要性计算和部分依赖绘图将被演示。本文讨论的模型训练和分析的基本方法可应用于药物剂量相关研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Building and analyzing machine learning-based warfarin dose prediction models using scikit-learn.

For personalized drug dosing, prediction models may be utilized to overcome the inter-individual variability. Multiple linear regression has been used as a conventional method to model the relationship between patient features and optimal drug dose. However, linear regression cannot capture non-linear relationships and may be adversely affected by non-normal distribution and collinearity of data. To overcome this hurdle, machine learning models have been extensively adapted in drug dose prediction. In this tutorial, random forest and neural network models will be trained in tandem with a multiple linear regression model on the International Warfarin Pharmacogenetics Consortium dataset using the scikit-learn python library. Subsequent model analyses including performance comparison, permutation feature importance computation and partial dependence plotting will be demonstrated. The basic methods of model training and analysis discussed in this article may be implemented in drug dose-related studies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Translational and Clinical Pharmacology Medicine-Pharmacology (medical)

CiteScore

1.60

自引率

11.10%

发文量

期刊介绍： Translational and Clinical Pharmacology (Transl Clin Pharmacol, TCP) is the official journal of the Korean Society for Clinical Pharmacology and Therapeutics (KSCPT). TCP is an interdisciplinary journal devoted to the dissemination of knowledge relating to all aspects of translational and clinical pharmacology. The categories for publication include pharmacokinetics (PK) and drug disposition, drug metabolism, pharmacodynamics (PD), clinical trials and design issues, pharmacogenomics and pharmacogenetics, pharmacometrics, pharmacoepidemiology, pharmacovigilence, and human pharmacology. Studies involving animal models, pharmacological characterization, and clinical trials are appropriate for consideration.