{"title":"使用scikit-learn建立和分析基于机器学习的华法林剂量预测模型。","authors":"Sangzin Ahn","doi":"10.12793/tcp.2022.30.e22","DOIUrl":null,"url":null,"abstract":"<p><p>For personalized drug dosing, prediction models may be utilized to overcome the inter-individual variability. Multiple linear regression has been used as a conventional method to model the relationship between patient features and optimal drug dose. However, linear regression cannot capture non-linear relationships and may be adversely affected by non-normal distribution and collinearity of data. To overcome this hurdle, machine learning models have been extensively adapted in drug dose prediction. In this tutorial, random forest and neural network models will be trained in tandem with a multiple linear regression model on the International Warfarin Pharmacogenetics Consortium dataset using the scikit-learn python library. Subsequent model analyses including performance comparison, permutation feature importance computation and partial dependence plotting will be demonstrated. The basic methods of model training and analysis discussed in this article may be implemented in drug dose-related studies.</p>","PeriodicalId":23288,"journal":{"name":"Translational and Clinical Pharmacology","volume":"30 4","pages":"172-181"},"PeriodicalIF":1.1000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/67/ea/tcp-30-172.PMC9810489.pdf","citationCount":"4","resultStr":"{\"title\":\"Building and analyzing machine learning-based warfarin dose prediction models using scikit-learn.\",\"authors\":\"Sangzin Ahn\",\"doi\":\"10.12793/tcp.2022.30.e22\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>For personalized drug dosing, prediction models may be utilized to overcome the inter-individual variability. Multiple linear regression has been used as a conventional method to model the relationship between patient features and optimal drug dose. However, linear regression cannot capture non-linear relationships and may be adversely affected by non-normal distribution and collinearity of data. To overcome this hurdle, machine learning models have been extensively adapted in drug dose prediction. In this tutorial, random forest and neural network models will be trained in tandem with a multiple linear regression model on the International Warfarin Pharmacogenetics Consortium dataset using the scikit-learn python library. Subsequent model analyses including performance comparison, permutation feature importance computation and partial dependence plotting will be demonstrated. The basic methods of model training and analysis discussed in this article may be implemented in drug dose-related studies.</p>\",\"PeriodicalId\":23288,\"journal\":{\"name\":\"Translational and Clinical Pharmacology\",\"volume\":\"30 4\",\"pages\":\"172-181\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/67/ea/tcp-30-172.PMC9810489.pdf\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational and Clinical Pharmacology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12793/tcp.2022.30.e22\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational and Clinical Pharmacology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12793/tcp.2022.30.e22","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
Building and analyzing machine learning-based warfarin dose prediction models using scikit-learn.
For personalized drug dosing, prediction models may be utilized to overcome the inter-individual variability. Multiple linear regression has been used as a conventional method to model the relationship between patient features and optimal drug dose. However, linear regression cannot capture non-linear relationships and may be adversely affected by non-normal distribution and collinearity of data. To overcome this hurdle, machine learning models have been extensively adapted in drug dose prediction. In this tutorial, random forest and neural network models will be trained in tandem with a multiple linear regression model on the International Warfarin Pharmacogenetics Consortium dataset using the scikit-learn python library. Subsequent model analyses including performance comparison, permutation feature importance computation and partial dependence plotting will be demonstrated. The basic methods of model training and analysis discussed in this article may be implemented in drug dose-related studies.
期刊介绍:
Translational and Clinical Pharmacology (Transl Clin Pharmacol, TCP) is the official journal of the Korean Society for Clinical Pharmacology and Therapeutics (KSCPT). TCP is an interdisciplinary journal devoted to the dissemination of knowledge relating to all aspects of translational and clinical pharmacology. The categories for publication include pharmacokinetics (PK) and drug disposition, drug metabolism, pharmacodynamics (PD), clinical trials and design issues, pharmacogenomics and pharmacogenetics, pharmacometrics, pharmacoepidemiology, pharmacovigilence, and human pharmacology. Studies involving animal models, pharmacological characterization, and clinical trials are appropriate for consideration.