Sandra Klawitter, Johannes Böhm, Alexander Tolios, Julian E. Gebauer
{"title":"Automated sex and age partitioning for the estimation of reference intervals using a regression tree model","authors":"Sandra Klawitter, Johannes Böhm, Alexander Tolios, Julian E. Gebauer","doi":"10.1515/labmed-2024-0083","DOIUrl":null,"url":null,"abstract":"Objectives Reference intervals (RI) play a decisive role in the interpretation of medical laboratory results. An important step in the determination of RI is age- and sex specific partitioning, which is usually based on an empirical approach by graphical representation. In this study, we evaluate an automated machine learning approach. Methods This study uses pediatric data from the CALIPER RI (Canadian laboratory initiative on pediatric reference intervals) study. The calculation of potential partitions is carried out using a regression tree model included in the <jats:monospace>rpart</jats:monospace> package of the statistical programming language R. The Harris & Boyd method is used to compare the corresponding partitions suggested by <jats:monospace>rpart</jats:monospace> and CALIPER. For better comparability, the reference ranges of the partitions of both approaches are then calculated using <jats:monospace>reflimR</jats:monospace>. Results Most of the partitions suggested by <jats:monospace>rpart</jats:monospace> or CALIPER show sufficient heterogeneity among themselves to justify age- and/or sex-specific RI partitioning. With only few individual exceptions, both methods yield comparable results. The partitions of both approaches for albumin and <jats:italic>γ</jats:italic>-glutamyltransferase are very similar to each other. For creatinine <jats:monospace>rpart</jats:monospace> suggests a slightly earlier distinction between the sexes. Alkaline phosphatase shows the most pronounced differences. In addition to a considerable earlier sex split, <jats:monospace>rpart</jats:monospace> suggests different age intervals for both sexes, resulting in three partitions for females and four partitions for males. Conclusions Our findings indicate that the automated analysis provided by <jats:monospace>rpart</jats:monospace> yields results that comparable to traditional methods. Nevertheless, the medical plausibility of the automatic suggestions needs to be validated by human experts.","PeriodicalId":55986,"journal":{"name":"Journal of Laboratory Medicine","volume":"1 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Laboratory Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1515/labmed-2024-0083","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives Reference intervals (RI) play a decisive role in the interpretation of medical laboratory results. An important step in the determination of RI is age- and sex specific partitioning, which is usually based on an empirical approach by graphical representation. In this study, we evaluate an automated machine learning approach. Methods This study uses pediatric data from the CALIPER RI (Canadian laboratory initiative on pediatric reference intervals) study. The calculation of potential partitions is carried out using a regression tree model included in the rpart package of the statistical programming language R. The Harris & Boyd method is used to compare the corresponding partitions suggested by rpart and CALIPER. For better comparability, the reference ranges of the partitions of both approaches are then calculated using reflimR. Results Most of the partitions suggested by rpart or CALIPER show sufficient heterogeneity among themselves to justify age- and/or sex-specific RI partitioning. With only few individual exceptions, both methods yield comparable results. The partitions of both approaches for albumin and γ-glutamyltransferase are very similar to each other. For creatinine rpart suggests a slightly earlier distinction between the sexes. Alkaline phosphatase shows the most pronounced differences. In addition to a considerable earlier sex split, rpart suggests different age intervals for both sexes, resulting in three partitions for females and four partitions for males. Conclusions Our findings indicate that the automated analysis provided by rpart yields results that comparable to traditional methods. Nevertheless, the medical plausibility of the automatic suggestions needs to be validated by human experts.
期刊介绍:
The Journal of Laboratory Medicine (JLM) is a bi-monthly published journal that reports on the latest developments in laboratory medicine. Particular focus is placed on the diagnostic aspects of the clinical laboratory, although technical, regulatory, and educational topics are equally covered. The Journal specializes in the publication of high-standard, competent and timely review articles on clinical, methodological and pathogenic aspects of modern laboratory diagnostics. These reviews are critically reviewed by expert reviewers and JLM’s Associate Editors who are specialists in the various subdisciplines of laboratory medicine. In addition, JLM publishes original research articles, case reports, point/counterpoint articles and letters to the editor, all of which are peer reviewed by at least two experts in the field.