Jolie Boullion, Amanda Husein, Akshat Agrawal, Diensn Xing, Md Ismail Hossain, Md Shenuarin Bhuiyan, Oren Rom, Steven A Conrad, John A Vanchiere, A Wayne Orr, Christopher G Kevil, Mohammad Alfrad Nobel Bhuiyan
{"title":"Machine Learning-Based Biomarker Identification for Early Diagnosis of Metabolic Dysfunction-Associated Steatotic Liver Disease.","authors":"Jolie Boullion, Amanda Husein, Akshat Agrawal, Diensn Xing, Md Ismail Hossain, Md Shenuarin Bhuiyan, Oren Rom, Steven A Conrad, John A Vanchiere, A Wayne Orr, Christopher G Kevil, Mohammad Alfrad Nobel Bhuiyan","doi":"10.1210/clinem/dgaf111","DOIUrl":null,"url":null,"abstract":"<p><strong>Aim: </strong>Metabolic dysfunction-associated steatotic liver disease (MASLD) is an umbrella term for simple hepatic steatosis and the more severe metabolic dysfunction-associated steatohepatitis. The current reliance on liver biopsy for diagnosis and a lack of validated biomarkers are major factors contributing to the overall burden of MASLD. This study investigates the association between biomarkers and hepatic steatosis and stiffness measurements, measured by FibroScan®.</p><p><strong>Methods: </strong>Data from the National Health and Nutritional Examination Survey (2017-2020) was collected for 15,560 patients. Propensity score matching balanced the data with a 1:1 case-to-control for age and sex allowing for preliminary trend assessment. Random Forest machine learning determined variable importance for the incorporation of key biomarkers (age, sex, race, BMI, HbA1C, PFG, insulin, total cholesterol, LDL-cholesterol, HDL-cholesterol, triglycerides, ALT, AST, ALP, albumin, GGT, LDH, iron, total bilirubin, total protein, uric acid, BUN, and hs-CRP) into logistic regression models predicting steatosis (MASLD indicated by a controlled attenuation parameter™ score of >238 dB/m) and stiffness (hepatic fibrosis indicated by a median liver stiffness >7 kPa). Sensitivity analysis using XGBoost and Recursive Feature Elimination was performed.</p><p><strong>Results: </strong>The Random Forest models (the most accurate) predicted MASLD with 79.59% accuracy (p<0.001) and specificity of 84.65% and, hepatic fibrosis with 86.07% accuracy (p<0.001) and sensitivity of 98.01%. Both the steatosis and stiffness models identified statistically significant biomarkers, with age, BMI, and insulin appearing significant to both.</p><p><strong>Conclusion: </strong>These findings indicate that assessing a variety of biomarkers, across demographic, metabolic, lipid, and standard biochemistry categories, may provide valuable initial insights for diagnosing patients for MASLD and hepatic fibrosis.</p>","PeriodicalId":50238,"journal":{"name":"Journal of Clinical Endocrinology & Metabolism","volume":" ","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Endocrinology & Metabolism","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1210/clinem/dgaf111","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0
Abstract
Aim: Metabolic dysfunction-associated steatotic liver disease (MASLD) is an umbrella term for simple hepatic steatosis and the more severe metabolic dysfunction-associated steatohepatitis. The current reliance on liver biopsy for diagnosis and a lack of validated biomarkers are major factors contributing to the overall burden of MASLD. This study investigates the association between biomarkers and hepatic steatosis and stiffness measurements, measured by FibroScan®.
Methods: Data from the National Health and Nutritional Examination Survey (2017-2020) was collected for 15,560 patients. Propensity score matching balanced the data with a 1:1 case-to-control for age and sex allowing for preliminary trend assessment. Random Forest machine learning determined variable importance for the incorporation of key biomarkers (age, sex, race, BMI, HbA1C, PFG, insulin, total cholesterol, LDL-cholesterol, HDL-cholesterol, triglycerides, ALT, AST, ALP, albumin, GGT, LDH, iron, total bilirubin, total protein, uric acid, BUN, and hs-CRP) into logistic regression models predicting steatosis (MASLD indicated by a controlled attenuation parameter™ score of >238 dB/m) and stiffness (hepatic fibrosis indicated by a median liver stiffness >7 kPa). Sensitivity analysis using XGBoost and Recursive Feature Elimination was performed.
Results: The Random Forest models (the most accurate) predicted MASLD with 79.59% accuracy (p<0.001) and specificity of 84.65% and, hepatic fibrosis with 86.07% accuracy (p<0.001) and sensitivity of 98.01%. Both the steatosis and stiffness models identified statistically significant biomarkers, with age, BMI, and insulin appearing significant to both.
Conclusion: These findings indicate that assessing a variety of biomarkers, across demographic, metabolic, lipid, and standard biochemistry categories, may provide valuable initial insights for diagnosing patients for MASLD and hepatic fibrosis.
期刊介绍:
The Journal of Clinical Endocrinology & Metabolism is the world"s leading peer-reviewed journal for endocrine clinical research and cutting edge clinical practice reviews. Each issue provides the latest in-depth coverage of new developments enhancing our understanding, diagnosis and treatment of endocrine and metabolic disorders. Regular features of special interest to endocrine consultants include clinical trials, clinical reviews, clinical practice guidelines, case seminars, and controversies in clinical endocrinology, as well as original reports of the most important advances in patient-oriented endocrine and metabolic research. According to the latest Thomson Reuters Journal Citation Report, JCE&M articles were cited 64,185 times in 2008.