Development and validation of machine-learning models of diet management for hyperphenylalaninemia: a multicenter retrospective study

IF 7 1区 医学 Q1 MEDICINE, GENERAL & INTERNAL BMC Medicine Pub Date : 2024-09-11 DOI:10.1186/s12916-024-03602-w
Yajie Su, Yaqiong Wang, Jinfeng He, Huijun Wang, Xian A, Haili Jiang, Wei Lu, Wenhao Zhou, Long Li
{"title":"Development and validation of machine-learning models of diet management for hyperphenylalaninemia: a multicenter retrospective study","authors":"Yajie Su, Yaqiong Wang, Jinfeng He, Huijun Wang, Xian A, Haili Jiang, Wei Lu, Wenhao Zhou, Long Li","doi":"10.1186/s12916-024-03602-w","DOIUrl":null,"url":null,"abstract":"Assessing dietary phenylalanine (Phe) tolerance is crucial for managing hyperphenylalaninemia (HPA) in children. However, traditionally, adjusting the diet requires significant time from clinicians and parents. This study aims to investigate the development of a machine-learning model that predicts a range of dietary Phe intake tolerance for children with HPA over 10 years following diagnosis. In this multicenter retrospective observational study, we collected the genotypes of phenylalanine hydroxylase (PAH), metabolic profiles at screening and diagnosis, and blood Phe concentrations corresponding to dietary Phe intake from over 10 years of follow-up data for 204 children with HPA. To incorporate genetic information, allelic phenotype value (APV) was input for 2965 missense variants in the PAH gene using a predicted APV (pAPV) model. This model was trained on known pheno-genotype relationships from the BioPKU database, utilizing 31 features. Subsequently, a multiclass classification model was constructed and trained on a dataset featuring metabolic data, genetic data, and follow-up data from 3177 events. The final model was fine-tuned using tenfold validation and validated against three independent datasets. The pAPV model achieved a good predictive performance with root mean squared error (RMSE) of 1.53 and 2.38 on the training and test datasets, respectively. The variants that cause amino acid changes in the region of 200–300 of PAH tend to exhibit lower pAPV. The final model achieved a sensitivity range of 0.77 to 0.91 and a specificity range of 0.8 to 1 across all validation datasets. Additional assessment metrics including positive predictive value (0.68–1), negative predictive values (0.8–0.98), F1 score (0.71–0.92), and balanced accuracy (0.8–0.92) demonstrated the robust performance of our model. Our model integrates metabolic and genetic information to accurately predict age-specific Phe tolerance, aiding in the precision management of patients with HPA. This study provides a potential framework that could be applied to other inborn errors of metabolism.","PeriodicalId":9188,"journal":{"name":"BMC Medicine","volume":null,"pages":null},"PeriodicalIF":7.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12916-024-03602-w","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Assessing dietary phenylalanine (Phe) tolerance is crucial for managing hyperphenylalaninemia (HPA) in children. However, traditionally, adjusting the diet requires significant time from clinicians and parents. This study aims to investigate the development of a machine-learning model that predicts a range of dietary Phe intake tolerance for children with HPA over 10 years following diagnosis. In this multicenter retrospective observational study, we collected the genotypes of phenylalanine hydroxylase (PAH), metabolic profiles at screening and diagnosis, and blood Phe concentrations corresponding to dietary Phe intake from over 10 years of follow-up data for 204 children with HPA. To incorporate genetic information, allelic phenotype value (APV) was input for 2965 missense variants in the PAH gene using a predicted APV (pAPV) model. This model was trained on known pheno-genotype relationships from the BioPKU database, utilizing 31 features. Subsequently, a multiclass classification model was constructed and trained on a dataset featuring metabolic data, genetic data, and follow-up data from 3177 events. The final model was fine-tuned using tenfold validation and validated against three independent datasets. The pAPV model achieved a good predictive performance with root mean squared error (RMSE) of 1.53 and 2.38 on the training and test datasets, respectively. The variants that cause amino acid changes in the region of 200–300 of PAH tend to exhibit lower pAPV. The final model achieved a sensitivity range of 0.77 to 0.91 and a specificity range of 0.8 to 1 across all validation datasets. Additional assessment metrics including positive predictive value (0.68–1), negative predictive values (0.8–0.98), F1 score (0.71–0.92), and balanced accuracy (0.8–0.92) demonstrated the robust performance of our model. Our model integrates metabolic and genetic information to accurately predict age-specific Phe tolerance, aiding in the precision management of patients with HPA. This study provides a potential framework that could be applied to other inborn errors of metabolism.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
高苯丙氨酸血症饮食管理机器学习模型的开发与验证:一项多中心回顾性研究
评估饮食中苯丙氨酸(Phe)的耐受性对于控制儿童高苯丙氨酸血症(HPA)至关重要。然而,传统上,调整饮食需要临床医生和家长花费大量时间。本研究旨在调查机器学习模型的开发情况,该模型可预测 HPA 患儿在确诊后 10 年内的膳食 Phe 摄入耐受性范围。在这项多中心回顾性观察研究中,我们收集了 204 名 HPA 患儿的苯丙氨酸羟化酶(PAH)基因型、筛查和诊断时的代谢特征以及与饮食 Phe 摄入量相对应的血液 Phe 浓度,这些数据来自 10 年多的随访数据。为了纳入遗传信息,使用预测等位基因表型值(pAPV)模型输入了 PAH 基因中 2965 个错义变异的等位基因表型值(APV)。该模型根据 BioPKU 数据库中已知的表型-基因型关系,利用 31 个特征进行训练。随后,构建了一个多类分类模型,并在一个包含代谢数据、基因数据和来自 3177 例事件的随访数据的数据集上进行了训练。最终模型通过十倍验证进行了微调,并通过三个独立数据集进行了验证。pAPV 模型具有良好的预测性能,在训练数据集和测试数据集上的均方根误差(RMSE)分别为 1.53 和 2.38。导致 PAH 200-300 氨基酸变化的变体往往表现出较低的 pAPV。在所有验证数据集上,最终模型的灵敏度范围为 0.77 至 0.91,特异性范围为 0.8 至 1。其他评估指标包括阳性预测值(0.68-1)、阴性预测值(0.8-0.98)、F1 评分(0.71-0.92)和平衡准确度(0.8-0.92),这些指标都证明了我们的模型具有强大的性能。我们的模型整合了代谢和遗传信息,能准确预测特定年龄的Phe耐受性,有助于对HPA患者进行精准管理。这项研究提供了一个潜在的框架,可应用于其他先天性代谢错误。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Medicine
BMC Medicine 医学-医学:内科
CiteScore
13.10
自引率
1.10%
发文量
435
审稿时长
4-8 weeks
期刊介绍: BMC Medicine is an open access, transparent peer-reviewed general medical journal. It is the flagship journal of the BMC series and publishes outstanding and influential research in various areas including clinical practice, translational medicine, medical and health advances, public health, global health, policy, and general topics of interest to the biomedical and sociomedical professional communities. In addition to research articles, the journal also publishes stimulating debates, reviews, unique forum articles, and concise tutorials. All articles published in BMC Medicine are included in various databases such as Biological Abstracts, BIOSIS, CAS, Citebase, Current contents, DOAJ, Embase, MEDLINE, PubMed, Science Citation Index Expanded, OAIster, SCImago, Scopus, SOCOLAR, and Zetoc.
期刊最新文献
Interactive effect of air pollution and genetic risk of depression on processing speed by resting-state functional connectivity of occipitoparietal network Sleep patterns, physical activity, genetic susceptibility, and incident rheumatoid arthritis: a prospective cohort study Adiposity and mortality among intensive care patients with COVID-19 and non-COVID-19 respiratory conditions: a cross-context comparison study in the UK Single-cell sequencing of the vermiform appendix during development identifies transcriptional relationships with appendicitis in preschool children Efficacy of respiratory support therapies during pulmonary rehabilitation exercise training in chronic obstructive pulmonary disease patients: a systematic review and network meta-analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1