使用机器学习的尿液代谢组分类及其在诊断间质性膀胱炎中的潜在应用。

Bladder (San Francisco, Calif.) Pub Date : 2020-06-02 eCollection Date: 2020-01-01 DOI:10.14440/bladder.2020.815

Feng Tong, Muhammad Shahid, Peng Jin, Sungyong Jung, Won Hwa Kim, Jayoung Kim

{"title":"使用机器学习的尿液代谢组分类及其在诊断间质性膀胱炎中的潜在应用。","authors":"Feng Tong, Muhammad Shahid, Peng Jin, Sungyong Jung, Won Hwa Kim, Jayoung Kim","doi":"10.14440/bladder.2020.815","DOIUrl":null,"url":null,"abstract":"With the advent of artificial intelligence (AI) in biostatistical analysis and modeling, machine learning can potentially be applied into developing diagnostic models for interstitial cystitis (IC). In the current clinical setting, urologists are dependent on cystoscopy and questionnaire-based decisions to diagnose IC. This is a result of a lack of objective diagnostic molecular biomarkers. The purpose of this study was to develop a machine learning-based method for diagnosing IC and assess its performance using metabolomics profiles obtained from a prior study. To develop the machine learning algorithm, two classification methods, support vector machine (SVM) and logistic regression (LR), set at various parameters, were applied to 43 IC patients and 16 healthy controls. There were 3 measures used in this study, accuracy, precision (positive predictive value), and recall (sensitivity). Individual precision and recall (PR) curves were drafted. Since the sample size was relatively small, complicated deep learning could not be done. We achieved a 76%-86% accuracy with leave-one-out cross validation depending on the method and parameters set. The highest accuracy achieved was 86.4% using SVM with a polynomial kernel degree set to 5, but a larger area under the curve (AUC) from the PR curve was achieved using LR with a l 1-norm regularizer. The AUC was greater than 0.9 in its ability to discriminate IC patients from controls, suggesting that the algorithm works well in identifying IC, even when there is a class distribution imbalance between the IC and control samples. This finding provides further insight into utilizing previously identified urinary metabolic biomarkers in developing machine learning algorithms that can be applied in the clinical setting.","PeriodicalId":72421,"journal":{"name":"Bladder (San Francisco, Calif.)","volume":"7 2","pages":"e43"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/80/c3/bladder-7-2-e43.PMC7401992.pdf","citationCount":"7","resultStr":"{\"title\":\"Classification of the urinary metabolome using machine learning and potential applications to diagnosing interstitial cystitis.\",\"authors\":\"Feng Tong, Muhammad Shahid, Peng Jin, Sungyong Jung, Won Hwa Kim, Jayoung Kim\",\"doi\":\"10.14440/bladder.2020.815\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advent of artificial intelligence (AI) in biostatistical analysis and modeling, machine learning can potentially be applied into developing diagnostic models for interstitial cystitis (IC). In the current clinical setting, urologists are dependent on cystoscopy and questionnaire-based decisions to diagnose IC. This is a result of a lack of objective diagnostic molecular biomarkers. The purpose of this study was to develop a machine learning-based method for diagnosing IC and assess its performance using metabolomics profiles obtained from a prior study. To develop the machine learning algorithm, two classification methods, support vector machine (SVM) and logistic regression (LR), set at various parameters, were applied to 43 IC patients and 16 healthy controls. There were 3 measures used in this study, accuracy, precision (positive predictive value), and recall (sensitivity). Individual precision and recall (PR) curves were drafted. Since the sample size was relatively small, complicated deep learning could not be done. We achieved a 76%-86% accuracy with leave-one-out cross validation depending on the method and parameters set. The highest accuracy achieved was 86.4% using SVM with a polynomial kernel degree set to 5, but a larger area under the curve (AUC) from the PR curve was achieved using LR with a l 1-norm regularizer. The AUC was greater than 0.9 in its ability to discriminate IC patients from controls, suggesting that the algorithm works well in identifying IC, even when there is a class distribution imbalance between the IC and control samples. This finding provides further insight into utilizing previously identified urinary metabolic biomarkers in developing machine learning algorithms that can be applied in the clinical setting.\",\"PeriodicalId\":72421,\"journal\":{\"name\":\"Bladder (San Francisco, Calif.)\",\"volume\":\"7 2\",\"pages\":\"e43\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/80/c3/bladder-7-2-e43.PMC7401992.pdf\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bladder (San Francisco, Calif.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14440/bladder.2020.815\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bladder (San Francisco, Calif.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14440/bladder.2020.815","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

随着人工智能(AI)在生物统计分析和建模方面的出现，机器学习可以潜在地应用于开发间质性膀胱炎(IC)的诊断模型。在目前的临床环境中，泌尿科医生依赖膀胱镜检查和基于问卷的决定来诊断IC。这是缺乏客观诊断的分子生物标志物的结果。本研究的目的是开发一种基于机器学习的方法来诊断IC，并使用从先前研究中获得的代谢组学资料评估其性能。为了开发机器学习算法，采用支持向量机(SVM)和逻辑回归(LR)两种分类方法，设置不同的参数，对43例IC患者和16例健康对照进行分类。本研究使用了3种测量方法，准确度、精密度(阳性预测值)和召回率(敏感性)。绘制了个体精密度和召回率曲线。由于样本量相对较小，无法进行复杂的深度学习。根据方法和参数设置，我们通过留一交叉验证实现了76%-86%的准确率。使用多项式核度设置为5的SVM获得的最高准确率为86.4%，但使用带有11范数正则化器的LR获得的PR曲线下面积(AUC)更大。在区分IC患者和对照组的能力方面，AUC大于0.9，这表明该算法在识别IC方面效果很好，即使IC和对照样本之间存在类别分布不平衡。这一发现为利用先前确定的尿液代谢生物标志物开发可应用于临床环境的机器学习算法提供了进一步的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Classification of the urinary metabolome using machine learning and potential applications to diagnosing interstitial cystitis.

With the advent of artificial intelligence (AI) in biostatistical analysis and modeling, machine learning can potentially be applied into developing diagnostic models for interstitial cystitis (IC). In the current clinical setting, urologists are dependent on cystoscopy and questionnaire-based decisions to diagnose IC. This is a result of a lack of objective diagnostic molecular biomarkers. The purpose of this study was to develop a machine learning-based method for diagnosing IC and assess its performance using metabolomics profiles obtained from a prior study. To develop the machine learning algorithm, two classification methods, support vector machine (SVM) and logistic regression (LR), set at various parameters, were applied to 43 IC patients and 16 healthy controls. There were 3 measures used in this study, accuracy, precision (positive predictive value), and recall (sensitivity). Individual precision and recall (PR) curves were drafted. Since the sample size was relatively small, complicated deep learning could not be done. We achieved a 76%-86% accuracy with leave-one-out cross validation depending on the method and parameters set. The highest accuracy achieved was 86.4% using SVM with a polynomial kernel degree set to 5, but a larger area under the curve (AUC) from the PR curve was achieved using LR with a l ₁-norm regularizer. The AUC was greater than 0.9 in its ability to discriminate IC patients from controls, suggesting that the algorithm works well in identifying IC, even when there is a class distribution imbalance between the IC and control samples. This finding provides further insight into utilizing previously identified urinary metabolic biomarkers in developing machine learning algorithms that can be applied in the clinical setting.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助