{"title":"Combination of plasma-based lipidomics and machine learning provides a useful diagnostic tool for ovarian cancer.","authors":"Jinhua Rong, Guojun Sun, Jing Zhu, Yiming Zhu, Zhongjian Chen","doi":"10.1016/j.jpba.2024.116559","DOIUrl":null,"url":null,"abstract":"<p><p>Ovarian cancer (OC), the second leading cause of death among gynecological cancers, is often diagnosed at an advanced stage due to its asymptomatic nature at early stages. This study aimed to explore the diagnostic potential of plasma-based lipidomics combined with machine learning (ML) in OC. Non-targeted lipidomics analysis was conducted on plasma samples from participants with epithelial ovarian cancer (EOC), benign ovarian tumor (BOT), and healthy control (HC). The samples were randomly divided into a train set and a test set. Differential lipids between groups were selected using two-tailed Student's t-test and partial least squares discriminant analysis (PLS-DA). Both single lipid-based receiver operating characteristic (ROC) model, and multiple lipid-based ML model, were constructed to investigate the diagnostic value of the differential lipids. The results showed several lipids with significant diagnostic potential. ST 27:2;O achieved the highest prediction accuracy of 0.92 in distinguishing EOC from HC. DG 42:2 had the highest prediction accuracy of 0.96 in diagnosing BOT from HC. Cer d18:1/18:0 had the highest prediction accuracy of 0.65 in differentiating EOC from BOT. Furthermore, multiple lipid-based ML models illustrated better diagnostic performance. K-nearest neighbors (k-NN), partial least squares (PLS), and random forest (RF) models achieved the highest prediction accuracy of 0.96 in discriminating EOC from HC. The support vector machine (SVM) model reached the highest prediction accuracy both in distinguishing BOT from HC, and in differentiating EOC from BOT, with accuracies of 1.00 and 0.74, respectively. In conclusion, this study revealed that the combination of plasma-based lipidomics and ML algorithms is an effective method for diagnosing OC.</p>","PeriodicalId":16685,"journal":{"name":"Journal of pharmaceutical and biomedical analysis","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of pharmaceutical and biomedical analysis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jpba.2024.116559","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Ovarian cancer (OC), the second leading cause of death among gynecological cancers, is often diagnosed at an advanced stage due to its asymptomatic nature at early stages. This study aimed to explore the diagnostic potential of plasma-based lipidomics combined with machine learning (ML) in OC. Non-targeted lipidomics analysis was conducted on plasma samples from participants with epithelial ovarian cancer (EOC), benign ovarian tumor (BOT), and healthy control (HC). The samples were randomly divided into a train set and a test set. Differential lipids between groups were selected using two-tailed Student's t-test and partial least squares discriminant analysis (PLS-DA). Both single lipid-based receiver operating characteristic (ROC) model, and multiple lipid-based ML model, were constructed to investigate the diagnostic value of the differential lipids. The results showed several lipids with significant diagnostic potential. ST 27:2;O achieved the highest prediction accuracy of 0.92 in distinguishing EOC from HC. DG 42:2 had the highest prediction accuracy of 0.96 in diagnosing BOT from HC. Cer d18:1/18:0 had the highest prediction accuracy of 0.65 in differentiating EOC from BOT. Furthermore, multiple lipid-based ML models illustrated better diagnostic performance. K-nearest neighbors (k-NN), partial least squares (PLS), and random forest (RF) models achieved the highest prediction accuracy of 0.96 in discriminating EOC from HC. The support vector machine (SVM) model reached the highest prediction accuracy both in distinguishing BOT from HC, and in differentiating EOC from BOT, with accuracies of 1.00 and 0.74, respectively. In conclusion, this study revealed that the combination of plasma-based lipidomics and ML algorithms is an effective method for diagnosing OC.
期刊介绍:
This journal is an international medium directed towards the needs of academic, clinical, government and industrial analysis by publishing original research reports and critical reviews on pharmaceutical and biomedical analysis. It covers the interdisciplinary aspects of analysis in the pharmaceutical, biomedical and clinical sciences, including developments in analytical methodology, instrumentation, computation and interpretation. Submissions on novel applications focusing on drug purity and stability studies, pharmacokinetics, therapeutic monitoring, metabolic profiling; drug-related aspects of analytical biochemistry and forensic toxicology; quality assurance in the pharmaceutical industry are also welcome.
Studies from areas of well established and poorly selective methods, such as UV-VIS spectrophotometry (including derivative and multi-wavelength measurements), basic electroanalytical (potentiometric, polarographic and voltammetric) methods, fluorimetry, flow-injection analysis, etc. are accepted for publication in exceptional cases only, if a unique and substantial advantage over presently known systems is demonstrated. The same applies to the assay of simple drug formulations by any kind of methods and the determination of drugs in biological samples based merely on spiked samples. Drug purity/stability studies should contain information on the structure elucidation of the impurities/degradants.