A stacked machine learning-based classification model for endometriosis and adenomyosis: a retrospective cohort study utilizing peripheral blood and coagulation markers.
{"title":"A stacked machine learning-based classification model for endometriosis and adenomyosis: a retrospective cohort study utilizing peripheral blood and coagulation markers.","authors":"Weiying Wang, Weiwei Zeng, Sen Yang","doi":"10.3389/fdgth.2024.1463419","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Endometriosis (EMs) and adenomyosis (AD) are common gynecological diseases that impact women's health, and they share symptoms such as dysmenorrhea, chronic pain, and infertility, which adversely affect women's quality of life. Current diagnostic approaches for EMs and AD involve invasive surgical procedures, and thus, methods of noninvasive differentiation between EMs and AD are needed. This retrospective cohort study introduces a novel, noninvasive classification methodology employing a stacked ensemble machine learning (ML) model that utilizes peripheral blood and coagulation markers to distinguish between EMs and AD.</p><p><strong>Methods: </strong>The study included a total of 558 patients (329 with EMs and 229 with AD), in whom key hematological and coagulation markers were analyzed to identify distinctive profiles. Feature selection was conducted through ML (logistic regression, support vector machine, and K-nearest neighbors) to determine significant hematological markers.</p><p><strong>Results: </strong>Red cell distribution width, mean corpuscular hemoglobin concentration, activated partial thromboplastin time, international normalized ratio, and antithrombin III were proved to be the key distinguishing indexes for disease differentiation. Among all the ML classification models developed, the stacked ensemble model demonstrated superior performance (area under the curve = 0.803, 95% credibility interval = 0.701-0.904). Our findings demonstrate the effectiveness of the stacked ensemble ML model for classifying EMs and AD.</p><p><strong>Discussion: </strong>Integrating biomarkers into this multi-algorithm framework offers a novel approach to noninvasive diagnosis. These results advocate for the application of stacked ensemble ML utilizing cost-effective and readily available peripheral blood and coagulation indicators for the early, rapid, and noninvasive differential diagnosis of EMs and AD, offering a potentially transformative approach for clinical decision-making and personalized treatment strategies.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":"6 ","pages":"1463419"},"PeriodicalIF":3.2000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11428011/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdgth.2024.1463419","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Endometriosis (EMs) and adenomyosis (AD) are common gynecological diseases that impact women's health, and they share symptoms such as dysmenorrhea, chronic pain, and infertility, which adversely affect women's quality of life. Current diagnostic approaches for EMs and AD involve invasive surgical procedures, and thus, methods of noninvasive differentiation between EMs and AD are needed. This retrospective cohort study introduces a novel, noninvasive classification methodology employing a stacked ensemble machine learning (ML) model that utilizes peripheral blood and coagulation markers to distinguish between EMs and AD.
Methods: The study included a total of 558 patients (329 with EMs and 229 with AD), in whom key hematological and coagulation markers were analyzed to identify distinctive profiles. Feature selection was conducted through ML (logistic regression, support vector machine, and K-nearest neighbors) to determine significant hematological markers.
Results: Red cell distribution width, mean corpuscular hemoglobin concentration, activated partial thromboplastin time, international normalized ratio, and antithrombin III were proved to be the key distinguishing indexes for disease differentiation. Among all the ML classification models developed, the stacked ensemble model demonstrated superior performance (area under the curve = 0.803, 95% credibility interval = 0.701-0.904). Our findings demonstrate the effectiveness of the stacked ensemble ML model for classifying EMs and AD.
Discussion: Integrating biomarkers into this multi-algorithm framework offers a novel approach to noninvasive diagnosis. These results advocate for the application of stacked ensemble ML utilizing cost-effective and readily available peripheral blood and coagulation indicators for the early, rapid, and noninvasive differential diagnosis of EMs and AD, offering a potentially transformative approach for clinical decision-making and personalized treatment strategies.
导言:子宫内膜异位症(EMs)和子宫腺肌症(AD)是影响妇女健康的常见妇科疾病,它们都有痛经、慢性疼痛和不孕等症状,对妇女的生活质量造成不利影响。目前对子宫内膜异位症和子宫内膜异位症的诊断方法涉及侵入性外科手术,因此需要采用非侵入性方法来区分子宫内膜异位症和子宫内膜异位症。这项回顾性队列研究介绍了一种新颖的无创分类方法,该方法采用叠加式机器学习(ML)模型,利用外周血和凝血标记物来区分EM和AD:该研究共纳入 558 名患者(329 名 EMs 患者和 229 名 AD 患者),对他们的主要血液和凝血标记物进行分析,以确定独特的特征。通过ML(逻辑回归、支持向量机和K-近邻)进行特征选择,以确定重要的血液学标志物:结果:红细胞分布宽度、平均血红蛋白浓度、活化部分凝血活酶时间、国际标准化比率和抗凝血酶 III 被证明是区分疾病的关键指标。在所有已开发的多重层析分类模型中,堆叠集合模型表现出更优越的性能(曲线下面积 = 0.803,95% 可信区间 = 0.701-0.904)。我们的研究结果证明了堆叠集合 ML 模型在 EM 和 AD 分类中的有效性:讨论:将生物标记物纳入多算法框架为无创诊断提供了一种新方法。这些结果主张利用具有成本效益且随时可用的外周血和凝血指标,将堆叠集合 ML 应用于 EM 和 AD 的早期、快速和无创鉴别诊断,为临床决策和个性化治疗策略提供了一种潜在的变革性方法。