A recursive embedding and clustering technique for unraveling asymptomatic kidney disease using laboratory data and machine learning.

IF 3.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES Scientific Reports Pub Date : 2025-02-17 DOI:10.1038/s41598-025-89499-8
Eman Alqaissi, Abdulmohsen Algarni, Mohammed Alshehri, Husain Alkhaldy, Afnan Alshehri
{"title":"A recursive embedding and clustering technique for unraveling asymptomatic kidney disease using laboratory data and machine learning.","authors":"Eman Alqaissi, Abdulmohsen Algarni, Mohammed Alshehri, Husain Alkhaldy, Afnan Alshehri","doi":"10.1038/s41598-025-89499-8","DOIUrl":null,"url":null,"abstract":"<p><p>Traditional methods for diagnosing chronic kidney disease (CKD) via laboratory data may not be capable of identifying early kidney disease. Kidney biopsy is unsuitable for regular screening, and imaging tests are costly and time-consuming. Several studies have implemented artificial intelligence (AI) to detect CKD. However, these studies used small datasets, had overfitting problems, lacked generalizability, or used complex algorithms that may require additional computational resources. In this study, we collected and analyzed center-based data and used a recursive embedding and clustering technique to reduce their dimensionality. We identified three clusters from 1600 records. We focused on the second cluster, as most of the characteristics had values in the normal ranges. Normal range values for most indicators generally represent stable kidney function with minor signs of strain, which often remain asymptomatic. Creatinine and eGFR levels within the threshold ranges indicate early kidney stress without filtration issues, which require close monitoring. The gradient-boosting algorithm showed superior performance among all algorithms in detecting these clusters. We evaluated an additional 400 unlabeled records to validate our method. This research can help clinicians automatically detect initial signs in numerous patients via routine tests to prevent the consequences of late-stage CKD detection.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"5820"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11832896/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-89499-8","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional methods for diagnosing chronic kidney disease (CKD) via laboratory data may not be capable of identifying early kidney disease. Kidney biopsy is unsuitable for regular screening, and imaging tests are costly and time-consuming. Several studies have implemented artificial intelligence (AI) to detect CKD. However, these studies used small datasets, had overfitting problems, lacked generalizability, or used complex algorithms that may require additional computational resources. In this study, we collected and analyzed center-based data and used a recursive embedding and clustering technique to reduce their dimensionality. We identified three clusters from 1600 records. We focused on the second cluster, as most of the characteristics had values in the normal ranges. Normal range values for most indicators generally represent stable kidney function with minor signs of strain, which often remain asymptomatic. Creatinine and eGFR levels within the threshold ranges indicate early kidney stress without filtration issues, which require close monitoring. The gradient-boosting algorithm showed superior performance among all algorithms in detecting these clusters. We evaluated an additional 400 unlabeled records to validate our method. This research can help clinicians automatically detect initial signs in numerous patients via routine tests to prevent the consequences of late-stage CKD detection.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用实验室数据和机器学习的递归嵌入和聚类技术来揭示无症状肾脏疾病。
通过实验室数据诊断慢性肾脏疾病的传统方法可能无法识别早期肾脏疾病。肾活检不适合常规筛查,影像学检查既昂贵又耗时。一些研究已经使用人工智能(AI)来检测CKD。然而,这些研究使用了小数据集,存在过拟合问题,缺乏通用性,或者使用了可能需要额外计算资源的复杂算法。在这项研究中,我们收集和分析了基于中心的数据,并使用递归嵌入和聚类技术来降低它们的维数。我们从1600条记录中确定了三个集群。我们关注的是第二类,因为大多数特征的值都在正常范围内。大多数指标的正常范围值通常代表肾功能稳定,有轻微的紧张迹象,通常没有症状。肌酐和eGFR水平在阈值范围内表明早期肾应激没有过滤问题,这需要密切监测。梯度增强算法在检测聚类方面表现出较好的性能。我们评估了另外400条未标记的记录来验证我们的方法。这项研究可以帮助临床医生通过常规检查自动检测许多患者的初始体征,以防止晚期CKD检测的后果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Scientific Reports
Scientific Reports Natural Science Disciplines-
CiteScore
7.50
自引率
4.30%
发文量
19567
审稿时长
3.9 months
期刊介绍: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.
期刊最新文献
Fixed-point topology meets fractal memory: a Kutumba-stabilized framework for nonlocal fractal-fractional dynamics. DIPLI: deep image prior lucky imaging for blind astronomical image restoration. Advantages of magnetic compression in rat colonic anastomosis: a comparative study with a transcriptomic perspective. Bayesian optimization for uncertainty-aware prediction of rainfall-induced deformation in embankment dams. Synergetic mechanism by decompression and bone bioactive factors to induce the spontaneous resolution of lumbar disc herniation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1