首页 > 最新文献

Health Information Science and Systems最新文献

英文 中文
Reconstructing brain causal dynamics for subject and task fingerprints using fMRI time-series data. 利用功能磁共振成像时间序列数据重建受试者和任务指纹的大脑因果动力学。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-10-28 eCollection Date: 2025-12-01 DOI: 10.1007/s13755-025-00388-w
Dachuan Song, Li Shen, Duy Duong-Tran, Xuan Wang

Purpose: Recently, there has been a revived interest in system neuroscience causation models, driven by their unique capability to unravel complex relationships in multi-scale brain networks. In this paper, we present a novel method that leverages causal dynamics to achieve effective fMRI-based subject and task fingerprinting.

Methods: By applying an implicit-explicit discretization scheme, we develop a two-timescale linear state-space model. Through data-driven identification of its parameters, the model captures causal signatures, including directed interactions among brain regions from a spatial perspective, and disentangled fast and slow dynamic modes of brain activity from a temporal perspective. These causal signatures are then integrated with: (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification. Furthermore, we introduce the concept of the brain reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks.

Results: We evaluate the proposed approach using the Human Connectome Project dataset and demonstrate its advantage over non-causality-based methods. The obtained causal signatures are visualized and demonstrate clear biological relevance with established understandings of brain function.

Conclusion: We verified the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Additionally, our work paves the way for further studies on causal fingerprints with potential applications in both healthy controls and neurodegenerative diseases.

目的:最近,由于系统神经科学的独特能力能够揭示多尺度大脑网络中的复杂关系,人们对系统神经科学的因果关系模型重新产生了兴趣。在本文中,我们提出了一种利用因果动力学来实现有效的基于fmri的主题和任务指纹识别的新方法。方法:采用隐式-显式离散化方法,建立双时间尺度线性状态空间模型。通过数据驱动的参数识别,该模型从空间角度捕捉因果特征,包括大脑区域之间的直接相互作用,并从时间角度解开大脑活动的快慢动态模式。然后将这些因果特征集成到:(i)基于模型的主题识别的模态分解和投影方法,以及(ii)基于学习的任务分类的图神经网络(GNN)框架。此外,我们引入了大脑可达性景观的概念,作为一种新的可视化工具,它定量表征了各种功能磁共振成像任务下大脑区域的最大可能激活水平。结果:我们使用人类连接组项目数据集评估了所提出的方法,并证明了其优于非基于因果关系的方法。获得的因果特征是可视化的,并展示了与已建立的脑功能理解明确的生物学相关性。结论:验证了利用脑因果特征进行主题和任务指纹识别的可行性和有效性。此外,我们的工作为进一步研究因果指纹在健康对照和神经退行性疾病中的潜在应用铺平了道路。
{"title":"Reconstructing brain causal dynamics for subject and task fingerprints using fMRI time-series data.","authors":"Dachuan Song, Li Shen, Duy Duong-Tran, Xuan Wang","doi":"10.1007/s13755-025-00388-w","DOIUrl":"10.1007/s13755-025-00388-w","url":null,"abstract":"<p><strong>Purpose: </strong>Recently, there has been a revived interest in system neuroscience causation models, driven by their unique capability to unravel complex relationships in multi-scale brain networks. In this paper, we present a novel method that leverages causal dynamics to achieve effective fMRI-based subject and task fingerprinting.</p><p><strong>Methods: </strong>By applying an implicit-explicit discretization scheme, we develop a two-timescale linear state-space model. Through data-driven identification of its parameters, the model captures causal signatures, including directed interactions among brain regions from a spatial perspective, and disentangled fast and slow dynamic modes of brain activity from a temporal perspective. These causal signatures are then integrated with: (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification. Furthermore, we introduce the concept of the brain reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks.</p><p><strong>Results: </strong>We evaluate the proposed approach using the Human Connectome Project dataset and demonstrate its advantage over non-causality-based methods. The obtained causal signatures are visualized and demonstrate clear biological relevance with established understandings of brain function.</p><p><strong>Conclusion: </strong>We verified the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Additionally, our work paves the way for further studies on causal fingerprints with potential applications in both healthy controls and neurodegenerative diseases.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"70"},"PeriodicalIF":3.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569264/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145410337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating topological and graph-theoretical approaches to extract complex multimodal brain connectivity patterns in multiple sclerosis. 评估在多发性硬化症中提取复杂多模态大脑连接模式的拓扑和图理论方法。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-10-19 eCollection Date: 2025-12-01 DOI: 10.1007/s13755-025-00386-y
Toni Lozano-Bagén, Eloy Martinez-Heras, Giuseppe Pontillo, Elisabeth Solana, Francesc Vivó, Maria Petracca, Alberto Calvi, Sandra Garrido-Romero, Albert Solé-Ribalta, Sara Llufriu, Ferran Prados, Jordi Casas-Roma

Brain networks, or graphs, derived from magnetic resonance imaging (MRI) offer a powerful framework for representing the structural, morphological, and functional organization of the brain. Graph-theoretical metrics have been widely employed to characterize properties such as efficiency, integration, and communication within these networks. More recently, topological data analysis techniques, such as persistent homology and Betti curves, have emerged as complementary approaches for capturing higher-order network patterns. In this study, we present a comparative analysis of these feature-generation methodologies in the context of neurodegenerative disease. Specifically, we evaluate the effectiveness of Betti curves and graph-theoretical metrics in extracting features for distinguishing people with multiple sclerosis (PwMS) from healthy volunteers (HV). Features are derived from structural connectivity, morphological gray matter, and resting-state functional networks, using both single layer and multilayer graph architectures. Our experiments, conducted on a cohort of PwMS and HV, demonstrate that features extracted using Betti curves generally outperform those based on graph-theoretical metrics. Furthermore, we show that multimodal data in terms of feature concatenation and multilayer graph architectures provide a more comprehensive representation of alterations in complex brain mechanisms associated with MS, leading to improved classification performance. These findings highlight the potential of topological features and multimodal integration for enhancing the understanding and diagnosis of neurodegenerative disorders.

从磁共振成像(MRI)衍生出来的脑网络或图,为表示大脑的结构、形态和功能组织提供了一个强大的框架。图理论度量被广泛用于描述这些网络中的效率、集成和通信等属性。最近,拓扑数据分析技术,如持久同调和贝蒂曲线,已经成为捕获高阶网络模式的补充方法。在这项研究中,我们提出了在神经退行性疾病的背景下这些特征生成方法的比较分析。具体而言,我们评估了Betti曲线和图形理论指标在提取特征以区分多发性硬化症(PwMS)和健康志愿者(HV)方面的有效性。特征来源于结构连接、形态灰质和静息状态功能网络,使用单层和多层图架构。我们在PwMS和HV队列上进行的实验表明,使用Betti曲线提取的特征通常优于基于图理论指标的特征。此外,我们表明,在特征连接和多层图架构方面的多模态数据提供了与MS相关的复杂大脑机制变化的更全面的表示,从而提高了分类性能。这些发现突出了拓扑特征和多模式整合的潜力,以加强对神经退行性疾病的理解和诊断。
{"title":"Evaluating topological and graph-theoretical approaches to extract complex multimodal brain connectivity patterns in multiple sclerosis.","authors":"Toni Lozano-Bagén, Eloy Martinez-Heras, Giuseppe Pontillo, Elisabeth Solana, Francesc Vivó, Maria Petracca, Alberto Calvi, Sandra Garrido-Romero, Albert Solé-Ribalta, Sara Llufriu, Ferran Prados, Jordi Casas-Roma","doi":"10.1007/s13755-025-00386-y","DOIUrl":"10.1007/s13755-025-00386-y","url":null,"abstract":"<p><p>Brain networks, or graphs, derived from magnetic resonance imaging (MRI) offer a powerful framework for representing the structural, morphological, and functional organization of the brain. Graph-theoretical metrics have been widely employed to characterize properties such as efficiency, integration, and communication within these networks. More recently, topological data analysis techniques, such as persistent homology and Betti curves, have emerged as complementary approaches for capturing higher-order network patterns. In this study, we present a comparative analysis of these feature-generation methodologies in the context of neurodegenerative disease. Specifically, we evaluate the effectiveness of Betti curves and graph-theoretical metrics in extracting features for distinguishing people with multiple sclerosis (PwMS) from healthy volunteers (HV). Features are derived from structural connectivity, morphological gray matter, and resting-state functional networks, using both single layer and multilayer graph architectures. Our experiments, conducted on a cohort of PwMS and HV, demonstrate that features extracted using Betti curves generally outperform those based on graph-theoretical metrics. Furthermore, we show that multimodal data in terms of feature concatenation and multilayer graph architectures provide a more comprehensive representation of alterations in complex brain mechanisms associated with MS, leading to improved classification performance. These findings highlight the potential of topological features and multimodal integration for enhancing the understanding and diagnosis of neurodegenerative disorders.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"68"},"PeriodicalIF":3.4,"publicationDate":"2025-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12535957/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145349210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Timelygpt: extrapolatable transformer pre-training for long-term time-series forecasting in healthcare. 时间:用于医疗保健长期时间序列预测的可外推变压器预训练。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-10-14 eCollection Date: 2025-12-01 DOI: 10.1007/s13755-025-00384-0
Ziyang Song, Qincheng Lu, Hao Xu, Ziqi Yang, He Zhu, David Buckeridge, Yue Li

Purpose: Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success in Natural Language Processing and Computer Vision domains. However, the development of PTMs on healthcare time-series data is lagging behind. This underscores the limitations of the existing transformer-based architectures, particularly their scalability to handle large-scale time series and ability to capture long-term temporal dependencies.

Methods: In this study, we present Timely Generative Pre-trained Transformer (TimelyGPT). TimelyGPT employs an extrapolatable position (xPos) embedding to encode trend and periodic patterns into time-series representations. It also integrates recurrent attention and temporal convolution modules to effectively capture global-local temporal dependencies.

Results: Our experiments show that TimelyGPT excels in modeling continuously monitored biosignals and irregularly-sampled time series data commonly observed in longitudinal electronic health records (EHRs). In forecasting continuous biosignals, TimelyGPT achieves accurate extrapolation up to 6000 timesteps of body temperature during the sleep stage transition given a short look-up window (i.e., prompt) containing only 2000 timesteps. For irregularly-sampled time series, TimelyGPT with a proposed time-specific inference demonstrates high top recall scores in predicting future diagnoses using early diagnostic records, effectively handling irregular intervals between clinical records. We further demonstrate that TimelyGPT achieves strong discriminative performance on both continuous and irregularly-sampled time series.

Conclusion: Together, we envision TimelyGPT to be useful in various health domains, including long-term patient health state forecasting, patient risk trajectory prediction, and disease classification. Its code is available at Github.

目的:大规模预训练模型(PTMs)如BERT和GPT最近在自然语言处理和计算机视觉领域取得了巨大的成功。然而,基于医疗时序数据的ptm的开发相对滞后。这强调了现有的基于转换器的体系结构的局限性,特别是它们处理大规模时间序列的可伸缩性和捕获长期时间依赖性的能力。方法:在本研究中,我们提出了适时生成预训练变压器(TimelyGPT)。TimelyGPT采用可外推位置(xPos)嵌入,将趋势和周期模式编码为时间序列表示。它还集成了循环关注和时间卷积模块,以有效地捕获全局-局部时间依赖性。结果:我们的实验表明,TimelyGPT在模拟连续监测的生物信号和纵向电子健康记录(EHRs)中常见的不规则采样时间序列数据方面表现出色。在预测连续的生物信号时,TimelyGPT在睡眠阶段转换期间实现了精确的体温外推,最高可达6000个时间步,给出了一个只有2000个时间步的短查找窗口(即提示)。对于不规则采样的时间序列,TimelyGPT与提出的特定时间推断在使用早期诊断记录预测未来诊断方面显示出较高的最高召回分数,有效地处理临床记录之间的不规则间隔。我们进一步证明了TimelyGPT在连续和不规则采样时间序列上都有很强的判别性能。总之,我们设想TimelyGPT在各种健康领域都是有用的,包括长期患者健康状态预测、患者风险轨迹预测和疾病分类。它的代码可以在Github上找到。
{"title":"Timelygpt: extrapolatable transformer pre-training for long-term time-series forecasting in healthcare.","authors":"Ziyang Song, Qincheng Lu, Hao Xu, Ziqi Yang, He Zhu, David Buckeridge, Yue Li","doi":"10.1007/s13755-025-00384-0","DOIUrl":"10.1007/s13755-025-00384-0","url":null,"abstract":"<p><strong>Purpose: </strong>Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success in Natural Language Processing and Computer Vision domains. However, the development of PTMs on healthcare time-series data is lagging behind. This underscores the limitations of the existing transformer-based architectures, particularly their scalability to handle large-scale time series and ability to capture long-term temporal dependencies.</p><p><strong>Methods: </strong>In this study, we present Timely Generative Pre-trained Transformer (TimelyGPT). TimelyGPT employs an extrapolatable position (xPos) embedding to encode trend and periodic patterns into time-series representations. It also integrates recurrent attention and temporal convolution modules to effectively capture global-local temporal dependencies.</p><p><strong>Results: </strong>Our experiments show that TimelyGPT excels in modeling continuously monitored biosignals and irregularly-sampled time series data commonly observed in longitudinal electronic health records (EHRs). In forecasting continuous biosignals, TimelyGPT achieves accurate extrapolation up to 6000 timesteps of body temperature during the sleep stage transition given a short look-up window (i.e., prompt) containing only 2000 timesteps. For irregularly-sampled time series, TimelyGPT with a proposed time-specific inference demonstrates high top recall scores in predicting future diagnoses using early diagnostic records, effectively handling irregular intervals between clinical records. We further demonstrate that TimelyGPT achieves strong discriminative performance on both continuous and irregularly-sampled time series.</p><p><strong>Conclusion: </strong>Together, we envision TimelyGPT to be useful in various health domains, including long-term patient health state forecasting, patient risk trajectory prediction, and disease classification. Its code is available at Github.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"64"},"PeriodicalIF":3.4,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12521703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145309483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clustering environmental pollutants associated with increased risk of metabolic disease: a hierarchical analysis. 聚类与代谢性疾病风险增加相关的环境污染物:层次分析
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-09-24 eCollection Date: 2025-12-01 DOI: 10.1007/s13755-025-00375-1
Brooke Scardino, Akshat Agrawal, Diensn G Xing, Jackson L St Pierre, Md Mostafizur Rahman Bhuiyan, Kanon Kamronnaher, Md Shenuarin Bhuiyan, Oren Rom, Steven A Conrad, John A Vanchiere, A Wayne Orr, Christopher G Kevil, Mohammad Alfrad Nobel Bhuiyan

Background: Metabolic syndrome (MetS), which affects one-third of the population of the United States, is a risk factor for chronic diseases such as cardiovascular diseases, stroke, and type 2 diabetes mellitus. Heavy metals (HM) and volatile organic compounds (VOC) are environmental factors typically occurring as mixtures. Although exposures to these substances have been studied separately, the impact of combined HM and VOC exposure on humans and their subsequent risk of developing MetS has not been explored. This study investigates whether combined exposure to HMs and VOCs affects the risk of developing MetS.

Methods: The National Health and Nutrition Examination Survey database from 2011 to 2020 was used to determine exposure to HMs and VOCs detected in urine samples from individuals with MetS. Multiple Chi-squared and t-tests were performed to identify variables significantly associated with MetS. Logistic regression analysis was performed on unmatched and age-matched 1:1 case-control data to evaluate whether an association exists among HMs, VOCs, and demographic factors and MetS. A hierarchical cluster analysis was performed to identify combinations of HMs and VOCs linked with an increased risk of MetS.

Results: Logistic regression analysis on unmatched and matched data showed that increasing age and female sex were significantly associated (p < 0.05) with MetS. Among the HMs and VOCs, only N-acetyl-S-(2-cyanoethyl)-l-cysteine and N-acetyl-S-(2-hydroxyethyl)-l-cysteine were found to be significantly associated with MetS. Cluster analysis showed that Cluster 3 was significantly associated with MetS (p < 0.05; OR = 1.49), suggesting that exposure to barium, cadmium, cesium, lead, and VOCs may increase the risk of MetS. After adjusting for covariates, none of the clusters showed a significant association (p > 0.05). In contrast, age (OR = 1.07) and monthly poverty level index ≤ 1.3 (OR = 1.16) were significantly associated with MetS (p < 0.05).

Conclusion: This study revealed that age, lower socioeconomic status, and multiple exposures to combined HM and VOC may have a greater impact with an increased risk of MetS. Cluster analysis highlighted the potential combination of the exposures linked to MetS and the likelihood that demographic factors affect MetS more than exposure to HMs and VOCs. However, further research is needed.

Supplementary information: The online version contains supplementary material available at 10.1007/s13755-025-00375-1.

背景:代谢综合征(MetS)影响着美国三分之一的人口,是心血管疾病、中风和2型糖尿病等慢性疾病的危险因素。重金属(HM)和挥发性有机化合物(VOC)是通常以混合物形式出现的环境因素。虽然对这些物质的暴露已经分别进行了研究,但HM和VOC联合暴露对人类的影响及其随后发生MetS的风险尚未得到探讨。本研究调查了混合暴露于有机污染物和挥发性有机化合物是否会影响患MetS的风险。方法:使用2011年至2020年国家健康与营养检查调查数据库,确定MetS患者尿液样本中检测到的HMs和VOCs暴露情况。进行多重卡方检验和t检验以确定与MetS显著相关的变量。对未匹配和年龄匹配的1:1病例对照数据进行Logistic回归分析,以评估HMs、VOCs、人口因素和MetS之间是否存在关联。进行了分层聚类分析,以确定与MetS风险增加相关的HMs和VOCs组合。结果:对未匹配和匹配的数据进行Logistic回归分析,年龄的增加与女性的性别显著相关(p p p > 0.05)。年龄(OR = 1.07)和月贫困水平指数(OR = 1.16)与MetS有显著相关性(p)。结论:年龄、较低的社会经济地位和多次暴露于HM和VOC联合暴露可能对MetS的风险增加有更大的影响。聚类分析强调了与MetS相关的暴露的潜在组合,以及人口因素比暴露于HMs和VOCs更可能影响MetS。然而,还需要进一步的研究。补充信息:在线版本包含补充资料,提供地址为10.1007/s13755-025-00375-1。
{"title":"Clustering environmental pollutants associated with increased risk of metabolic disease: a hierarchical analysis.","authors":"Brooke Scardino, Akshat Agrawal, Diensn G Xing, Jackson L St Pierre, Md Mostafizur Rahman Bhuiyan, Kanon Kamronnaher, Md Shenuarin Bhuiyan, Oren Rom, Steven A Conrad, John A Vanchiere, A Wayne Orr, Christopher G Kevil, Mohammad Alfrad Nobel Bhuiyan","doi":"10.1007/s13755-025-00375-1","DOIUrl":"10.1007/s13755-025-00375-1","url":null,"abstract":"<p><strong>Background: </strong>Metabolic syndrome (MetS), which affects one-third of the population of the United States, is a risk factor for chronic diseases such as cardiovascular diseases, stroke, and type 2 diabetes mellitus. Heavy metals (HM) and volatile organic compounds (VOC) are environmental factors typically occurring as mixtures. Although exposures to these substances have been studied separately, the impact of combined HM and VOC exposure on humans and their subsequent risk of developing MetS has not been explored. This study investigates whether combined exposure to HMs and VOCs affects the risk of developing MetS.</p><p><strong>Methods: </strong>The National Health and Nutrition Examination Survey database from 2011 to 2020 was used to determine exposure to HMs and VOCs detected in urine samples from individuals with MetS. Multiple Chi-squared and t-tests were performed to identify variables significantly associated with MetS. Logistic regression analysis was performed on unmatched and age-matched 1:1 case-control data to evaluate whether an association exists among HMs, VOCs, and demographic factors and MetS. A hierarchical cluster analysis was performed to identify combinations of HMs and VOCs linked with an increased risk of MetS.</p><p><strong>Results: </strong>Logistic regression analysis on unmatched and matched data showed that increasing age and female sex were significantly associated (<i>p</i> < 0.05) with MetS. Among the HMs and VOCs, only N-acetyl-S-(2-cyanoethyl)-l-cysteine and N-acetyl-S-(2-hydroxyethyl)-l-cysteine were found to be significantly associated with MetS. Cluster analysis showed that Cluster 3 was significantly associated with MetS (<i>p</i> < 0.05; OR = 1.49), suggesting that exposure to barium, cadmium, cesium, lead, and VOCs may increase the risk of MetS. After adjusting for covariates, none of the clusters showed a significant association (<i>p</i> > 0.05). In contrast, age (OR = 1.07) and monthly poverty level index ≤ 1.3 (OR = 1.16) were significantly associated with MetS (<i>p</i> < 0.05).</p><p><strong>Conclusion: </strong>This study revealed that age, lower socioeconomic status, and multiple exposures to combined HM and VOC may have a greater impact with an increased risk of MetS. Cluster analysis highlighted the potential combination of the exposures linked to MetS and the likelihood that demographic factors affect MetS more than exposure to HMs and VOCs. However, further research is needed.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13755-025-00375-1.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"59"},"PeriodicalIF":3.4,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12460862/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification and applications of disease-associated differential human and bacterial proteins with metaproteomic evidence. 基于元蛋白质组学证据的疾病相关人类和细菌差异蛋白的鉴定和应用。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2025-08-29 eCollection Date: 2025-12-01 DOI: 10.1007/s13755-025-00369-z
Jamie Canderan, Moses Stamboulian, Yuzhen Ye

The gut microbiome plays a fundamental role in human health and disease. Individual variations in the microbiome and the corresponding functional implications are key considerations to enhance precision health and medicine. Metaproteomics has recently revealed protein expression that might be associated with human health and disease. Existing studies focused on either human proteins or bacterial proteins that can be identified from (meta)proteomics data sets, but not both. In this study, we examined the feasibility of identifying both human and bacterial proteins that are differentially expressed between healthy and diseased individuals from metaproteomics data sets. We further evaluated different strategies of using identified peptides and proteins for building predictive models. By leveraging existing metaproteomics data sets and a tool that we have developed for metaproteomics data analysis (MetaProD), we were able to derive both human and bacterial differentially expressed proteins that could serve as potential biomarkers for all diseases we studied. We also built predictive models using identified peptides and proteins as features for prediction of human diseases. Our results showed peptide-based identifications over protein-based ones often produce the most accurate models and that feature selection can offer improvements. Prediction accuracy could be further improved, in some cases, by including bacterial identifications, but missing data in bacterial identifications remains problematic.

肠道微生物群在人类健康和疾病中起着重要作用。微生物组的个体差异和相应的功能影响是提高精准健康和医学的关键考虑因素。宏蛋白质组学最近发现了可能与人类健康和疾病相关的蛋白表达。现有的研究要么集中在人类蛋白质,要么集中在可以从(元)蛋白质组学数据集中识别的细菌蛋白质上,但不是两者都可以。在这项研究中,我们研究了从宏蛋白质组学数据集中识别健康和患病个体之间差异表达的人类和细菌蛋白质的可行性。我们进一步评估了使用已识别肽和蛋白质构建预测模型的不同策略。通过利用现有的宏蛋白质组学数据集和我们开发的宏蛋白质组学数据分析工具(MetaProD),我们能够获得人类和细菌差异表达蛋白,这些蛋白可以作为我们研究的所有疾病的潜在生物标志物。我们还建立了预测模型,使用已识别的肽和蛋白质作为预测人类疾病的特征。我们的研究结果表明,基于肽的识别比基于蛋白质的识别通常产生最准确的模型,并且特征选择可以提供改进。在某些情况下,通过包括细菌鉴定,预测精度可以进一步提高,但细菌鉴定中的数据缺失仍然是一个问题。
{"title":"Identification and applications of disease-associated differential human and bacterial proteins with metaproteomic evidence.","authors":"Jamie Canderan, Moses Stamboulian, Yuzhen Ye","doi":"10.1007/s13755-025-00369-z","DOIUrl":"10.1007/s13755-025-00369-z","url":null,"abstract":"<p><p>The gut microbiome plays a fundamental role in human health and disease. Individual variations in the microbiome and the corresponding functional implications are key considerations to enhance precision health and medicine. Metaproteomics has recently revealed protein expression that might be associated with human health and disease. Existing studies focused on either human proteins or bacterial proteins that can be identified from (meta)proteomics data sets, but not both. In this study, we examined the feasibility of identifying both human and bacterial proteins that are differentially expressed between healthy and diseased individuals from metaproteomics data sets. We further evaluated different strategies of using identified peptides and proteins for building predictive models. By leveraging existing metaproteomics data sets and a tool that we have developed for metaproteomics data analysis (MetaProD), we were able to derive both human and bacterial differentially expressed proteins that could serve as potential biomarkers for all diseases we studied. We also built predictive models using identified peptides and proteins as features for prediction of human diseases. Our results showed peptide-based identifications over protein-based ones often produce the most accurate models and that feature selection can offer improvements. Prediction accuracy could be further improved, in some cases, by including bacterial identifications, but missing data in bacterial identifications remains problematic.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"54"},"PeriodicalIF":3.4,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144973812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Margin-aware optimized contrastive learning for enhanced self-supervised histopathological image classification. 修正:边缘感知优化对比学习增强自我监督组织病理图像分类。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2024-12-28 eCollection Date: 2025-12-01 DOI: 10.1007/s13755-024-00325-3
Ekta Gupta, Varun Gupta

[This corrects the article DOI: 10.1007/s13755-024-00316-4.].

[这更正了文章DOI: 10.1007/s13755-024-00316-4]。
{"title":"Correction: <b>Margin-aware optimized contrastive learning for enhanced self-supervised histopathological image classification</b>.","authors":"Ekta Gupta, Varun Gupta","doi":"10.1007/s13755-024-00325-3","DOIUrl":"10.1007/s13755-024-00325-3","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1007/s13755-024-00316-4.].</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"8"},"PeriodicalIF":3.4,"publicationDate":"2024-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11682023/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142907731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new multivariate blood glucose prediction method with hybrid feature clustering and online transfer learning. 采用混合特征聚类和在线迁移学习的新型多变量血糖预测方法。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2024-11-17 eCollection Date: 2024-12-01 DOI: 10.1007/s13755-024-00313-7
Fuqiang You, Guo Zhao, Xinyu Zhang, Ziheng Zhang, Jinli Cao, Hongru Li

Accurate blood glucose (BG) prediction is greatly benefit for the treatment of diabetes. Generally, clinical physicians are required to comprehensively analyze various factors, such as patient's body temperature, meal, sleep, insulin injection, continuous glucose monitoring (CGM), and other information, to evaluate the fluctuation trend of blood glucose. To address this problem, this paper proposes a multivariate blood glucose prediction method based on mixed feature clustering. It clusters time series data with diverse or mixed features related to blood glucose, effectively leveraging correlations and distribution characteristics. By combining incremental clustering of multivariate time series with transfer learning, this method achieves online prediction of blood glucose levels. The experimental results indicate that the proposed method can decrease the prediction error RMSE by 4.2% (PH=30min) and 5.9% (PH=60min). Compared with other prediction methods, the training time of the multivariate prediction method is reduced by 5.2% (PH=30min) and 4.7% (PH=60min). It was also validated and compared with other methods in a real dataset. The proposed method in this study has lower prediction error and better prediction performance in the prediction horizon (PH) of PH=30, 45, 60, 75, and 90 min, respectively. Compared with the traditional unitary and multivariate time series prediction method, the approach proposed in this paper significantly improves the accuracy and robustness of blood glucose prediction. According to the evaluation results on the data set from OhioT1DM and the Sixth People's Hospital of Shanghai, the proposed method has better generalization performance and clinical acceptability.

准确的血糖(BG)预测对糖尿病的治疗大有裨益。一般来说,临床医生需要综合分析患者的体温、进餐、睡眠、胰岛素注射、连续血糖监测(CGM)等多种因素,来评估血糖的波动趋势。针对这一问题,本文提出了一种基于混合特征聚类的多元血糖预测方法。该方法有效利用相关性和分布特征,对与血糖相关的具有多样化或混合特征的时间序列数据进行聚类。通过将多元时间序列的增量聚类与迁移学习相结合,该方法实现了血糖水平的在线预测。实验结果表明,所提出的方法可将预测误差 RMSE 降低 4.2%(PH=30min)和 5.9%(PH=60min)。与其他预测方法相比,多元预测方法的训练时间减少了 5.2%(PH=30min)和 4.7%(PH=60min)。该方法还在真实数据集中与其他方法进行了验证和比较。在 PH=30 分钟、45 分钟、60 分钟、75 分钟和 90 分钟的预测范围(PH)内,本研究提出的方法具有更低的预测误差和更好的预测性能。与传统的单变量和多变量时间序列预测方法相比,本文提出的方法显著提高了血糖预测的准确性和鲁棒性。根据对 OhioT1DM 和上海市第六人民医院数据集的评估结果,本文提出的方法具有更好的泛化性能和临床可接受性。
{"title":"A new multivariate blood glucose prediction method with hybrid feature clustering and online transfer learning.","authors":"Fuqiang You, Guo Zhao, Xinyu Zhang, Ziheng Zhang, Jinli Cao, Hongru Li","doi":"10.1007/s13755-024-00313-7","DOIUrl":"10.1007/s13755-024-00313-7","url":null,"abstract":"<p><p>Accurate blood glucose (BG) prediction is greatly benefit for the treatment of diabetes. Generally, clinical physicians are required to comprehensively analyze various factors, such as patient's body temperature, meal, sleep, insulin injection, continuous glucose monitoring (CGM), and other information, to evaluate the fluctuation trend of blood glucose. To address this problem, this paper proposes a multivariate blood glucose prediction method based on mixed feature clustering. It clusters time series data with diverse or mixed features related to blood glucose, effectively leveraging correlations and distribution characteristics. By combining incremental clustering of multivariate time series with transfer learning, this method achieves online prediction of blood glucose levels. The experimental results indicate that the proposed method can decrease the prediction error RMSE by 4.2% (PH=30min) and 5.9% (PH=60min). Compared with other prediction methods, the training time of the multivariate prediction method is reduced by 5.2% (PH=30min) and 4.7% (PH=60min). It was also validated and compared with other methods in a real dataset. The proposed method in this study has lower prediction error and better prediction performance in the prediction horizon (PH) of PH=30, 45, 60, 75, and 90 min, respectively. Compared with the traditional unitary and multivariate time series prediction method, the approach proposed in this paper significantly improves the accuracy and robustness of blood glucose prediction. According to the evaluation results on the data set from OhioT1DM and the Sixth People's Hospital of Shanghai, the proposed method has better generalization performance and clinical acceptability.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"57"},"PeriodicalIF":3.4,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11570574/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Memetic ant colony optimization for multi-constrained cognitive diagnostic test construction. 用于构建多约束认知诊断测试的蚁群优化记忆法
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2024-11-16 eCollection Date: 2024-12-01 DOI: 10.1007/s13755-024-00314-6
Xi Cao, Yong-Feng Ge, Kate Wang, Ying Lin

Purpose: Cognitive diagnostic tests (CDTs) assess cognitive skills at a more granular level, providing detailed insights into the mastery profile of test-takers. Traditional algorithms for constructing CDTs have partially addressed these challenges, focusing on a limited number of constraints. This paper intends to utilize a meta-heuristic algorithm to produce high-quality tests and handle more constraints simultaneously.

Methods: This paper presents a memetic ant colony optimization (MACO) algorithm for constructing CDTs while considering multiple constraints. The MACO method utilizes pheromone trails to represent successful test constructions from the past. Additionally, it innovatively integrates item quality and constraint adherence into heuristic information to manage multiple constraints simultaneously. The method evaluates the assembled tests based on the diagnosis index and constraint satisfaction. Another innovation of MACO is the incorporation of a local search strategy to further enhance diagnostic accuracy by partially optimizing item selection. The optimal local search parameter settings are explored through a parameter investigation. A series of simulation experiments validate the effectiveness of MACO under various conditions.

Results: The results demonstrate the great ability of meta-heuristic algorithms to handle multiple constraints and achieve high statistical performance. MACO exhibited superior performance in generating high-quality CDTs while meeting multiple constraints, particularly for mixed and low discrimination item banks. It achieved faster convergence than the ant colony optimization in most scenarios.

Conclusions: MACO provides an effective solution for multi-constrained CDT construction, especially for shorter tests and item banks with mixed or lower discrimination. The experimental results also suggest that the suitability of different optimization approaches may depend on specific test conditions, such as the characteristics of the item bank and the length of the test.

目的:认知诊断测试(CDTs)从更细的层面评估认知技能,详细了解测试者的掌握情况。用于构建 CDT 的传统算法部分地解决了这些难题,但只关注了有限的几个约束条件。本文打算利用元启发式算法来生成高质量的测试,并同时处理更多的约束条件:本文提出了一种记忆蚁群优化(MACO)算法,用于构建 CDT,同时考虑多个约束条件。MACO 方法利用信息素轨迹来表示过去成功的测试构建。此外,它还创新性地将项目质量和约束遵守情况整合到启发式信息中,以同时管理多个约束条件。该方法根据诊断指数和约束满意度来评估组合测试。MACO 的另一项创新是采用局部搜索策略,通过部分优化项目选择来进一步提高诊断准确性。通过参数调查探索了最佳局部搜索参数设置。一系列模拟实验验证了 MACO 在各种条件下的有效性:结果表明,元启发式算法在处理多重约束条件和实现高统计性能方面具有很强的能力。MACO 在生成高质量 CDT 的同时满足多个约束条件方面表现出了卓越的性能,尤其是在混合和低区分度项目库中。在大多数情况下,它比蚁群优化收敛更快:结论:MACO 为多约束 CDT 的构建提供了一个有效的解决方案,尤其适用于较短的测试和具有混合或较低区分度的项目库。实验结果还表明,不同优化方法的适用性可能取决于具体的测试条件,如题目库的特点和测试的长度。
{"title":"Memetic ant colony optimization for multi-constrained cognitive diagnostic test construction.","authors":"Xi Cao, Yong-Feng Ge, Kate Wang, Ying Lin","doi":"10.1007/s13755-024-00314-6","DOIUrl":"10.1007/s13755-024-00314-6","url":null,"abstract":"<p><strong>Purpose: </strong>Cognitive diagnostic tests (CDTs) assess cognitive skills at a more granular level, providing detailed insights into the mastery profile of test-takers. Traditional algorithms for constructing CDTs have partially addressed these challenges, focusing on a limited number of constraints. This paper intends to utilize a meta-heuristic algorithm to produce high-quality tests and handle more constraints simultaneously.</p><p><strong>Methods: </strong>This paper presents a memetic ant colony optimization (MACO) algorithm for constructing CDTs while considering multiple constraints. The MACO method utilizes pheromone trails to represent successful test constructions from the past. Additionally, it innovatively integrates item quality and constraint adherence into heuristic information to manage multiple constraints simultaneously. The method evaluates the assembled tests based on the diagnosis index and constraint satisfaction. Another innovation of MACO is the incorporation of a local search strategy to further enhance diagnostic accuracy by partially optimizing item selection. The optimal local search parameter settings are explored through a parameter investigation. A series of simulation experiments validate the effectiveness of MACO under various conditions.</p><p><strong>Results: </strong>The results demonstrate the great ability of meta-heuristic algorithms to handle multiple constraints and achieve high statistical performance. MACO exhibited superior performance in generating high-quality CDTs while meeting multiple constraints, particularly for mixed and low discrimination item banks. It achieved faster convergence than the ant colony optimization in most scenarios.</p><p><strong>Conclusions: </strong>MACO provides an effective solution for multi-constrained CDT construction, especially for shorter tests and item banks with mixed or lower discrimination. The experimental results also suggest that the suitability of different optimization approaches may depend on specific test conditions, such as the characteristics of the item bank and the length of the test.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"56"},"PeriodicalIF":3.4,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569084/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting fMRI images from video sequences: linear model analysis. 从视频序列预测 fMRI 图像:线性模型分析。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2024-11-15 eCollection Date: 2024-12-01 DOI: 10.1007/s13755-024-00315-5
Daniil Dorin, Nikita Kiselev, Andrey Grabovoy, Vadim Strijov

Over the past few decades, a variety of significant scientific breakthroughs have been achieved in the fields of brain encoding and decoding using the functional magnetic resonance imaging (fMRI). Many studies have been conducted on the topic of human brain reaction to visual stimuli. However, the relationship between fMRI images and video sequences viewed by humans remains complex and is often studied using large transformer models. In this paper, we investigate the correlation between videos presented to participants during an experiment and the resulting fMRI images. To achieve this, we propose a method for creating a linear model that predicts changes in fMRI signals based on video sequence images. A linear model is constructed for each individual voxel in the fMRI image, assuming that the image sequence follows a Markov property. Through the comprehensive qualitative experiments, we demonstrate the relationship between the two time series. We hope that our findings contribute to a deeper understanding of the human brain's reaction to external stimuli and provide a basis for future research in this area.

在过去的几十年里,利用功能磁共振成像(fMRI)进行大脑编码和解码的领域取得了各种重大科学突破。关于人脑对视觉刺激的反应这一主题,已经开展了许多研究。然而,fMRI 图像与人类观看的视频序列之间的关系仍然很复杂,通常使用大型变压器模型进行研究。在本文中,我们将研究实验过程中呈现给参与者的视频与产生的 fMRI 图像之间的相关性。为此,我们提出了一种创建线性模型的方法,该模型可根据视频序列图像预测 fMRI 信号的变化。假定图像序列遵循马尔可夫特性,为 fMRI 图像中的每个单独体素构建线性模型。通过综合定性实验,我们证明了两个时间序列之间的关系。我们希望我们的发现有助于加深对人脑对外部刺激反应的理解,并为这一领域未来的研究提供基础。
{"title":"Forecasting fMRI images from video sequences: linear model analysis.","authors":"Daniil Dorin, Nikita Kiselev, Andrey Grabovoy, Vadim Strijov","doi":"10.1007/s13755-024-00315-5","DOIUrl":"10.1007/s13755-024-00315-5","url":null,"abstract":"<p><p>Over the past few decades, a variety of significant scientific breakthroughs have been achieved in the fields of brain encoding and decoding using the functional magnetic resonance imaging (fMRI). Many studies have been conducted on the topic of human brain reaction to visual stimuli. However, the relationship between fMRI images and video sequences viewed by humans remains complex and is often studied using large transformer models. In this paper, we investigate the correlation between videos presented to participants during an experiment and the resulting fMRI images. To achieve this, we propose a method for creating a linear model that predicts changes in fMRI signals based on video sequence images. A linear model is constructed for each individual voxel in the fMRI image, assuming that the image sequence follows a Markov property. Through the comprehensive qualitative experiments, we demonstrate the relationship between the two time series. We hope that our findings contribute to a deeper understanding of the human brain's reaction to external stimuli and provide a basis for future research in this area.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"55"},"PeriodicalIF":3.4,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568086/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142648946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KSDKG: construction and application of knowledge graph for kidney stone disease based on biomedical literature and public databases. KSDKG:基于生物医学文献和公共数据库的肾结石病知识图谱的构建与应用。
IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2024-11-14 eCollection Date: 2024-12-01 DOI: 10.1007/s13755-024-00309-3
Jianping Man, Yufei Shi, Zhensheng Hu, Rui Yang, Zhisheng Huang, Yi Zhou

Purpose: Kidney stone disease (KSD) is a common urological disorder with an increasing incidence worldwide. The extensive knowledge about KSD is dispersed across multiple databases, challenging the visualization and representation of its hierarchy and connections. This paper aims at constructing a disease-specific knowledge graph for KSD to enhance the effective utilization of knowledge by medical professionals and promote clinical research and discovery.

Methods: Text parsing and semantic analysis were conducted on literature related to KSD from PubMed, with concept annotation based on biomedical ontology being utilized to generate semantic data in RDF format. Moreover, public databases were integrated to construct a large-scale knowledge graph for KSD. Additionally, case studies were carried out to demonstrate the practical utility of the developed knowledge graph.

Results: We proposed and implemented a Kidney Stone Disease Knowledge Graph (KSDKG), covering more than 90 million triples. This graph comprised semantic data extracted from 29,174 articles, integrating available data from UMLS, SNOMED CT, MeSH, DrugBank and Microbe-Disease Knowledge Graph. Through the application of three cases, we retrieved and discovered information on microbes, drugs and diseases associated with KSD. The results illustrated that the KSDKG can integrate diverse medical knowledge and provide new clinical insights for identifying the underlying mechanisms of KSD.

Conclusion: The KSDKG efficiently utilizes knowledge graph to reveal hidden knowledge associations, facilitating semantic search and response. As a blueprint for developing disease-specific knowledge graphs, it offers valuable contributions to medical research.

目的:肾结石病(KSD)是一种常见的泌尿系统疾病,在全球的发病率不断上升。有关 KSD 的大量知识分散在多个数据库中,对其层次和联系的可视化和表示提出了挑战。本文旨在构建针对 KSD 的特定疾病知识图谱,以提高医疗专业人员对知识的有效利用,促进临床研究和发现:方法:对PubMed上与KSD相关的文献进行文本解析和语义分析,并利用基于生物医学本体论的概念注释生成RDF格式的语义数据。此外,还整合了公共数据库,以构建大规模的 KSD 知识图谱。此外,我们还进行了案例研究,以展示所开发的知识图谱的实用性:我们提出并实现了肾结石疾病知识图谱(KSDKG),涵盖了9000多万个三元组。该图由从 29174 篇文章中提取的语义数据组成,整合了来自 UMLS、SNOMED CT、MeSH、DrugBank 和微生物-疾病知识图谱的可用数据。通过三个案例的应用,我们检索并发现了与 KSD 相关的微生物、药物和疾病信息。结果表明,KSDKG 可以整合各种医学知识,为确定 KSD 的潜在机制提供新的临床见解:结论:KSDKG 能有效利用知识图谱揭示隐藏的知识关联,促进语义搜索和响应。作为开发特定疾病知识图谱的蓝图,它为医学研究做出了宝贵的贡献。
{"title":"KSDKG: construction and application of knowledge graph for kidney stone disease based on biomedical literature and public databases.","authors":"Jianping Man, Yufei Shi, Zhensheng Hu, Rui Yang, Zhisheng Huang, Yi Zhou","doi":"10.1007/s13755-024-00309-3","DOIUrl":"10.1007/s13755-024-00309-3","url":null,"abstract":"<p><strong>Purpose: </strong>Kidney stone disease (KSD) is a common urological disorder with an increasing incidence worldwide. The extensive knowledge about KSD is dispersed across multiple databases, challenging the visualization and representation of its hierarchy and connections. This paper aims at constructing a disease-specific knowledge graph for KSD to enhance the effective utilization of knowledge by medical professionals and promote clinical research and discovery.</p><p><strong>Methods: </strong>Text parsing and semantic analysis were conducted on literature related to KSD from PubMed, with concept annotation based on biomedical ontology being utilized to generate semantic data in RDF format. Moreover, public databases were integrated to construct a large-scale knowledge graph for KSD. Additionally, case studies were carried out to demonstrate the practical utility of the developed knowledge graph.</p><p><strong>Results: </strong>We proposed and implemented a Kidney Stone Disease Knowledge Graph (KSDKG), covering more than 90 million triples. This graph comprised semantic data extracted from 29,174 articles, integrating available data from UMLS, SNOMED CT, MeSH, DrugBank and Microbe-Disease Knowledge Graph. Through the application of three cases, we retrieved and discovered information on microbes, drugs and diseases associated with KSD. The results illustrated that the KSDKG can integrate diverse medical knowledge and provide new clinical insights for identifying the underlying mechanisms of KSD.</p><p><strong>Conclusion: </strong>The KSDKG efficiently utilizes knowledge graph to reveal hidden knowledge associations, facilitating semantic search and response. As a blueprint for developing disease-specific knowledge graphs, it offers valuable contributions to medical research.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"54"},"PeriodicalIF":3.4,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11564440/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142648856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Health Information Science and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1