Pub Date : 2025-10-28eCollection Date: 2025-12-01DOI: 10.1007/s13755-025-00388-w
Dachuan Song, Li Shen, Duy Duong-Tran, Xuan Wang
Purpose: Recently, there has been a revived interest in system neuroscience causation models, driven by their unique capability to unravel complex relationships in multi-scale brain networks. In this paper, we present a novel method that leverages causal dynamics to achieve effective fMRI-based subject and task fingerprinting.
Methods: By applying an implicit-explicit discretization scheme, we develop a two-timescale linear state-space model. Through data-driven identification of its parameters, the model captures causal signatures, including directed interactions among brain regions from a spatial perspective, and disentangled fast and slow dynamic modes of brain activity from a temporal perspective. These causal signatures are then integrated with: (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification. Furthermore, we introduce the concept of the brain reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks.
Results: We evaluate the proposed approach using the Human Connectome Project dataset and demonstrate its advantage over non-causality-based methods. The obtained causal signatures are visualized and demonstrate clear biological relevance with established understandings of brain function.
Conclusion: We verified the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Additionally, our work paves the way for further studies on causal fingerprints with potential applications in both healthy controls and neurodegenerative diseases.
{"title":"Reconstructing brain causal dynamics for subject and task fingerprints using fMRI time-series data.","authors":"Dachuan Song, Li Shen, Duy Duong-Tran, Xuan Wang","doi":"10.1007/s13755-025-00388-w","DOIUrl":"10.1007/s13755-025-00388-w","url":null,"abstract":"<p><strong>Purpose: </strong>Recently, there has been a revived interest in system neuroscience causation models, driven by their unique capability to unravel complex relationships in multi-scale brain networks. In this paper, we present a novel method that leverages causal dynamics to achieve effective fMRI-based subject and task fingerprinting.</p><p><strong>Methods: </strong>By applying an implicit-explicit discretization scheme, we develop a two-timescale linear state-space model. Through data-driven identification of its parameters, the model captures causal signatures, including directed interactions among brain regions from a spatial perspective, and disentangled fast and slow dynamic modes of brain activity from a temporal perspective. These causal signatures are then integrated with: (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification. Furthermore, we introduce the concept of the brain reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks.</p><p><strong>Results: </strong>We evaluate the proposed approach using the Human Connectome Project dataset and demonstrate its advantage over non-causality-based methods. The obtained causal signatures are visualized and demonstrate clear biological relevance with established understandings of brain function.</p><p><strong>Conclusion: </strong>We verified the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Additionally, our work paves the way for further studies on causal fingerprints with potential applications in both healthy controls and neurodegenerative diseases.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"70"},"PeriodicalIF":3.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569264/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145410337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-19eCollection Date: 2025-12-01DOI: 10.1007/s13755-025-00386-y
Toni Lozano-Bagén, Eloy Martinez-Heras, Giuseppe Pontillo, Elisabeth Solana, Francesc Vivó, Maria Petracca, Alberto Calvi, Sandra Garrido-Romero, Albert Solé-Ribalta, Sara Llufriu, Ferran Prados, Jordi Casas-Roma
Brain networks, or graphs, derived from magnetic resonance imaging (MRI) offer a powerful framework for representing the structural, morphological, and functional organization of the brain. Graph-theoretical metrics have been widely employed to characterize properties such as efficiency, integration, and communication within these networks. More recently, topological data analysis techniques, such as persistent homology and Betti curves, have emerged as complementary approaches for capturing higher-order network patterns. In this study, we present a comparative analysis of these feature-generation methodologies in the context of neurodegenerative disease. Specifically, we evaluate the effectiveness of Betti curves and graph-theoretical metrics in extracting features for distinguishing people with multiple sclerosis (PwMS) from healthy volunteers (HV). Features are derived from structural connectivity, morphological gray matter, and resting-state functional networks, using both single layer and multilayer graph architectures. Our experiments, conducted on a cohort of PwMS and HV, demonstrate that features extracted using Betti curves generally outperform those based on graph-theoretical metrics. Furthermore, we show that multimodal data in terms of feature concatenation and multilayer graph architectures provide a more comprehensive representation of alterations in complex brain mechanisms associated with MS, leading to improved classification performance. These findings highlight the potential of topological features and multimodal integration for enhancing the understanding and diagnosis of neurodegenerative disorders.
{"title":"Evaluating topological and graph-theoretical approaches to extract complex multimodal brain connectivity patterns in multiple sclerosis.","authors":"Toni Lozano-Bagén, Eloy Martinez-Heras, Giuseppe Pontillo, Elisabeth Solana, Francesc Vivó, Maria Petracca, Alberto Calvi, Sandra Garrido-Romero, Albert Solé-Ribalta, Sara Llufriu, Ferran Prados, Jordi Casas-Roma","doi":"10.1007/s13755-025-00386-y","DOIUrl":"10.1007/s13755-025-00386-y","url":null,"abstract":"<p><p>Brain networks, or graphs, derived from magnetic resonance imaging (MRI) offer a powerful framework for representing the structural, morphological, and functional organization of the brain. Graph-theoretical metrics have been widely employed to characterize properties such as efficiency, integration, and communication within these networks. More recently, topological data analysis techniques, such as persistent homology and Betti curves, have emerged as complementary approaches for capturing higher-order network patterns. In this study, we present a comparative analysis of these feature-generation methodologies in the context of neurodegenerative disease. Specifically, we evaluate the effectiveness of Betti curves and graph-theoretical metrics in extracting features for distinguishing people with multiple sclerosis (PwMS) from healthy volunteers (HV). Features are derived from structural connectivity, morphological gray matter, and resting-state functional networks, using both single layer and multilayer graph architectures. Our experiments, conducted on a cohort of PwMS and HV, demonstrate that features extracted using Betti curves generally outperform those based on graph-theoretical metrics. Furthermore, we show that multimodal data in terms of feature concatenation and multilayer graph architectures provide a more comprehensive representation of alterations in complex brain mechanisms associated with MS, leading to improved classification performance. These findings highlight the potential of topological features and multimodal integration for enhancing the understanding and diagnosis of neurodegenerative disorders.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"68"},"PeriodicalIF":3.4,"publicationDate":"2025-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12535957/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145349210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-14eCollection Date: 2025-12-01DOI: 10.1007/s13755-025-00384-0
Ziyang Song, Qincheng Lu, Hao Xu, Ziqi Yang, He Zhu, David Buckeridge, Yue Li
Purpose: Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success in Natural Language Processing and Computer Vision domains. However, the development of PTMs on healthcare time-series data is lagging behind. This underscores the limitations of the existing transformer-based architectures, particularly their scalability to handle large-scale time series and ability to capture long-term temporal dependencies.
Methods: In this study, we present Timely Generative Pre-trained Transformer (TimelyGPT). TimelyGPT employs an extrapolatable position (xPos) embedding to encode trend and periodic patterns into time-series representations. It also integrates recurrent attention and temporal convolution modules to effectively capture global-local temporal dependencies.
Results: Our experiments show that TimelyGPT excels in modeling continuously monitored biosignals and irregularly-sampled time series data commonly observed in longitudinal electronic health records (EHRs). In forecasting continuous biosignals, TimelyGPT achieves accurate extrapolation up to 6000 timesteps of body temperature during the sleep stage transition given a short look-up window (i.e., prompt) containing only 2000 timesteps. For irregularly-sampled time series, TimelyGPT with a proposed time-specific inference demonstrates high top recall scores in predicting future diagnoses using early diagnostic records, effectively handling irregular intervals between clinical records. We further demonstrate that TimelyGPT achieves strong discriminative performance on both continuous and irregularly-sampled time series.
Conclusion: Together, we envision TimelyGPT to be useful in various health domains, including long-term patient health state forecasting, patient risk trajectory prediction, and disease classification. Its code is available at Github.
{"title":"Timelygpt: extrapolatable transformer pre-training for long-term time-series forecasting in healthcare.","authors":"Ziyang Song, Qincheng Lu, Hao Xu, Ziqi Yang, He Zhu, David Buckeridge, Yue Li","doi":"10.1007/s13755-025-00384-0","DOIUrl":"10.1007/s13755-025-00384-0","url":null,"abstract":"<p><strong>Purpose: </strong>Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success in Natural Language Processing and Computer Vision domains. However, the development of PTMs on healthcare time-series data is lagging behind. This underscores the limitations of the existing transformer-based architectures, particularly their scalability to handle large-scale time series and ability to capture long-term temporal dependencies.</p><p><strong>Methods: </strong>In this study, we present Timely Generative Pre-trained Transformer (TimelyGPT). TimelyGPT employs an extrapolatable position (xPos) embedding to encode trend and periodic patterns into time-series representations. It also integrates recurrent attention and temporal convolution modules to effectively capture global-local temporal dependencies.</p><p><strong>Results: </strong>Our experiments show that TimelyGPT excels in modeling continuously monitored biosignals and irregularly-sampled time series data commonly observed in longitudinal electronic health records (EHRs). In forecasting continuous biosignals, TimelyGPT achieves accurate extrapolation up to 6000 timesteps of body temperature during the sleep stage transition given a short look-up window (i.e., prompt) containing only 2000 timesteps. For irregularly-sampled time series, TimelyGPT with a proposed time-specific inference demonstrates high top recall scores in predicting future diagnoses using early diagnostic records, effectively handling irregular intervals between clinical records. We further demonstrate that TimelyGPT achieves strong discriminative performance on both continuous and irregularly-sampled time series.</p><p><strong>Conclusion: </strong>Together, we envision TimelyGPT to be useful in various health domains, including long-term patient health state forecasting, patient risk trajectory prediction, and disease classification. Its code is available at Github.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"64"},"PeriodicalIF":3.4,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12521703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145309483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-09-24eCollection Date: 2025-12-01DOI: 10.1007/s13755-025-00375-1
Brooke Scardino, Akshat Agrawal, Diensn G Xing, Jackson L St Pierre, Md Mostafizur Rahman Bhuiyan, Kanon Kamronnaher, Md Shenuarin Bhuiyan, Oren Rom, Steven A Conrad, John A Vanchiere, A Wayne Orr, Christopher G Kevil, Mohammad Alfrad Nobel Bhuiyan
Background: Metabolic syndrome (MetS), which affects one-third of the population of the United States, is a risk factor for chronic diseases such as cardiovascular diseases, stroke, and type 2 diabetes mellitus. Heavy metals (HM) and volatile organic compounds (VOC) are environmental factors typically occurring as mixtures. Although exposures to these substances have been studied separately, the impact of combined HM and VOC exposure on humans and their subsequent risk of developing MetS has not been explored. This study investigates whether combined exposure to HMs and VOCs affects the risk of developing MetS.
Methods: The National Health and Nutrition Examination Survey database from 2011 to 2020 was used to determine exposure to HMs and VOCs detected in urine samples from individuals with MetS. Multiple Chi-squared and t-tests were performed to identify variables significantly associated with MetS. Logistic regression analysis was performed on unmatched and age-matched 1:1 case-control data to evaluate whether an association exists among HMs, VOCs, and demographic factors and MetS. A hierarchical cluster analysis was performed to identify combinations of HMs and VOCs linked with an increased risk of MetS.
Results: Logistic regression analysis on unmatched and matched data showed that increasing age and female sex were significantly associated (p < 0.05) with MetS. Among the HMs and VOCs, only N-acetyl-S-(2-cyanoethyl)-l-cysteine and N-acetyl-S-(2-hydroxyethyl)-l-cysteine were found to be significantly associated with MetS. Cluster analysis showed that Cluster 3 was significantly associated with MetS (p < 0.05; OR = 1.49), suggesting that exposure to barium, cadmium, cesium, lead, and VOCs may increase the risk of MetS. After adjusting for covariates, none of the clusters showed a significant association (p > 0.05). In contrast, age (OR = 1.07) and monthly poverty level index ≤ 1.3 (OR = 1.16) were significantly associated with MetS (p < 0.05).
Conclusion: This study revealed that age, lower socioeconomic status, and multiple exposures to combined HM and VOC may have a greater impact with an increased risk of MetS. Cluster analysis highlighted the potential combination of the exposures linked to MetS and the likelihood that demographic factors affect MetS more than exposure to HMs and VOCs. However, further research is needed.
Supplementary information: The online version contains supplementary material available at 10.1007/s13755-025-00375-1.
背景:代谢综合征(MetS)影响着美国三分之一的人口,是心血管疾病、中风和2型糖尿病等慢性疾病的危险因素。重金属(HM)和挥发性有机化合物(VOC)是通常以混合物形式出现的环境因素。虽然对这些物质的暴露已经分别进行了研究,但HM和VOC联合暴露对人类的影响及其随后发生MetS的风险尚未得到探讨。本研究调查了混合暴露于有机污染物和挥发性有机化合物是否会影响患MetS的风险。方法:使用2011年至2020年国家健康与营养检查调查数据库,确定MetS患者尿液样本中检测到的HMs和VOCs暴露情况。进行多重卡方检验和t检验以确定与MetS显著相关的变量。对未匹配和年龄匹配的1:1病例对照数据进行Logistic回归分析,以评估HMs、VOCs、人口因素和MetS之间是否存在关联。进行了分层聚类分析,以确定与MetS风险增加相关的HMs和VOCs组合。结果:对未匹配和匹配的数据进行Logistic回归分析,年龄的增加与女性的性别显著相关(p p p > 0.05)。年龄(OR = 1.07)和月贫困水平指数(OR = 1.16)与MetS有显著相关性(p)。结论:年龄、较低的社会经济地位和多次暴露于HM和VOC联合暴露可能对MetS的风险增加有更大的影响。聚类分析强调了与MetS相关的暴露的潜在组合,以及人口因素比暴露于HMs和VOCs更可能影响MetS。然而,还需要进一步的研究。补充信息:在线版本包含补充资料,提供地址为10.1007/s13755-025-00375-1。
{"title":"Clustering environmental pollutants associated with increased risk of metabolic disease: a hierarchical analysis.","authors":"Brooke Scardino, Akshat Agrawal, Diensn G Xing, Jackson L St Pierre, Md Mostafizur Rahman Bhuiyan, Kanon Kamronnaher, Md Shenuarin Bhuiyan, Oren Rom, Steven A Conrad, John A Vanchiere, A Wayne Orr, Christopher G Kevil, Mohammad Alfrad Nobel Bhuiyan","doi":"10.1007/s13755-025-00375-1","DOIUrl":"10.1007/s13755-025-00375-1","url":null,"abstract":"<p><strong>Background: </strong>Metabolic syndrome (MetS), which affects one-third of the population of the United States, is a risk factor for chronic diseases such as cardiovascular diseases, stroke, and type 2 diabetes mellitus. Heavy metals (HM) and volatile organic compounds (VOC) are environmental factors typically occurring as mixtures. Although exposures to these substances have been studied separately, the impact of combined HM and VOC exposure on humans and their subsequent risk of developing MetS has not been explored. This study investigates whether combined exposure to HMs and VOCs affects the risk of developing MetS.</p><p><strong>Methods: </strong>The National Health and Nutrition Examination Survey database from 2011 to 2020 was used to determine exposure to HMs and VOCs detected in urine samples from individuals with MetS. Multiple Chi-squared and t-tests were performed to identify variables significantly associated with MetS. Logistic regression analysis was performed on unmatched and age-matched 1:1 case-control data to evaluate whether an association exists among HMs, VOCs, and demographic factors and MetS. A hierarchical cluster analysis was performed to identify combinations of HMs and VOCs linked with an increased risk of MetS.</p><p><strong>Results: </strong>Logistic regression analysis on unmatched and matched data showed that increasing age and female sex were significantly associated (<i>p</i> < 0.05) with MetS. Among the HMs and VOCs, only N-acetyl-S-(2-cyanoethyl)-l-cysteine and N-acetyl-S-(2-hydroxyethyl)-l-cysteine were found to be significantly associated with MetS. Cluster analysis showed that Cluster 3 was significantly associated with MetS (<i>p</i> < 0.05; OR = 1.49), suggesting that exposure to barium, cadmium, cesium, lead, and VOCs may increase the risk of MetS. After adjusting for covariates, none of the clusters showed a significant association (<i>p</i> > 0.05). In contrast, age (OR = 1.07) and monthly poverty level index ≤ 1.3 (OR = 1.16) were significantly associated with MetS (<i>p</i> < 0.05).</p><p><strong>Conclusion: </strong>This study revealed that age, lower socioeconomic status, and multiple exposures to combined HM and VOC may have a greater impact with an increased risk of MetS. Cluster analysis highlighted the potential combination of the exposures linked to MetS and the likelihood that demographic factors affect MetS more than exposure to HMs and VOCs. However, further research is needed.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13755-025-00375-1.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"59"},"PeriodicalIF":3.4,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12460862/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-29eCollection Date: 2025-12-01DOI: 10.1007/s13755-025-00369-z
Jamie Canderan, Moses Stamboulian, Yuzhen Ye
The gut microbiome plays a fundamental role in human health and disease. Individual variations in the microbiome and the corresponding functional implications are key considerations to enhance precision health and medicine. Metaproteomics has recently revealed protein expression that might be associated with human health and disease. Existing studies focused on either human proteins or bacterial proteins that can be identified from (meta)proteomics data sets, but not both. In this study, we examined the feasibility of identifying both human and bacterial proteins that are differentially expressed between healthy and diseased individuals from metaproteomics data sets. We further evaluated different strategies of using identified peptides and proteins for building predictive models. By leveraging existing metaproteomics data sets and a tool that we have developed for metaproteomics data analysis (MetaProD), we were able to derive both human and bacterial differentially expressed proteins that could serve as potential biomarkers for all diseases we studied. We also built predictive models using identified peptides and proteins as features for prediction of human diseases. Our results showed peptide-based identifications over protein-based ones often produce the most accurate models and that feature selection can offer improvements. Prediction accuracy could be further improved, in some cases, by including bacterial identifications, but missing data in bacterial identifications remains problematic.
{"title":"Identification and applications of disease-associated differential human and bacterial proteins with metaproteomic evidence.","authors":"Jamie Canderan, Moses Stamboulian, Yuzhen Ye","doi":"10.1007/s13755-025-00369-z","DOIUrl":"10.1007/s13755-025-00369-z","url":null,"abstract":"<p><p>The gut microbiome plays a fundamental role in human health and disease. Individual variations in the microbiome and the corresponding functional implications are key considerations to enhance precision health and medicine. Metaproteomics has recently revealed protein expression that might be associated with human health and disease. Existing studies focused on either human proteins or bacterial proteins that can be identified from (meta)proteomics data sets, but not both. In this study, we examined the feasibility of identifying both human and bacterial proteins that are differentially expressed between healthy and diseased individuals from metaproteomics data sets. We further evaluated different strategies of using identified peptides and proteins for building predictive models. By leveraging existing metaproteomics data sets and a tool that we have developed for metaproteomics data analysis (MetaProD), we were able to derive both human and bacterial differentially expressed proteins that could serve as potential biomarkers for all diseases we studied. We also built predictive models using identified peptides and proteins as features for prediction of human diseases. Our results showed peptide-based identifications over protein-based ones often produce the most accurate models and that feature selection can offer improvements. Prediction accuracy could be further improved, in some cases, by including bacterial identifications, but missing data in bacterial identifications remains problematic.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"54"},"PeriodicalIF":3.4,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12397042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144973812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Accurate blood glucose (BG) prediction is greatly benefit for the treatment of diabetes. Generally, clinical physicians are required to comprehensively analyze various factors, such as patient's body temperature, meal, sleep, insulin injection, continuous glucose monitoring (CGM), and other information, to evaluate the fluctuation trend of blood glucose. To address this problem, this paper proposes a multivariate blood glucose prediction method based on mixed feature clustering. It clusters time series data with diverse or mixed features related to blood glucose, effectively leveraging correlations and distribution characteristics. By combining incremental clustering of multivariate time series with transfer learning, this method achieves online prediction of blood glucose levels. The experimental results indicate that the proposed method can decrease the prediction error RMSE by 4.2% (PH=30min) and 5.9% (PH=60min). Compared with other prediction methods, the training time of the multivariate prediction method is reduced by 5.2% (PH=30min) and 4.7% (PH=60min). It was also validated and compared with other methods in a real dataset. The proposed method in this study has lower prediction error and better prediction performance in the prediction horizon (PH) of PH=30, 45, 60, 75, and 90 min, respectively. Compared with the traditional unitary and multivariate time series prediction method, the approach proposed in this paper significantly improves the accuracy and robustness of blood glucose prediction. According to the evaluation results on the data set from OhioT1DM and the Sixth People's Hospital of Shanghai, the proposed method has better generalization performance and clinical acceptability.
{"title":"A new multivariate blood glucose prediction method with hybrid feature clustering and online transfer learning.","authors":"Fuqiang You, Guo Zhao, Xinyu Zhang, Ziheng Zhang, Jinli Cao, Hongru Li","doi":"10.1007/s13755-024-00313-7","DOIUrl":"10.1007/s13755-024-00313-7","url":null,"abstract":"<p><p>Accurate blood glucose (BG) prediction is greatly benefit for the treatment of diabetes. Generally, clinical physicians are required to comprehensively analyze various factors, such as patient's body temperature, meal, sleep, insulin injection, continuous glucose monitoring (CGM), and other information, to evaluate the fluctuation trend of blood glucose. To address this problem, this paper proposes a multivariate blood glucose prediction method based on mixed feature clustering. It clusters time series data with diverse or mixed features related to blood glucose, effectively leveraging correlations and distribution characteristics. By combining incremental clustering of multivariate time series with transfer learning, this method achieves online prediction of blood glucose levels. The experimental results indicate that the proposed method can decrease the prediction error RMSE by 4.2% (PH=30min) and 5.9% (PH=60min). Compared with other prediction methods, the training time of the multivariate prediction method is reduced by 5.2% (PH=30min) and 4.7% (PH=60min). It was also validated and compared with other methods in a real dataset. The proposed method in this study has lower prediction error and better prediction performance in the prediction horizon (PH) of PH=30, 45, 60, 75, and 90 min, respectively. Compared with the traditional unitary and multivariate time series prediction method, the approach proposed in this paper significantly improves the accuracy and robustness of blood glucose prediction. According to the evaluation results on the data set from OhioT1DM and the Sixth People's Hospital of Shanghai, the proposed method has better generalization performance and clinical acceptability.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"57"},"PeriodicalIF":3.4,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11570574/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-16eCollection Date: 2024-12-01DOI: 10.1007/s13755-024-00314-6
Xi Cao, Yong-Feng Ge, Kate Wang, Ying Lin
Purpose: Cognitive diagnostic tests (CDTs) assess cognitive skills at a more granular level, providing detailed insights into the mastery profile of test-takers. Traditional algorithms for constructing CDTs have partially addressed these challenges, focusing on a limited number of constraints. This paper intends to utilize a meta-heuristic algorithm to produce high-quality tests and handle more constraints simultaneously.
Methods: This paper presents a memetic ant colony optimization (MACO) algorithm for constructing CDTs while considering multiple constraints. The MACO method utilizes pheromone trails to represent successful test constructions from the past. Additionally, it innovatively integrates item quality and constraint adherence into heuristic information to manage multiple constraints simultaneously. The method evaluates the assembled tests based on the diagnosis index and constraint satisfaction. Another innovation of MACO is the incorporation of a local search strategy to further enhance diagnostic accuracy by partially optimizing item selection. The optimal local search parameter settings are explored through a parameter investigation. A series of simulation experiments validate the effectiveness of MACO under various conditions.
Results: The results demonstrate the great ability of meta-heuristic algorithms to handle multiple constraints and achieve high statistical performance. MACO exhibited superior performance in generating high-quality CDTs while meeting multiple constraints, particularly for mixed and low discrimination item banks. It achieved faster convergence than the ant colony optimization in most scenarios.
Conclusions: MACO provides an effective solution for multi-constrained CDT construction, especially for shorter tests and item banks with mixed or lower discrimination. The experimental results also suggest that the suitability of different optimization approaches may depend on specific test conditions, such as the characteristics of the item bank and the length of the test.
{"title":"Memetic ant colony optimization for multi-constrained cognitive diagnostic test construction.","authors":"Xi Cao, Yong-Feng Ge, Kate Wang, Ying Lin","doi":"10.1007/s13755-024-00314-6","DOIUrl":"10.1007/s13755-024-00314-6","url":null,"abstract":"<p><strong>Purpose: </strong>Cognitive diagnostic tests (CDTs) assess cognitive skills at a more granular level, providing detailed insights into the mastery profile of test-takers. Traditional algorithms for constructing CDTs have partially addressed these challenges, focusing on a limited number of constraints. This paper intends to utilize a meta-heuristic algorithm to produce high-quality tests and handle more constraints simultaneously.</p><p><strong>Methods: </strong>This paper presents a memetic ant colony optimization (MACO) algorithm for constructing CDTs while considering multiple constraints. The MACO method utilizes pheromone trails to represent successful test constructions from the past. Additionally, it innovatively integrates item quality and constraint adherence into heuristic information to manage multiple constraints simultaneously. The method evaluates the assembled tests based on the diagnosis index and constraint satisfaction. Another innovation of MACO is the incorporation of a local search strategy to further enhance diagnostic accuracy by partially optimizing item selection. The optimal local search parameter settings are explored through a parameter investigation. A series of simulation experiments validate the effectiveness of MACO under various conditions.</p><p><strong>Results: </strong>The results demonstrate the great ability of meta-heuristic algorithms to handle multiple constraints and achieve high statistical performance. MACO exhibited superior performance in generating high-quality CDTs while meeting multiple constraints, particularly for mixed and low discrimination item banks. It achieved faster convergence than the ant colony optimization in most scenarios.</p><p><strong>Conclusions: </strong>MACO provides an effective solution for multi-constrained CDT construction, especially for shorter tests and item banks with mixed or lower discrimination. The experimental results also suggest that the suitability of different optimization approaches may depend on specific test conditions, such as the characteristics of the item bank and the length of the test.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"56"},"PeriodicalIF":3.4,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569084/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Over the past few decades, a variety of significant scientific breakthroughs have been achieved in the fields of brain encoding and decoding using the functional magnetic resonance imaging (fMRI). Many studies have been conducted on the topic of human brain reaction to visual stimuli. However, the relationship between fMRI images and video sequences viewed by humans remains complex and is often studied using large transformer models. In this paper, we investigate the correlation between videos presented to participants during an experiment and the resulting fMRI images. To achieve this, we propose a method for creating a linear model that predicts changes in fMRI signals based on video sequence images. A linear model is constructed for each individual voxel in the fMRI image, assuming that the image sequence follows a Markov property. Through the comprehensive qualitative experiments, we demonstrate the relationship between the two time series. We hope that our findings contribute to a deeper understanding of the human brain's reaction to external stimuli and provide a basis for future research in this area.
{"title":"Forecasting fMRI images from video sequences: linear model analysis.","authors":"Daniil Dorin, Nikita Kiselev, Andrey Grabovoy, Vadim Strijov","doi":"10.1007/s13755-024-00315-5","DOIUrl":"10.1007/s13755-024-00315-5","url":null,"abstract":"<p><p>Over the past few decades, a variety of significant scientific breakthroughs have been achieved in the fields of brain encoding and decoding using the functional magnetic resonance imaging (fMRI). Many studies have been conducted on the topic of human brain reaction to visual stimuli. However, the relationship between fMRI images and video sequences viewed by humans remains complex and is often studied using large transformer models. In this paper, we investigate the correlation between videos presented to participants during an experiment and the resulting fMRI images. To achieve this, we propose a method for creating a linear model that predicts changes in fMRI signals based on video sequence images. A linear model is constructed for each individual voxel in the fMRI image, assuming that the image sequence follows a Markov property. Through the comprehensive qualitative experiments, we demonstrate the relationship between the two time series. We hope that our findings contribute to a deeper understanding of the human brain's reaction to external stimuli and provide a basis for future research in this area.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"55"},"PeriodicalIF":3.4,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568086/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142648946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: Kidney stone disease (KSD) is a common urological disorder with an increasing incidence worldwide. The extensive knowledge about KSD is dispersed across multiple databases, challenging the visualization and representation of its hierarchy and connections. This paper aims at constructing a disease-specific knowledge graph for KSD to enhance the effective utilization of knowledge by medical professionals and promote clinical research and discovery.
Methods: Text parsing and semantic analysis were conducted on literature related to KSD from PubMed, with concept annotation based on biomedical ontology being utilized to generate semantic data in RDF format. Moreover, public databases were integrated to construct a large-scale knowledge graph for KSD. Additionally, case studies were carried out to demonstrate the practical utility of the developed knowledge graph.
Results: We proposed and implemented a Kidney Stone Disease Knowledge Graph (KSDKG), covering more than 90 million triples. This graph comprised semantic data extracted from 29,174 articles, integrating available data from UMLS, SNOMED CT, MeSH, DrugBank and Microbe-Disease Knowledge Graph. Through the application of three cases, we retrieved and discovered information on microbes, drugs and diseases associated with KSD. The results illustrated that the KSDKG can integrate diverse medical knowledge and provide new clinical insights for identifying the underlying mechanisms of KSD.
Conclusion: The KSDKG efficiently utilizes knowledge graph to reveal hidden knowledge associations, facilitating semantic search and response. As a blueprint for developing disease-specific knowledge graphs, it offers valuable contributions to medical research.
{"title":"KSDKG: construction and application of knowledge graph for kidney stone disease based on biomedical literature and public databases.","authors":"Jianping Man, Yufei Shi, Zhensheng Hu, Rui Yang, Zhisheng Huang, Yi Zhou","doi":"10.1007/s13755-024-00309-3","DOIUrl":"10.1007/s13755-024-00309-3","url":null,"abstract":"<p><strong>Purpose: </strong>Kidney stone disease (KSD) is a common urological disorder with an increasing incidence worldwide. The extensive knowledge about KSD is dispersed across multiple databases, challenging the visualization and representation of its hierarchy and connections. This paper aims at constructing a disease-specific knowledge graph for KSD to enhance the effective utilization of knowledge by medical professionals and promote clinical research and discovery.</p><p><strong>Methods: </strong>Text parsing and semantic analysis were conducted on literature related to KSD from PubMed, with concept annotation based on biomedical ontology being utilized to generate semantic data in RDF format. Moreover, public databases were integrated to construct a large-scale knowledge graph for KSD. Additionally, case studies were carried out to demonstrate the practical utility of the developed knowledge graph.</p><p><strong>Results: </strong>We proposed and implemented a Kidney Stone Disease Knowledge Graph (KSDKG), covering more than 90 million triples. This graph comprised semantic data extracted from 29,174 articles, integrating available data from UMLS, SNOMED CT, MeSH, DrugBank and Microbe-Disease Knowledge Graph. Through the application of three cases, we retrieved and discovered information on microbes, drugs and diseases associated with KSD. The results illustrated that the KSDKG can integrate diverse medical knowledge and provide new clinical insights for identifying the underlying mechanisms of KSD.</p><p><strong>Conclusion: </strong>The KSDKG efficiently utilizes knowledge graph to reveal hidden knowledge associations, facilitating semantic search and response. As a blueprint for developing disease-specific knowledge graphs, it offers valuable contributions to medical research.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"12 1","pages":"54"},"PeriodicalIF":3.4,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11564440/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142648856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}