首页 > 最新文献

Health Information Science and Systems最新文献

英文 中文
Knowledge and data-driven prediction of organ failure in critical care patients. 重症监护患者器官衰竭的知识和数据驱动预测。
IF 4.7 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2023-01-23 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-023-00210-5
Xinyu Ma, Meng Wang, Sihan Lin, Yuhao Zhang, Yanjian Zhang, Wen Ouyang, Xing Liu

Purpose: The early detection of organ failure mitigates the risk of post-intensive care syndrome and long-term functional impairment. The aim of this study is to predict organ failure in real-time for critical care patients based on a data-driven and knowledge-driven machine learning method (DKM) and provide explanations for the prediction by incorporating a medical knowledge graph.

Methods: The cohort of this study was a subset of the 4,386 adult Intensive Care Unit (ICU) patients from the MIMIC-III dataset collected between 2001 and 2012, and the primary outcome was the Delta Sequential Organ Failure Assessment (SOFA) score. A real-time Delta SOFA score prediction model was developed with two key components: an improved deep learning temporal convolutional network (S-TCN) and a graph-embedding feature extraction method based on a medical knowledge graph. Entities and relations related to organ failure were extracted from the Unified Medical Language System to build the medical knowledge graph, and patient data were mapped onto the graph to extract the embeddings. We measured the performance of our DKM approach with cross-validation to avoid the formation of biased assessments.

Results: An area under the receiver operating characteristic curve (AUC) of 0.973, a precision of 0.923, a NPV of 0.989, and an F1 score of 0.927 were achieved using the DKM approach, which significantly outperformed the baseline methods. Additionally, the performance remained stable following external validation on the eICU dataset, which consists of 2,816 admissions (AUC = 0.981, precision = 0.860, NPV = 0.984). Visualization of feature importance for the Delta SOFA score and their relationships on the basic clinical medical (BCM) knowledge graph provided a model explanation.

Conclusion: The use of an improved TCN model and a medical knowledge graph led to substantial improvement in prediction accuracy, providing generalizability and an independent explanation for organ failure prediction in critical care patients. These findings show the potential of incorporating prior domain knowledge into machine learning models to inform care and service planning.

Supplementary information: The online version of this article contains supplementary material available 10.1007/s13755-023-00210-5.

目的:早期发现器官衰竭可降低重症监护后综合征和长期功能损害的风险。本研究的目的是基于数据驱动和知识驱动的机器学习方法(DKM)实时预测重症监护患者的器官衰竭,并通过结合医学知识图为预测提供解释。方法:本研究的队列是2001年至2012年间收集的MIMIC-III数据集中4386名成人重症监护室(ICU)患者的子集,主要结果是德尔塔顺序器官衰竭评估(SOFA)评分。开发了一个实时Delta SOFA分数预测模型,该模型由两个关键组件组成:改进的深度学习时间卷积网络(S-TCN)和基于医学知识图的图嵌入特征提取方法。从统一医学语言系统中提取与器官衰竭相关的实体和关系以构建医学知识图,并将患者数据映射到图上以提取嵌入。我们通过交叉验证来衡量DKM方法的性能,以避免形成有偏见的评估。结果:使用DKM方法获得了0.973的受试者工作特征曲线下面积(AUC)、0.923的精度、0.989的NPV和0.927的F1分数,显著优于基线方法。此外,在eICU数据集上进行外部验证后,性能保持稳定,该数据集包括2816例入院(AUC = 0.981,精度 = 0.860,NPV = 0.984)。德尔塔SOFA评分的特征重要性及其在基础临床医学(BCM)知识图上的关系的可视化提供了模型解释。结论:使用改进的TCN模型和医学知识图显著提高了预测准确性,为重症监护患者的器官衰竭预测提供了可推广性和独立解释。这些发现显示了将先验领域知识纳入机器学习模型以为护理和服务规划提供信息的潜力。补充信息:本文的在线版本包含补充材料10.1007/s13755-023-00210-5。
{"title":"Knowledge and data-driven prediction of organ failure in critical care patients.","authors":"Xinyu Ma, Meng Wang, Sihan Lin, Yuhao Zhang, Yanjian Zhang, Wen Ouyang, Xing Liu","doi":"10.1007/s13755-023-00210-5","DOIUrl":"10.1007/s13755-023-00210-5","url":null,"abstract":"<p><strong>Purpose: </strong>The early detection of organ failure mitigates the risk of post-intensive care syndrome and long-term functional impairment. The aim of this study is to predict organ failure in real-time for critical care patients based on a data-driven and knowledge-driven machine learning method (DKM) and provide explanations for the prediction by incorporating a medical knowledge graph.</p><p><strong>Methods: </strong>The cohort of this study was a subset of the 4,386 adult Intensive Care Unit (ICU) patients from the MIMIC-III dataset collected between 2001 and 2012, and the primary outcome was the Delta Sequential Organ Failure Assessment (SOFA) score. A real-time Delta SOFA score prediction model was developed with two key components: an improved deep learning temporal convolutional network (S-TCN) and a graph-embedding feature extraction method based on a medical knowledge graph. Entities and relations related to organ failure were extracted from the Unified Medical Language System to build the medical knowledge graph, and patient data were mapped onto the graph to extract the embeddings. We measured the performance of our DKM approach with cross-validation to avoid the formation of biased assessments.</p><p><strong>Results: </strong>An area under the receiver operating characteristic curve (AUC) of 0.973, a precision of 0.923, a NPV of 0.989, and an F1 score of 0.927 were achieved using the DKM approach, which significantly outperformed the baseline methods. Additionally, the performance remained stable following external validation on the eICU dataset, which consists of 2,816 admissions (AUC = 0.981, precision = 0.860, NPV = 0.984). Visualization of feature importance for the Delta SOFA score and their relationships on the basic clinical medical (BCM) knowledge graph provided a model explanation.</p><p><strong>Conclusion: </strong>The use of an improved TCN model and a medical knowledge graph led to substantial improvement in prediction accuracy, providing generalizability and an independent explanation for organ failure prediction in critical care patients. These findings show the potential of incorporating prior domain knowledge into machine learning models to inform care and service planning.</p><p><strong>Supplementary information: </strong>The online version of this article contains supplementary material available 10.1007/s13755-023-00210-5.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"7"},"PeriodicalIF":4.7,"publicationDate":"2023-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9871106/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9182134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using nutrigenomics to guide personalized nutrition supplementation for bolstering immune system. 利用营养基因组学指导个性化营养补充,增强免疫系统。
IF 6 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2023-01-18 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-022-00208-5
Jitao Yang

Immunity refers to the ability of the human immune system to resist pathogen infection. Immune system has the basic functions of immune defense, immune self stabilization and immune surveillance. Balanced nutrition is the cornerstone of the immune system to play its immune function, and nutritional intervention is also an important means to maintain and improve immunity. Previous studies have confirmed that T cells have individual differences in recognizing viral antigens of virus infected cells, and the body's response to antigens is controlled by a variety of genetic genes, such as human leukocyte antigen (HLA) genes, immune response (Ir) genes, etc. In this paper, through immunity genetic testing, we screen out genetically susceptible people with low immunity and people with the risk of nutrient metabolism disorders; through using lifestyle questionnaire and physical examination results, we analyze people's physical condition, dietary habits, and exercise habits to evaluate people's nutrient deficiency degree. Then, combining multi-dimensional health data, we evaluate users' immune status and nutritional deficiency risk comprehensively, further, we implemented personalized nutrition intervention on the types and doses of nutritional supplements to improve immunity. We also validated the effectiveness of our personalized nutrition solution through a population-based cohort study.

免疫力是指人体免疫系统抵抗病原体感染的能力。免疫系统具有免疫防御、免疫自我稳定和免疫监视的基本功能。均衡营养是免疫系统发挥免疫功能的基石,营养干预也是维持和提高免疫力的重要手段。先前的研究证实,T细胞在识别病毒感染细胞的病毒抗原方面存在个体差异,身体对抗原的反应由多种遗传基因控制,如人类白细胞抗原(HLA)基因、免疫反应(Ir)基因等,我们筛选出免疫力低的基因易感人群和有营养代谢障碍风险的人群;通过生活方式问卷和体检结果,分析人们的身体状况、饮食习惯和运动习惯,评价人们的营养缺乏程度。然后,结合多维健康数据,全面评估用户的免疫状态和营养缺乏风险,进一步对营养补充剂的类型和剂量进行个性化营养干预,以提高免疫力。我们还通过一项基于人群的队列研究验证了个性化营养解决方案的有效性。
{"title":"Using nutrigenomics to guide personalized nutrition supplementation for bolstering immune system.","authors":"Jitao Yang","doi":"10.1007/s13755-022-00208-5","DOIUrl":"10.1007/s13755-022-00208-5","url":null,"abstract":"<p><p>Immunity refers to the ability of the human immune system to resist pathogen infection. Immune system has the basic functions of immune defense, immune self stabilization and immune surveillance. Balanced nutrition is the cornerstone of the immune system to play its immune function, and nutritional intervention is also an important means to maintain and improve immunity. Previous studies have confirmed that T cells have individual differences in recognizing viral antigens of virus infected cells, and the body's response to antigens is controlled by a variety of genetic genes, such as human leukocyte antigen (HLA) genes, immune response (Ir) genes, etc. In this paper, through immunity genetic testing, we screen out genetically susceptible people with low immunity and people with the risk of nutrient metabolism disorders; through using lifestyle questionnaire and physical examination results, we analyze people's physical condition, dietary habits, and exercise habits to evaluate people's nutrient deficiency degree. Then, combining multi-dimensional health data, we evaluate users' immune status and nutritional deficiency risk comprehensively, further, we implemented personalized nutrition intervention on the types and doses of nutritional supplements to improve immunity. We also validated the effectiveness of our personalized nutrition solution through a population-based cohort study.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"4"},"PeriodicalIF":6.0,"publicationDate":"2023-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9846659/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10554512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Meta-path guided graph attention network for explainable herb recommendation. 可解释草药推荐的元路径引导图注意网络。
IF 6 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2023-01-18 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-022-00207-6
Yuanyuan Jin, Wendi Ji, Yao Shi, Xiaoling Wang, Xiaochun Yang

Traditional Chinese Medicine (TCM) has been widely adopted in clinical practice by Eastern Asia people for thousands of years. Nowadays, TCM still plays a critical role in Chinese society and receives increasing attention worldwide. The existing herb recommenders learn the complex relations between symptoms and herbs by mining the TCM prescriptions. Given a set of symptoms, they will provide a set of herbs and explanations from the TCM theory. However, the foundation of TCM is Yinyangism (i.e. the combination of Five Phases theory with Yin-yang theory), which is very different from modern medicine philosophy. Only recommending herbs from the TCM theory aspect largely prevents TCM from modern medical treatment. As TCM and modern medicine share a common view at the molecular level, it is necessary to integrate the ancient practice of TCM and standards of modern medicine. In this paper, we explore the underlying action mechanisms of herbs from both TCM and modern medicine, and propose a Meta-path guided Graph Attention Network (MGAT) to provide the explainable herb recommendations. Technically, to translate TCM from an experience-based medicine to an evidence-based medicine system, we incorporate the pharmacology knowledge of modern Chinese medicine with the TCM knowledge. We design a meta-path guided information propagation scheme based on the extended knowledge graph, which combines information propagation and decision process. This scheme adopts meta-paths (predefined relation sequences) to guide neighbor selection in the propagation process. Furthermore, the attention mechanism is utilized in aggregation to help distinguish the salience of different paths connecting a symptom with a herb. In this way, our model can distill the long-range semantics along meta-paths and generate fine-grained explanations. We conduct extensive experiments on a public TCM dataset, demonstrating comparable performance to the state-of-the-art herb recommendation models and the strong explainability.

数千年来,中医药在东亚人民的临床实践中被广泛采用。如今,中医药仍然在中国社会中发挥着至关重要的作用,并在世界范围内受到越来越多的关注。现有的草药推荐者通过挖掘中药处方来了解症状与草药之间的复杂关系。给定一组症状,他们将从中医理论中提供一组草药和解释。然而,中医的基础是阴阳学说(即五相学说与阴阳学说的结合),这与现代医学哲学有很大的不同。仅仅从中医理论方面推荐草药,在很大程度上阻碍了中医现代医学的发展。由于中医与现代医学在分子水平上有着共同的观点,因此有必要将中医的古老实践与现代医学的标准相结合。在本文中,我们从中医和现代医学中探索了草药的潜在作用机制,并提出了一个元路径引导的图形注意力网络(MGAT)来提供可解释的草药推荐。从技术上讲,要将中医从经验医学转化为循证医学,我们需要将现代中医的药理学知识与中医知识相结合。我们设计了一种基于扩展知识图的元路径引导信息传播方案,该方案将信息传播和决策过程相结合。该方案采用元路径(预定义的关系序列)来指导传播过程中的邻居选择。此外,注意力机制被用于聚合,以帮助区分连接症状和草药的不同路径的显著性。通过这种方式,我们的模型可以沿着元路径提取长程语义,并生成细粒度的解释。我们在公共中药数据集上进行了广泛的实验,证明了与最先进的草药推荐模型相当的性能和强大的可解释性。
{"title":"Meta-path guided graph attention network for explainable herb recommendation.","authors":"Yuanyuan Jin, Wendi Ji, Yao Shi, Xiaoling Wang, Xiaochun Yang","doi":"10.1007/s13755-022-00207-6","DOIUrl":"10.1007/s13755-022-00207-6","url":null,"abstract":"<p><p>Traditional Chinese Medicine (TCM) has been widely adopted in clinical practice by Eastern Asia people for thousands of years. Nowadays, TCM still plays a critical role in Chinese society and receives increasing attention worldwide. The existing herb recommenders learn the complex relations between symptoms and herbs by mining the TCM prescriptions. Given a set of symptoms, they will provide a set of herbs and explanations from the TCM theory. However, the foundation of TCM is Yinyangism (i.e. the combination of Five Phases theory with Yin-yang theory), which is very different from modern medicine philosophy. Only recommending herbs from the TCM theory aspect largely prevents TCM from modern medical treatment. As TCM and modern medicine share a common view at the molecular level, it is necessary to integrate the ancient practice of TCM and standards of modern medicine. In this paper, we explore the underlying action mechanisms of herbs from both TCM and modern medicine, and propose a Meta-path guided Graph Attention Network (MGAT) to provide the explainable herb recommendations. Technically, to translate TCM from an experience-based medicine to an evidence-based medicine system, we incorporate the pharmacology knowledge of modern Chinese medicine with the TCM knowledge. We design a meta-path guided information propagation scheme based on the extended knowledge graph, which combines information propagation and decision process. This scheme adopts meta-paths (predefined relation sequences) to guide neighbor selection in the propagation process. Furthermore, the attention mechanism is utilized in aggregation to help distinguish the salience of different paths connecting a symptom with a herb. In this way, our model can distill the long-range semantics along meta-paths and generate fine-grained explanations. We conduct extensive experiments on a public TCM dataset, demonstrating comparable performance to the state-of-the-art herb recommendation models and the strong explainability.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"5"},"PeriodicalIF":6.0,"publicationDate":"2023-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9847457/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10554191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
MHA: a multimodal hierarchical attention model for depression detection in social media. MHA:社交媒体抑郁检测的多模态分层注意模型。
IF 6 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2023-01-18 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-022-00197-5
Zepeng Li, Zhengyi An, Wenchuan Cheng, Jiawei Zhou, Fang Zheng, Bin Hu

As a serious mental disease, depression causes great harm to the physical and mental health of individuals, and becomes an important cause of suicide. Therefore, it is necessary to accurately identify and treat depressed patients. Compared with traditional clinical diagnosis methods, a large amount of real and different types of data on social media provides new ideas for depression detection research. In this paper, we construct a depression detection data set based on Weibo, and propose a Multimodal Hierarchical Attention (MHA) model for social media depression detection. Multimodal data is fed into the model and the attention mechanism is applied within and between modalities at the same time. Experimental results show that the proposed model achieves the best classification performance. In addition, we propose a distribution normalization method, which can optimize the data distribution and improve the accuracy of depression detection.

抑郁症作为一种严重的精神疾病,对个体的身心健康造成极大危害,成为自杀的重要原因。因此,有必要准确识别和治疗抑郁症患者。与传统的临床诊断方法相比,社交媒体上大量真实且不同类型的数据为抑郁症检测研究提供了新的思路。在本文中,我们构建了一个基于微博的抑郁症检测数据集,并提出了一个用于社交媒体抑郁症检测的多模式层次注意力(MHA)模型。多模态数据被输入到模型中,同时在模态内部和模态之间应用注意力机制。实验结果表明,该模型取得了最佳的分类性能。此外,我们提出了一种分布归一化方法,该方法可以优化数据分布,提高抑郁症检测的准确性。
{"title":"MHA: a multimodal hierarchical attention model for depression detection in social media.","authors":"Zepeng Li, Zhengyi An, Wenchuan Cheng, Jiawei Zhou, Fang Zheng, Bin Hu","doi":"10.1007/s13755-022-00197-5","DOIUrl":"10.1007/s13755-022-00197-5","url":null,"abstract":"<p><p>As a serious mental disease, depression causes great harm to the physical and mental health of individuals, and becomes an important cause of suicide. Therefore, it is necessary to accurately identify and treat depressed patients. Compared with traditional clinical diagnosis methods, a large amount of real and different types of data on social media provides new ideas for depression detection research. In this paper, we construct a depression detection data set based on Weibo, and propose a Multimodal Hierarchical Attention (MHA) model for social media depression detection. Multimodal data is fed into the model and the attention mechanism is applied within and between modalities at the same time. Experimental results show that the proposed model achieves the best classification performance. In addition, we propose a distribution normalization method, which can optimize the data distribution and improve the accuracy of depression detection.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"6"},"PeriodicalIF":6.0,"publicationDate":"2023-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9846704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10554511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
BTC-fCNN: Fast Convolution Neural Network for Multi-class Brain Tumor Classification. BTC-fCNN:快速卷积神经网络用于多类脑肿瘤分类。
IF 6 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2023-01-02 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-022-00203-w
Basant S Abd El-Wahab, Mohamed E Nasr, Salah Khamis, Amira S Ashour

Timely prognosis of brain tumors has a crucial role for powerful healthcare of remedy-making plans. Manual classification of the brain tumors in magnetic resonance imaging (MRI) images is a challenging task, which relies on the experienced radiologists to identify and classify the brain tumor. Automated classification of different brain tumors is significant based on designing computer-aided diagnosis (CAD) systems. Existing classification methods suffer from unsatisfactory performance and/or large computational cost/ time. This paper proposed a fast and efficient classification process, called BTC-fCNN, which is a deep learning-based system to distinguish between different views of three brain tumor types, namely meningioma, glioma, and pituitary tumors. The proposed system's model was applied on MRI images from the Figshare dataset. It consists of 13 layers with few trainable parameters involving convolution layer, 1 × 1 convolution layer, average pooling, fully connected layer, and softmax layer. Five iterations including transfer learning and five-fold cross-validation for retraining are considered to increase the proposed model performance. The proposed model achieved 98.63% average accuracy, using five iterations with transfer learning, and 98.86% using retrained five-fold cross-validation (internal transfer learning between the folds). Various evaluation metrics were measured to evaluate the proposed model, such as precision, F-score, recall, specificity and confusion matrix. The proposed BTC-fCNN model outstrips the state-of-the-art and other well-known convolution neural networks (CNN).

脑肿瘤的及时预后对于制定强有力的治疗计划具有至关重要的作用。核磁共振成像(MRI)图像中脑肿瘤的手动分类是一项具有挑战性的任务,它依赖于经验丰富的放射科医生来识别和分类脑肿瘤。在设计计算机辅助诊断(CAD)系统的基础上,对不同的脑肿瘤进行自动分类具有重要意义。现有的分类方法存在不令人满意的性能和/或大的计算成本/时间。本文提出了一种快速高效的分类过程,称为BTC-fCNN,这是一种基于深度学习的系统,用于区分三种脑肿瘤类型的不同观点,即脑膜瘤、神经胶质瘤和垂体瘤。所提出的系统模型应用于Figshare数据集的MRI图像。它由13层组成,其中涉及卷积层的可训练参数很少,1 × 1个卷积层、平均池、全连接层和softmax层。考虑了五次迭代,包括迁移学习和五次交叉验证,以提高所提出的模型性能。使用带有迁移学习的五次迭代,所提出的模型实现了98.63%的平均准确率,使用重新训练的五次交叉验证(折叠之间的内部迁移学习)实现了98.86%的平均准确度。测量了各种评估指标来评估所提出的模型,如精确度、F评分、召回率、特异性和混淆矩阵。所提出的BTC-fCNN模型超过了最先进的和其他著名的卷积神经网络(CNN)。
{"title":"BTC-fCNN: Fast Convolution Neural Network for Multi-class Brain Tumor Classification.","authors":"Basant S Abd El-Wahab, Mohamed E Nasr, Salah Khamis, Amira S Ashour","doi":"10.1007/s13755-022-00203-w","DOIUrl":"10.1007/s13755-022-00203-w","url":null,"abstract":"<p><p>Timely prognosis of brain tumors has a crucial role for powerful healthcare of remedy-making plans. Manual classification of the brain tumors in magnetic resonance imaging (MRI) images is a challenging task, which relies on the experienced radiologists to identify and classify the brain tumor. Automated classification of different brain tumors is significant based on designing computer-aided diagnosis (CAD) systems. Existing classification methods suffer from unsatisfactory performance and/or large computational cost/ time. This paper proposed a fast and efficient classification process, called BTC-fCNN, which is a deep learning-based system to distinguish between different views of three brain tumor types, namely meningioma, glioma, and pituitary tumors. The proposed system's model was applied on MRI images from the Figshare dataset. It consists of 13 layers with few trainable parameters involving convolution layer, 1 × 1 convolution layer, average pooling, fully connected layer, and softmax layer. Five iterations including transfer learning and five-fold cross-validation for retraining are considered to increase the proposed model performance. The proposed model achieved 98.63% average accuracy, using five iterations with transfer learning, and 98.86% using retrained five-fold cross-validation (internal transfer learning between the folds). Various evaluation metrics were measured to evaluate the proposed model, such as precision, F-score, recall, specificity and confusion matrix. The proposed BTC-fCNN model outstrips the state-of-the-art and other well-known convolution neural networks (CNN).</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"3"},"PeriodicalIF":6.0,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9807719/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10843717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Embedding-based link predictions to explore latent comorbidity of chronic diseases. 基于嵌入的链接预测,以探索慢性病的潜在共病。
IF 4.7 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2022-12-30 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-022-00206-7
Haohui Lu, Shahadat Uddin

Purpose: Comorbidity is a term used to describe when a patient simultaneously has more than one chronic disease. Comorbidity is a significant health issue that affects people worldwide. This study aims to use machine learning and graph theory to predict the comorbidity of chronic diseases.

Methods: A patient-disease bipartite graph is constructed based on the administrative claim data. The bipartite graph projection approach was used to create the comorbidity network. For the link prediction task, three graph machine learning embedding-based models (node2vec, graph neural networks and hand-crafted approach) with different variants were used on the comorbidity network to compare their performance. This study also considered three commonly used similarity-based link prediction approaches (Jaccard coefficient, Adamic-Adar index and Resource allocation index) for performance comparison.

Results: The results showed that the embedding-based hand-crafted features technique achieved outstanding performance compared with the remaining similarity-based and embedding-based models. Especially, the hand-crafted technique with the extreme gradient boosting classifier achieved the highest accuracy (91.67%), followed by the same technique with the Logistic regression classifier (90.26%). For this shallow embedding method, the Jaccard coefficient and the degree centrality of the original chronic disease were the most important features for comorbidity prediction.

Conclusion: The proposed framework can be used to predict the comorbidity of chronic disease at an early stage of hospital admission. Thus, the prediction outcome could be valuable for medical practice, giving healthcare providers more control over their services and lowering expenses.

目的:合并症是一个术语,用于描述患者同时患有一种以上的慢性病。共病是一个影响全世界人民的重大健康问题。本研究旨在利用机器学习和图论来预测慢性病的合并症。方法:以行政索赔数据为基础,构建患者疾病二分图。使用二分图投影方法来创建共病网络。对于链接预测任务,在共病网络上使用了三个具有不同变体的基于图机器学习嵌入的模型(node2vec、图神经网络和手工方法)来比较它们的性能。本研究还考虑了三种常用的基于相似性的链路预测方法(Jaccard系数、Adamic-Adar指数和资源分配指数)进行性能比较。结果:与其余的基于相似性和基于嵌入的模型相比,基于嵌入的手工特征技术取得了优异的性能。特别是,手工制作的极端梯度增强分类器的准确率最高(91.67%),其次是与Logistic回归分类器相同的技术(90.26%)。对于这种浅嵌入方法,Jaccard系数和原始慢性病的程度中心性是预测共病的最重要特征。结论:该框架可用于预测住院早期的慢性病合并症。因此,预测结果可能对医疗实践有价值,使医疗保健提供者能够更好地控制他们的服务并降低费用。
{"title":"Embedding-based link predictions to explore latent comorbidity of chronic diseases.","authors":"Haohui Lu, Shahadat Uddin","doi":"10.1007/s13755-022-00206-7","DOIUrl":"10.1007/s13755-022-00206-7","url":null,"abstract":"<p><strong>Purpose: </strong>Comorbidity is a term used to describe when a patient simultaneously has more than one chronic disease. Comorbidity is a significant health issue that affects people worldwide. This study aims to use machine learning and graph theory to predict the comorbidity of chronic diseases.</p><p><strong>Methods: </strong>A patient-disease bipartite graph is constructed based on the administrative claim data. The bipartite graph projection approach was used to create the comorbidity network. For the link prediction task, three graph machine learning embedding-based models (node2vec, graph neural networks and hand-crafted approach) with different variants were used on the comorbidity network to compare their performance. This study also considered three commonly used similarity-based link prediction approaches (Jaccard coefficient, Adamic-Adar index and Resource allocation index) for performance comparison.</p><p><strong>Results: </strong>The results showed that the embedding-based hand-crafted features technique achieved outstanding performance compared with the remaining similarity-based and embedding-based models. Especially, the hand-crafted technique with the extreme gradient boosting classifier achieved the highest accuracy (91.67%), followed by the same technique with the Logistic regression classifier (90.26%). For this shallow embedding method, the Jaccard coefficient and the degree centrality of the original chronic disease were the most important features for comorbidity prediction.</p><p><strong>Conclusion: </strong>The proposed framework can be used to predict the comorbidity of chronic disease at an early stage of hospital admission. Thus, the prediction outcome could be valuable for medical practice, giving healthcare providers more control over their services and lowering expenses.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"2"},"PeriodicalIF":4.7,"publicationDate":"2022-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803807/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10477040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficacy of novel attention-based gated recurrent units transformer for depression detection using electroencephalogram signals. 新型基于注意力的门控复发单元变换器用于使用脑电图信号检测抑郁症的疗效。
IF 4.7 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2022-12-29 eCollection Date: 2023-12-01 DOI: 10.1007/s13755-022-00205-8
Neha Prerna Tigga, Shruti Garg

Purpose: Depression is a global challenge causing psychological and intellectual problems that require efficient diagnosis. Electroencephalogram (EEG) signals represent the functional state of the human brain and can help build an accurate and viable technique for the early prediction and treatment of depression.

Methods: An attention-based gated recurrent units transformer (AttGRUT) time-series model is proposed to efficiently identify EEG perturbations in depressive patients. Statistical, spectral and wavelet features were first extracted from the 60-channel EEG signal data. Then, two feature selection techniques, recursive feature elimination and the Boruta algorithm, both with Shapley additive explanations, were utilised for selecting essential features.

Results: The proposed model outperformed the two baseline and two hybrid time-series models-long short-term memory (LSTM), gated recurrent units (GRU), convolutional neural network-LSTM (CNN-LSTM), and CNN-GRU-achieving an accuracy of up to 98.67%. Feature selection considerably increased the performance across all time-series models.

Conclusion: Based on the obtained results, novel feature selection greatly affected the results of the baseline and hybrid time-series models. The proposed AttGRUT can be implemented and tested in other domains by using different modalities for prediction.

Supplementary information: The online version contains supplementary material available at 10.1007/s13755-022-00205-8.

目的:抑郁症是一个全球性的挑战,导致心理和智力问题,需要有效的诊断。脑电图(EEG)信号代表了人类大脑的功能状态,可以帮助建立一种准确可行的技术来早期预测和治疗抑郁症。方法:提出了一种基于注意力的门控递归单元变换器(AttGRUT)时间序列模型来有效识别抑郁症患者的脑电图扰动。首先从60通道脑电信号数据中提取统计、频谱和小波特征。然后,使用两种特征选择技术,递归特征消除和Boruta算法,都具有Shapley加法解释,来选择基本特征。结果:所提出的模型优于两个基线和两个混合时间序列模型——长短期记忆(LSTM)、门控递归单元(GRU)、卷积神经网络LSTM(CNN-LSTM)和CNN-GRU,准确率高达98.67%。特征选择显著提高了所有时间序列模型的性能。结论:基于所获得的结果,新的特征选择极大地影响了基线和混合时间序列模型的结果。所提出的AttGRUT可以通过使用不同的预测模式在其他领域中实现和测试。补充信息:在线版本包含补充材料,可访问10.1007/s13755-022-00205-8。
{"title":"Efficacy of novel attention-based gated recurrent units transformer for depression detection using electroencephalogram signals.","authors":"Neha Prerna Tigga, Shruti Garg","doi":"10.1007/s13755-022-00205-8","DOIUrl":"10.1007/s13755-022-00205-8","url":null,"abstract":"<p><strong>Purpose: </strong>Depression is a global challenge causing psychological and intellectual problems that require efficient diagnosis. Electroencephalogram (EEG) signals represent the functional state of the human brain and can help build an accurate and viable technique for the early prediction and treatment of depression.</p><p><strong>Methods: </strong>An attention-based gated recurrent units transformer (AttGRUT) time-series model is proposed to efficiently identify EEG perturbations in depressive patients. Statistical, spectral and wavelet features were first extracted from the 60-channel EEG signal data. Then, two feature selection techniques, recursive feature elimination and the Boruta algorithm, both with Shapley additive explanations, were utilised for selecting essential features.</p><p><strong>Results: </strong>The proposed model outperformed the two baseline and two hybrid time-series models-long short-term memory (LSTM), gated recurrent units (GRU), convolutional neural network-LSTM (CNN-LSTM), and CNN-GRU-achieving an accuracy of up to 98.67%. Feature selection considerably increased the performance across all time-series models.</p><p><strong>Conclusion: </strong>Based on the obtained results, novel feature selection greatly affected the results of the baseline and hybrid time-series models. The proposed AttGRUT can be implemented and tested in other domains by using different modalities for prediction.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13755-022-00205-8.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"11 1","pages":"1"},"PeriodicalIF":4.7,"publicationDate":"2022-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9800680/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10276794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A GIS enhanced data analytics approach for predicting nursing home hurricane evacuation response. 预测养老院飓风疏散响应的GIS增强数据分析方法。
IF 6 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2022-09-14 eCollection Date: 2022-12-01 DOI: 10.1007/s13755-022-00190-y
Nazmus Sakib, Kathryn Hyer, Debra Dobbs, Lindsay Peterson, Dylan J Jester, Nan Kong, Mingyang Li

Nursing homes (NHs) are responsible for caring for frail, older adults, who are highly vulnerable to natural disasters, such as hurricanes. Due to the influence of highly uncertain environmental conditions and varied NH characteristics (e.g., geo-location, staffing, residents' health conditions), the NH evacuation response, namely evacuating or sheltering-in-place, is highly uncertain. Accurate prediction of NH evacuation response is important for emergency management agencies to accurately anticipate the NH evacuation demand surge with healthcare resources proactively planned. Existing hurricane evacuation research mainly focuses on the general population. For NH evacuation, existing studies mainly focus on conceptual studies and/or qualitative analysis using a single source of data, such as surveys or resident health data. There is a lack of research to develop analytics-based method by fusing rich environmental data with NH data to improve the prediction accuracy. In this paper, we propose a Geographic Information System (GIS) data enhanced predictive analytics approach for forecasting NH evacuation response by fusing multi-source data related to storm conditions, geographical information, NH organizational characteristics as well as staffing and residents characteristics of each NH. In particular, multiple GIS features, such as distance to storm trajectory, projected wind speed, potential storm surge and NH elevation, are extracted from rich GIS information and incorporated to improve the prediction performance. A real-world case study of NH evacuation during Hurricane Irma in 2017 is examined to demonstrate superior prediction performance of the proposed work over a large number of predictive analytics methods without GIS information.

养老院(NHs)负责照顾体弱多病的老年人,他们极易受到自然灾害(如飓风)的伤害。由于高度不确定的环境条件和不同的NH特征(例如地理位置、人员配备、居民健康状况)的影响,NH疏散响应,即疏散或就地避难,具有高度不确定性。准确预测NH疏散响应对于应急管理机构准确预测NH疏散需求激增以及主动规划医疗资源非常重要。现有的飓风疏散研究主要集中在一般人群。对于NH疏散,现有研究主要侧重于概念性研究和/或定性分析,使用单一数据来源,如调查或居民健康数据。将丰富的环境数据与NH数据相融合,开发基于分析的方法来提高预测精度,目前还缺乏相关研究。本文提出了一种地理信息系统(GIS)数据增强预测分析方法,通过融合与风暴条件、地理信息、NH组织特征以及每个NH的人员配备和居民特征相关的多源数据,预测NH疏散响应。特别地,从丰富的GIS信息中提取多个GIS特征,如与风暴轨迹的距离、预计风速、潜在风暴潮和NH高程,并将其结合起来以提高预测性能。对2017年飓风Irma期间NH疏散的实际案例进行了研究,以证明所提出的工作比大量没有GIS信息的预测分析方法具有更好的预测性能。
{"title":"A GIS enhanced data analytics approach for predicting nursing home hurricane evacuation response.","authors":"Nazmus Sakib, Kathryn Hyer, Debra Dobbs, Lindsay Peterson, Dylan J Jester, Nan Kong, Mingyang Li","doi":"10.1007/s13755-022-00190-y","DOIUrl":"10.1007/s13755-022-00190-y","url":null,"abstract":"<p><p>Nursing homes (NHs) are responsible for caring for frail, older adults, who are highly vulnerable to natural disasters, such as hurricanes. Due to the influence of highly uncertain environmental conditions and varied NH characteristics (e.g., geo-location, staffing, residents' health conditions), the NH evacuation response, namely evacuating or sheltering-in-place, is highly uncertain. Accurate prediction of NH evacuation response is important for emergency management agencies to accurately anticipate the NH evacuation demand surge with healthcare resources proactively planned. Existing hurricane evacuation research mainly focuses on the general population. For NH evacuation, existing studies mainly focus on conceptual studies and/or qualitative analysis using a single source of data, such as surveys or resident health data. There is a lack of research to develop analytics-based method by fusing rich environmental data with NH data to improve the prediction accuracy. In this paper, we propose a Geographic Information System (GIS) data enhanced predictive analytics approach for forecasting NH evacuation response by fusing multi-source data related to storm conditions, geographical information, NH organizational characteristics as well as staffing and residents characteristics of each NH. In particular, multiple GIS features, such as distance to storm trajectory, projected wind speed, potential storm surge and NH elevation, are extracted from rich GIS information and incorporated to improve the prediction performance. A real-world case study of NH evacuation during Hurricane Irma in 2017 is examined to demonstrate superior prediction performance of the proposed work over a large number of predictive analytics methods without GIS information.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"10 1","pages":"28"},"PeriodicalIF":6.0,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9474783/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10106276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wrist pulse signal based vascular age calculation using mixed Gaussian model and support vector regression. 基于腕部脉搏信号的血管年龄计算,采用混合高斯模型和支持向量回归。
IF 6 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2022-04-21 eCollection Date: 2022-12-01 DOI: 10.1007/s13755-022-00172-0
Qingfeng Tang, Shoujiang Xu, Mengjuan Guo, Guangjun Wang, Zhigeng Pan, Benyue Su

Purpose: Vascular age (VA) is the direct index to reflect vascular aging, so it plays a particular role in public health. How to obtain VA conveniently and cheaply has always been a research hotspot. This study proposes a new method to evaluate VA with wrist pulse signal.

Methods: Firstly, we fit the pulse signal by mixed Gaussian model (MGM) to extract the shape features, and adopt principal component analysis (PCA) to optimize the dimension of the shape features. Secondly, the principal components and chronological age (CA) are respectively taken as the independent variables and dependent variable to establish support vector regression (SVR) model. Thirdly, the principal components are fed into the SVR model to predicted the vascular aging of each subject. The predicted value is regarded as the description of VA. Finally, we compare the correlation coefficients of VA with pulse width (PW), inflection point area ratio (IPA), Ratio b/a (RBA), augmentation index (AIx), diastolic augmentation index (DAI) and pulse transit time (PTT) with those of CA with these six indices.

Results: Compared with the CA, the VA is closer to PW (r = 0.539, P < 0.001 to r = 0.589, P < 0.001 in men; r = 0.325, P < 0.001 to r = 0.400, P < 0.001 in women), IPA (r =  - 0.446, P < 0.001 to r =  - 0.534, P < 0.001 in men; r =  - 0.623, P < 0.001 to r =  - 0.660, P < 0.001 in women), RBA (r = 0.328, P < 0.001 to r = 0.371, P < 0.001 in women), AIx (r = 0.659, P < 0.001 to r = 0.738, P < 0.001 in men; r = 0.547, P < 0.001 to r = 0.573, P < 0.001 in women), DAI (r = 0.517, P < 0.001 to r = 0.532, P < 0.001 in men; r = 0.507, P < 0.001 to r = 0.570, P < 0.001 in women) and PTT (r = 0.526, P < 0.001 to r = 0.659, P < 0.001 in men; r = 0.577, P < 0.001 to r = 0.814, P < 0.001 in women).

Conclusion: The VA is more representative of vascular aging than CA. The method presented in this study provides a new way to directly and objectively assess vascular aging in public health.

目的:血管年龄(Vascular age, VA)是反映血管老化的直接指标,在公共卫生中具有特殊的作用。如何方便、廉价地获取VA一直是研究的热点。本研究提出了一种利用腕部脉搏信号评估VA的新方法。方法:首先采用混合高斯模型(MGM)对脉冲信号进行拟合提取形状特征,并采用主成分分析(PCA)对形状特征进行维数优化;其次,分别以主成分和实足年龄作为自变量和因变量,建立支持向量回归(SVR)模型;第三,将主成分输入到SVR模型中,对受试者血管老化进行预测。最后,将VA与脉宽(PW)、拐点面积比(IPA)、b/a比(RBA)、增强指数(AIx)、舒张增强指数(DAI)、脉冲传递时间(PTT)的相关系数与CA与这6个指标的相关系数进行比较。结果:与CA相比,我们更接近PW (r = 0.539, P r = 0.589, P r = 0.325, P r = 0.400, P r = - 0.446, P r = - 0.534, P r = - 0.623, P r = - 0.660, P r = 0.328, P r = 0.371, P r = 0.659, P r = 0.738, P r = 0.547, P r = 0.573, P r = 0.517, P r = 0.532, P r = 0.507, P r = 0.570, P r = 0.526, P r = 0.659, P r = 0.577, P r = 0.814, P结论:VA比CA更能代表血管老化,为直接、客观地评价公共卫生血管老化提供了一种新的方法。
{"title":"Wrist pulse signal based vascular age calculation using mixed Gaussian model and support vector regression.","authors":"Qingfeng Tang, Shoujiang Xu, Mengjuan Guo, Guangjun Wang, Zhigeng Pan, Benyue Su","doi":"10.1007/s13755-022-00172-0","DOIUrl":"https://doi.org/10.1007/s13755-022-00172-0","url":null,"abstract":"<p><strong>Purpose: </strong>Vascular age (VA) is the direct index to reflect vascular aging, so it plays a particular role in public health. How to obtain VA conveniently and cheaply has always been a research hotspot. This study proposes a new method to evaluate VA with wrist pulse signal.</p><p><strong>Methods: </strong>Firstly, we fit the pulse signal by mixed Gaussian model (MGM) to extract the shape features, and adopt principal component analysis (PCA) to optimize the dimension of the shape features. Secondly, the principal components and chronological age (CA) are respectively taken as the independent variables and dependent variable to establish support vector regression (SVR) model. Thirdly, the principal components are fed into the SVR model to predicted the vascular aging of each subject. The predicted value is regarded as the description of VA. Finally, we compare the correlation coefficients of VA with pulse width (PW), inflection point area ratio (IPA), Ratio b/a (RBA), augmentation index (AIx), diastolic augmentation index (DAI) and pulse transit time (PTT) with those of CA with these six indices.</p><p><strong>Results: </strong>Compared with the CA, the VA is closer to PW (<i>r</i> = 0.539, <i>P</i> < 0.001 to <i>r</i> = 0.589, <i>P</i> < 0.001 in men; <i>r</i> = 0.325, <i>P</i> < 0.001 to <i>r</i> = 0.400, <i>P</i> < 0.001 in women), IPA (<i>r</i> =  - 0.446, <i>P</i> < 0.001 to <i>r</i> =  - 0.534, <i>P</i> < 0.001 in men; <i>r</i> =  - 0.623, <i>P</i> < 0.001 to <i>r</i> =  - 0.660, <i>P</i> < 0.001 in women), RBA (<i>r</i> = 0.328, <i>P</i> < 0.001 to <i>r</i> = 0.371, <i>P</i> < 0.001 in women), AIx (<i>r</i> = 0.659, <i>P</i> < 0.001 to <i>r</i> = 0.738, <i>P</i> < 0.001 in men; <i>r</i> = 0.547, <i>P</i> < 0.001 to <i>r</i> = 0.573, <i>P</i> < 0.001 in women), DAI (<i>r</i> = 0.517, <i>P</i> < 0.001 to <i>r</i> = 0.532, <i>P</i> < 0.001 in men; <i>r</i> = 0.507, <i>P</i> < 0.001 to <i>r</i> = 0.570, <i>P</i> < 0.001 in women) and PTT (<i>r</i> = 0.526, <i>P</i> < 0.001 to <i>r</i> = 0.659, <i>P</i> < 0.001 in men; <i>r</i> = 0.577, <i>P</i> < 0.001 to <i>r</i> = 0.814, <i>P</i> < 0.001 in women).</p><p><strong>Conclusion: </strong>The VA is more representative of vascular aging than CA. The method presented in this study provides a new way to directly and objectively assess vascular aging in public health.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"10 1","pages":"7"},"PeriodicalIF":6.0,"publicationDate":"2022-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9023627/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138471037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A divisive hierarchical clustering methodology for enhancing the ensemble prediction power in large scale population studies: the ATHLOS project. 用于提高大规模人口研究中集合预测能力的分裂分层聚类方法:ATHLOS项目。
IF 4.7 3区 医学 Q1 MEDICAL INFORMATICS Pub Date : 2022-04-18 eCollection Date: 2022-12-01 DOI: 10.1007/s13755-022-00171-1
Petros Barmpas, Sotiris Tasoulis, Aristidis G Vrahatis, Spiros V Georgakopoulos, Panagiotis Anagnostou, Matthew Prina, José Luis Ayuso-Mateos, Jerome Bickenbach, Ivet Bayes, Martin Bobak, Francisco Félix Caballero, Somnath Chatterji, Laia Egea-Cortés, Esther García-Esquinas, Matilde Leonardi, Seppo Koskinen, Ilona Koupil, Andrzej Paja K, Martin Prince, Warren Sanderson, Sergei Scherbov, Abdonas Tamosiunas, Aleksander Galas, Josep Maria Haro, Albert Sanchez-Niubo, Vassilis P Plagianakos, Demosthenes Panagiotakos

The ATHLOS cohort is composed of several harmonized datasets of international groups related to health and aging. As a result, the Healthy Aging index has been constructed based on a selection of variables from 16 individual studies. In this paper, we consider additional variables found in ATHLOS and investigate their utilization for predicting the Healthy Aging index. For this purpose, motivated by the volume and diversity of the dataset, we focus our attention upon data clustering, where unsupervised learning is utilized to enhance prediction power. Thus we show the predictive utility of exploiting hidden data structures. In addition, we demonstrate that imposed computation bottlenecks can be surpassed when using appropriate hierarchical clustering, within a clustering for ensemble classification scheme, while retaining prediction benefits. We propose a complete methodology that is evaluated against baseline methods and the original concept. The results are very encouraging suggesting further developments in this direction along with applications in tasks with similar characteristics. A straightforward open source implementation for the R project is also provided (https://github.com/Petros-Barmpas/HCEP).

Supplementary information: The online version contains supplementary material available at 10.1007/s13755-022-00171-1.

ATHLOS队列由几个与健康和老龄化相关的国际群体的协调数据集组成。因此,健康老龄化指数是基于从16项单独研究中选择的变量构建的。在本文中,我们考虑了ATHLOS中发现的其他变量,并研究了它们在预测健康老龄化指数中的应用。为此,由于数据集的数量和多样性,我们将注意力集中在数据聚类上,其中使用无监督学习来增强预测能力。因此,我们展示了利用隐藏数据结构的预测效用。此外,我们还证明,在集成分类方案的聚类中,使用适当的分层聚类可以超越强加的计算瓶颈,同时保留预测优势。我们提出了一个完整的方法,对基线方法和原始概念进行评估。结果非常令人鼓舞,表明在这一方向上的进一步发展以及在具有类似特征的任务中的应用。还提供了R项目的直接开源实现(https://github.com/Petros-Barmpas/HCEP).Supplementary信息:在线版本包含补充材料,可在10.1007/s13755-022-00171-1获得。
{"title":"A divisive hierarchical clustering methodology for enhancing the ensemble prediction power in large scale population studies: the ATHLOS project.","authors":"Petros Barmpas, Sotiris Tasoulis, Aristidis G Vrahatis, Spiros V Georgakopoulos, Panagiotis Anagnostou, Matthew Prina, José Luis Ayuso-Mateos, Jerome Bickenbach, Ivet Bayes, Martin Bobak, Francisco Félix Caballero, Somnath Chatterji, Laia Egea-Cortés, Esther García-Esquinas, Matilde Leonardi, Seppo Koskinen, Ilona Koupil, Andrzej Paja K, Martin Prince, Warren Sanderson, Sergei Scherbov, Abdonas Tamosiunas, Aleksander Galas, Josep Maria Haro, Albert Sanchez-Niubo, Vassilis P Plagianakos, Demosthenes Panagiotakos","doi":"10.1007/s13755-022-00171-1","DOIUrl":"10.1007/s13755-022-00171-1","url":null,"abstract":"<p><p>The ATHLOS cohort is composed of several harmonized datasets of international groups related to health and aging. As a result, the Healthy Aging index has been constructed based on a selection of variables from 16 individual studies. In this paper, we consider additional variables found in ATHLOS and investigate their utilization for predicting the Healthy Aging index. For this purpose, motivated by the volume and diversity of the dataset, we focus our attention upon data clustering, where unsupervised learning is utilized to enhance prediction power. Thus we show the predictive utility of exploiting hidden data structures. In addition, we demonstrate that imposed computation bottlenecks can be surpassed when using appropriate hierarchical clustering, within a clustering for ensemble classification scheme, while retaining prediction benefits. We propose a complete methodology that is evaluated against baseline methods and the original concept. The results are very encouraging suggesting further developments in this direction along with applications in tasks with similar characteristics. A straightforward open source implementation for the R project is also provided (https://github.com/Petros-Barmpas/HCEP).</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13755-022-00171-1.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"10 1","pages":"6"},"PeriodicalIF":4.7,"publicationDate":"2022-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9013733/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10866298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Health Information Science and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1