基于Framingham数据集的增强递归神经网络(RNN)用于心脏病风险预测

Surenthiran Krishnan, Pritheega Magalingam, Roslina Ibrahim
{"title":"基于Framingham数据集的增强递归神经网络(RNN)用于心脏病风险预测","authors":"Surenthiran Krishnan, Pritheega Magalingam, Roslina Ibrahim","doi":"10.11113/oiji2023.11n1.242","DOIUrl":null,"url":null,"abstract":"Heart disease is one of the leading causes of death globally, which takes 17.9 million lives each year. The existing heart disease prediction techniques have a gap that does not consider the smoking attributes from the heart disease data. So, the accuracy is based on the limited number of medical data and the deep learning model. The existing deep learning models which use the Recurrent Neural Network (RNN) for heart disease prediction consume more processing and analysing time, mainly due to the delay of data retrieval. This delay causes the prediction process to become slower and leads to a moderate prediction only. The backpropagation of the RNN with an update gate and internal memory to carry the updated data cause a minor data glitch that leads to lower accuracy. Therefore, an efficient heart disease prediction model is very crucial to provide early detection among patients. This research proposes a Heart Disease Risk Prediction Model (HDRPM) with an enhanced RNN to improve the prediction accuracy using Framingham heart disease datasets. The specificity and sensitivity are imposed to improve the quality of the predictions. Sensitivity measure is used for detecting patients with heart disease perfectly and specificity measure is used for detecting patients without the disease perfectly. Besides the accuracy and quality of the prediction problem, the imbalance of minority classes in the dataset occurred in most deep learning prediction fields. This research aims to improve the quality of imbalanced Framingham datasets using Synthetic Minority Over-sampling Technique (SMOTe), which will synthetic instances in a small class to be equalized. The existing RNN model faces vanishing gradients that impede the learning of long data sequences. These gradients that carry information in the RNN cells will become smaller gradually till it minimises the parameter updates and leads to poor learning. For this purpose, the presence of multiple Gated Recurrent Unit (GRU) is used to overcome the vanishing gradients and ensure the hidden layers. The neurons of RNN rapidly cater for the essential information during the training and validation phase of the HDRPM. The integration of multiple GRU with the RNN, operating on the Tensorflow as back-end and Keras as the core for the neural network library has increased the performance of the proposed model. The proposed model provides up to 98.78%, the highest accuracy achieved compared to related previous work, which is a quantum neural network model with 98.57. This HDRPM is expected to significantly contribute to early detection of heart disease patients.","PeriodicalId":379468,"journal":{"name":"Open International Journal of Informatics","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced Recurrent Neural Network (RNN) For Heart Disease Risk Prediction Using Framingham Datasets\",\"authors\":\"Surenthiran Krishnan, Pritheega Magalingam, Roslina Ibrahim\",\"doi\":\"10.11113/oiji2023.11n1.242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heart disease is one of the leading causes of death globally, which takes 17.9 million lives each year. The existing heart disease prediction techniques have a gap that does not consider the smoking attributes from the heart disease data. So, the accuracy is based on the limited number of medical data and the deep learning model. The existing deep learning models which use the Recurrent Neural Network (RNN) for heart disease prediction consume more processing and analysing time, mainly due to the delay of data retrieval. This delay causes the prediction process to become slower and leads to a moderate prediction only. The backpropagation of the RNN with an update gate and internal memory to carry the updated data cause a minor data glitch that leads to lower accuracy. Therefore, an efficient heart disease prediction model is very crucial to provide early detection among patients. This research proposes a Heart Disease Risk Prediction Model (HDRPM) with an enhanced RNN to improve the prediction accuracy using Framingham heart disease datasets. The specificity and sensitivity are imposed to improve the quality of the predictions. Sensitivity measure is used for detecting patients with heart disease perfectly and specificity measure is used for detecting patients without the disease perfectly. Besides the accuracy and quality of the prediction problem, the imbalance of minority classes in the dataset occurred in most deep learning prediction fields. This research aims to improve the quality of imbalanced Framingham datasets using Synthetic Minority Over-sampling Technique (SMOTe), which will synthetic instances in a small class to be equalized. The existing RNN model faces vanishing gradients that impede the learning of long data sequences. These gradients that carry information in the RNN cells will become smaller gradually till it minimises the parameter updates and leads to poor learning. For this purpose, the presence of multiple Gated Recurrent Unit (GRU) is used to overcome the vanishing gradients and ensure the hidden layers. The neurons of RNN rapidly cater for the essential information during the training and validation phase of the HDRPM. The integration of multiple GRU with the RNN, operating on the Tensorflow as back-end and Keras as the core for the neural network library has increased the performance of the proposed model. The proposed model provides up to 98.78%, the highest accuracy achieved compared to related previous work, which is a quantum neural network model with 98.57. This HDRPM is expected to significantly contribute to early detection of heart disease patients.\",\"PeriodicalId\":379468,\"journal\":{\"name\":\"Open International Journal of Informatics\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Open International Journal of Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11113/oiji2023.11n1.242\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open International Journal of Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11113/oiji2023.11n1.242","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

心脏病是全球死亡的主要原因之一,每年夺走1790万人的生命。现有的心脏病预测技术没有考虑到心脏病数据中的吸烟属性,存在一定的差距。因此,准确性是基于有限数量的医疗数据和深度学习模型。现有的深度学习模型使用递归神经网络(RNN)进行心脏病预测,主要是由于数据检索的延迟,需要花费更多的处理和分析时间。这种延迟会导致预测过程变慢,并且只能进行适度的预测。带有更新门和内部存储器来携带更新数据的RNN的反向传播会导致导致精度降低的小数据故障。因此,建立一个有效的心脏病预测模型对于早期发现患者至关重要。为了提高Framingham心脏病数据集的预测精度,本研究提出了一种增强RNN的心脏病风险预测模型(HDRPM)。特异性和敏感性是为了提高预测的质量。灵敏度法用于完美检测心脏病患者,特异性法用于完美检测无心脏病患者。除了预测问题的准确性和质量外,在大多数深度学习预测领域中,数据集中的少数类的不平衡都存在。本研究旨在利用合成少数派过采样技术(SMOTe)提高不平衡Framingham数据集的质量,该技术将小类别中的合成实例进行均衡。现有的RNN模型面临着阻碍长数据序列学习的梯度消失问题。这些在RNN细胞中携带信息的梯度会逐渐变小,直到参数更新最小化,导致学习效果不佳。为此,利用多个门控循环单元(GRU)的存在来克服梯度消失并保证隐藏层的存在。在HDRPM的训练和验证阶段,RNN神经元能够快速地满足基本信息的需求。多个GRU与RNN的集成,以Tensorflow为后端,Keras为神经网络库的核心,提高了所提出模型的性能。该模型的准确率高达98.78%,是前人研究中准确率最高的量子神经网络模型,准确率为98.57。这种HDRPM预计将大大有助于心脏病患者的早期发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Enhanced Recurrent Neural Network (RNN) For Heart Disease Risk Prediction Using Framingham Datasets
Heart disease is one of the leading causes of death globally, which takes 17.9 million lives each year. The existing heart disease prediction techniques have a gap that does not consider the smoking attributes from the heart disease data. So, the accuracy is based on the limited number of medical data and the deep learning model. The existing deep learning models which use the Recurrent Neural Network (RNN) for heart disease prediction consume more processing and analysing time, mainly due to the delay of data retrieval. This delay causes the prediction process to become slower and leads to a moderate prediction only. The backpropagation of the RNN with an update gate and internal memory to carry the updated data cause a minor data glitch that leads to lower accuracy. Therefore, an efficient heart disease prediction model is very crucial to provide early detection among patients. This research proposes a Heart Disease Risk Prediction Model (HDRPM) with an enhanced RNN to improve the prediction accuracy using Framingham heart disease datasets. The specificity and sensitivity are imposed to improve the quality of the predictions. Sensitivity measure is used for detecting patients with heart disease perfectly and specificity measure is used for detecting patients without the disease perfectly. Besides the accuracy and quality of the prediction problem, the imbalance of minority classes in the dataset occurred in most deep learning prediction fields. This research aims to improve the quality of imbalanced Framingham datasets using Synthetic Minority Over-sampling Technique (SMOTe), which will synthetic instances in a small class to be equalized. The existing RNN model faces vanishing gradients that impede the learning of long data sequences. These gradients that carry information in the RNN cells will become smaller gradually till it minimises the parameter updates and leads to poor learning. For this purpose, the presence of multiple Gated Recurrent Unit (GRU) is used to overcome the vanishing gradients and ensure the hidden layers. The neurons of RNN rapidly cater for the essential information during the training and validation phase of the HDRPM. The integration of multiple GRU with the RNN, operating on the Tensorflow as back-end and Keras as the core for the neural network library has increased the performance of the proposed model. The proposed model provides up to 98.78%, the highest accuracy achieved compared to related previous work, which is a quantum neural network model with 98.57. This HDRPM is expected to significantly contribute to early detection of heart disease patients.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Literature Review on Metadata Governance Enterprise Architecture for Equipment Performance Analysis Based on Internet of Things (IoT) Technology in the Semiconductor Manufacturing Industry Enterprise Architecture: Enabling Digital Transformation for Healthcare Organization A Review of Enterprise Architecture for Strategic Performance Management in the Transportation Sector Digital Transformation A Quick Start Approach of Enterprise Architecture Implementation in the Investment Industry Sector
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1