Applying Deep Learning to Public Health: Using Unbalanced Demographic Data to Predict Thyroid Disorder

Yasser Attiga, Shih-Yin Chen, J. LaGue, Anaelia Ovalle, Nathan Stott, T. Brander, Abdullah Khaled, Gaurika Tyagi, P. Francis-Lyon
Published in: 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), 2018-11-01
DOI: 10.1109/IEMCON.2018.8614888 (https://doi.org/10.1109/IEMCON.2018.8614888)
Citations: 2

Abstract

This study investigates the use of Deep Neural Learning to predict propensity for disease from demographic information alone, with thyroid disease as the test application. The imbalanced dataset of 747,301 samples contained 13 demographic predictor variables that were not known to be associated with the disease, and had much missing information. A TensorFlow feed-forward neural network was trained to predict thyroid disease. Different activation functions and a variety of up-sampling and down-sampling methods were employed. The lift statistic was used to evaluate success in identifying patients with a propensity for thyroid disease. The DNN model outperformed the Random Forest model with a 36.63% improvement in the lift statistic. These results suggest that deep learning may be successfully employed to select candidates for early intervention for improved health outcomes, utilizing a large dataset with only minimal demographic variables, similar to datasets that are held by the marketing arms of healthcare providers.
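The abstract names the lift statistic and up-sampling but does not define either precisely. The following is a minimal sketch, not the authors' implementation. It assumes lift is the positive rate among the top-ranked fraction of samples divided by the overall positive rate, and it uses plain random duplication of minority-class rows as one possible up-sampling method. The function names `lift_at` and `random_upsample`, the `fraction` parameter, and the toy data are all hypothetical.

```python
import random


def lift_at(y_true, scores, fraction=0.1):
    """Lift in the top `fraction` of samples ranked by predicted score.

    Assumed definition: positive rate among the top-ranked samples
    divided by the positive rate in the whole dataset.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    n = len(y_true)
    k = max(1, int(n * fraction))
    # Rank sample indices by descending score and keep the top k.
    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:k]
    top_rate = sum(y_true[i] for i in top) / k
    base_rate = sum(y_true) / n
    if base_rate == 0:
        raise ValueError("no positive samples")
    return top_rate / base_rate


def random_upsample(rows, labels, seed=0):
    """Duplicate minority-class rows at random until both classes are equal in size."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) < len(neg) else (neg, pos)
    if not minority:
        raise ValueError("one class is empty")
    # Draw extra minority indices with replacement to close the gap.
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    idx = majority + minority + extra
    rng.shuffle(idx)
    return [rows[i] for i in idx], [labels[i] for i in idx]


# Toy data, made up for illustration: 2 positives among 10 samples.
y = [1, 0, 0, 1, 0, 0, 0, 0, 0, 0]
s = [0.9, 0.8, 0.1, 0.7, 0.2, 0.3, 0.1, 0.2, 0.4, 0.05]
toy_lift = lift_at(y, s, fraction=0.2)
balanced_rows, balanced_labels = random_upsample(list(range(10)), y)
```

Under this definition, a lift above 1 means the model concentrates positive cases among its highest-scored samples more than random selection would. Resampling would normally be applied to the training split only, so that lift is measured on data with the original class balance.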