A Knowledge Graph Approach for the Secondary Use of Cancer Registry Data

S. Hasan, D. Rivera, Xiao-Cheng Wu, J. B. Christian, G. Tourassi
{"title":"A Knowledge Graph Approach for the Secondary Use of Cancer Registry Data","authors":"S. Hasan, D. Rivera, Xiao-Cheng Wu, J. B. Christian, G. Tourassi","doi":"10.1109/BHI.2019.8834538","DOIUrl":null,"url":null,"abstract":"Population-based central cancer registries collect valuable structured and unstructured cancer data primarily for surveillance and reporting. The collected data includes (1) categorization of each cancer case (tumor) at the time of diagnosis, (2) demographic information about the patient such as age, gender, and location at time of diagnosis, (3) first course of treatment information, and (4) survival outcomes when available. While advanced analytical approaches such as SEER*Stat and SAS exist, we provide a knowledge graph approach to organizing cancer registry data for advanced analytics which offers unique advantages over existing approaches. This knowledge graph approach semantically enriches the data and enables straightforward linking capability with third-party data to help understand variation in cancer outcomes. A knowledge graph was developed using Louisiana Tumor Registry data. We present the advantages of the knowledge graph approach by examining: i) scenario-specific queries and ii) linkages with publicly available external datasets. Our results demonstrate this graph based solution can perform complex queries, improve query run-time performance by 81%, and more easily conduct iterative analyses to enhance researchers understanding of cancer registry data.","PeriodicalId":281971,"journal":{"name":"2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BHI.2019.8834538","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Population-based central cancer registries collect valuable structured and unstructured cancer data primarily for surveillance and reporting. The collected data includes (1) categorization of each cancer case (tumor) at the time of diagnosis, (2) demographic information about the patient such as age, gender, and location at time of diagnosis, (3) first course of treatment information, and (4) survival outcomes when available. While advanced analytical approaches such as SEER*Stat and SAS exist, we provide a knowledge graph approach to organizing cancer registry data for advanced analytics which offers unique advantages over existing approaches. This knowledge graph approach semantically enriches the data and enables straightforward linking capability with third-party data to help understand variation in cancer outcomes. A knowledge graph was developed using Louisiana Tumor Registry data. We present the advantages of the knowledge graph approach by examining: i) scenario-specific queries and ii) linkages with publicly available external datasets. Our results demonstrate this graph based solution can perform complex queries, improve query run-time performance by 81%, and more easily conduct iterative analyses to enhance researchers understanding of cancer registry data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
癌症登记数据二次使用的知识图谱方法
以人群为基础的中央癌症登记处收集有价值的结构化和非结构化癌症数据,主要用于监测和报告。收集的数据包括(1)诊断时每个癌症病例(肿瘤)的分类,(2)患者的人口学信息,如诊断时的年龄、性别和位置,(3)第一个疗程信息,(4)可用的生存结果。虽然存在先进的分析方法,如SEER*Stat和SAS,但我们提供了一种知识图谱方法来组织癌症登记数据进行高级分析,这比现有方法提供了独特的优势。这种知识图谱方法在语义上丰富了数据,并实现了与第三方数据的直接链接能力,以帮助理解癌症结果的变化。利用路易斯安那州肿瘤登记处的数据开发了一个知识图谱。我们通过检查:i)场景特定查询和ii)与公开可用的外部数据集的链接来展示知识图方法的优势。我们的研究结果表明,这种基于图的解决方案可以执行复杂的查询,将查询运行时性能提高81%,并且更容易进行迭代分析,从而增强研究人员对癌症注册数据的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
ResHNet: Spectrograms Based Efficient Heart Sounds Classification Using Stacked Residual Networks Towards EEG Generation Using GANs for BCI Applications Neurophysiological Variations in Food Decision-Making within Virtual and Real Environments Predicting Drug-Target Interactions Using Weisfeiler-Lehman Neural Network Inverse Regression for Extraction of Tumor Site from Cancer Pathology Reports
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1