National Database of Health Insurance Claims and Specific Health Checkups of Japan (NDB): Outline and Patient-Matching Technique

Kubo Shinichiro, Noda Tatsuya, Myojin Tomoya, Nishioka Yuichi, Higashino Tsuneyuki, Matsui Hiroki, Kato Genta, Imamura Tomoaki
{"title":"National Database of Health Insurance Claims and Specific Health Checkups of Japan (NDB): Outline and Patient-Matching Technique","authors":"Kubo Shinichiro, Noda Tatsuya, Myojin Tomoya, Nishioka Yuichi, Higashino Tsuneyuki, Matsui Hiroki, Kato Genta, Imamura Tomoaki","doi":"10.1101/280008","DOIUrl":null,"url":null,"abstract":"Background The National Database of Health Insurance Claims and Specific Health Checkups of Japan (NDB) is a comprehensive database of health insurance claims data under Japan’s National Health Insurance system. The NDB uses two types of personal identification variables (referred to in the database as “ID1” and “ID2”) to link the insurance claims of individual patients. However, the information entered against these ID variables is prone to change for several reasons, such as when claimants find or change employment, or due to variations in the spelling of their name. In the present study, we developed a new patient-matching technique that improves upon the existing system of using ID1 and ID2 variables. We also sought to validate a new personal ID variable (ID0) that we propose in order to enhance the efficiency of patient matching in the NDB database. Methods Our study targeted data from health insurance claims filed between April 2013 and March 2016 for hospitalization, combined diagnostic procedures, outpatient treatment, and dispensing of prescription medication. We developed a new patient-matching algorithm based on the ID1 and ID2 variables, as well as variables for treatment date and clinical outcome. We then attempted to validate our algorithm by comparing the number of patients identified by patient matching with the current ID1 variable and our proposed ID0 variable against the estimated patient population as of 1 October 2015. Results The numbers of patients in each sex and age group that were identified with the ID0 variable were lower than those identified using the ID1 variable. By using the ID0 variable, we were able to reduce the number of duplicate records for male and female patients by 5.8% and 6.4%, respectively. The numbers of children, adults older than 75 years, and women of reproductive age identified using the ID1 patient-matching variable were all higher than their corresponding estimates. Conversely, the numbers of these patients identified with the ID0 patient-matching variable were all within their corresponding estimates. Conclusion Our findings show that the proposed ID0 variable delivers more precise patient-matching results than the existing ID1 variable. The ID0 variable is currently the best available technique for patient matching in the NDB database. Future patient population estimates should therefore rely on the ID0 variable instead of the ID1 variable.","PeriodicalId":72407,"journal":{"name":"bioRxiv : the preprint server for biology","volume":"105 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"40","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv : the preprint server for biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/280008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 40

Abstract

Background The National Database of Health Insurance Claims and Specific Health Checkups of Japan (NDB) is a comprehensive database of health insurance claims data under Japan’s National Health Insurance system. The NDB uses two types of personal identification variables (referred to in the database as “ID1” and “ID2”) to link the insurance claims of individual patients. However, the information entered against these ID variables is prone to change for several reasons, such as when claimants find or change employment, or due to variations in the spelling of their name. In the present study, we developed a new patient-matching technique that improves upon the existing system of using ID1 and ID2 variables. We also sought to validate a new personal ID variable (ID0) that we propose in order to enhance the efficiency of patient matching in the NDB database. Methods Our study targeted data from health insurance claims filed between April 2013 and March 2016 for hospitalization, combined diagnostic procedures, outpatient treatment, and dispensing of prescription medication. We developed a new patient-matching algorithm based on the ID1 and ID2 variables, as well as variables for treatment date and clinical outcome. We then attempted to validate our algorithm by comparing the number of patients identified by patient matching with the current ID1 variable and our proposed ID0 variable against the estimated patient population as of 1 October 2015. Results The numbers of patients in each sex and age group that were identified with the ID0 variable were lower than those identified using the ID1 variable. By using the ID0 variable, we were able to reduce the number of duplicate records for male and female patients by 5.8% and 6.4%, respectively. The numbers of children, adults older than 75 years, and women of reproductive age identified using the ID1 patient-matching variable were all higher than their corresponding estimates. Conversely, the numbers of these patients identified with the ID0 patient-matching variable were all within their corresponding estimates. Conclusion Our findings show that the proposed ID0 variable delivers more precise patient-matching results than the existing ID1 variable. The ID0 variable is currently the best available technique for patient matching in the NDB database. Future patient population estimates should therefore rely on the ID0 variable instead of the ID1 variable.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
日本国家健康保险理赔和特定健康检查数据库(NDB):大纲和患者匹配技术
日本国民健康保险理赔和特定健康检查数据库(NDB)是日本国民健康保险制度下的综合健康保险理赔数据数据库。新开发银行使用两种类型的个人识别变量(在数据库中称为“ID1”和“ID2”)将个别患者的保险索赔联系起来。但是,针对这些ID变量输入的信息很容易因以下几个原因而更改,例如当索赔人找到或更改工作时,或者由于其姓名拼写的变化。在本研究中,我们开发了一种新的患者匹配技术,改进了现有的使用ID1和ID2变量的系统。我们还试图验证我们提出的新的个人ID变量(ID0),以提高NDB数据库中患者匹配的效率。方法本研究针对2013年4月至2016年3月期间住院、联合诊断程序、门诊治疗和处方药物分配的医疗保险索赔数据。我们基于ID1和ID2变量以及治疗日期和临床结果变量开发了一种新的患者匹配算法。然后,我们试图通过将患者匹配识别的患者数量与当前ID1变量和我们提出的ID0变量与截至2015年10月1日的估计患者人群进行比较,来验证我们的算法。结果各性别、各年龄组用ID0变量识别的患者人数低于用ID1变量识别的患者人数。通过使用ID0变量,我们能够将男性和女性患者的重复记录数量分别减少5.8%和6.4%。使用ID1患者匹配变量确定的儿童、75岁以上的成年人和育龄妇女的数量都高于相应的估计值。相反,这些患者的ID0患者匹配变量的数量都在相应的估计范围内。我们的研究结果表明,与现有的ID1变量相比,提出的ID0变量提供了更精确的患者匹配结果。ID0变量是目前NDB数据库中患者匹配的最佳技术。因此,未来的患者群体估计应该依赖于ID0变量而不是ID1变量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Audiovisual cues must be predictable and win-paired to drive risky choice. High-resolution promoter interaction analysis implicates genes involved in the activation of Type 3 Innate Lymphoid Cells in autoimmune disease risk. Deriving genetic codes for molecular phenotypes from first principles. High frequency spike inference with particle Gibbs sampling. Spontaneous replication fork collapse regulates telomere length homeostasis in wild type yeast.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1