Feasibility Study of Federated Learning on the Distributed Research Network of OMOP Common Data Model.

IF 2.1 Q3 MEDICAL INFORMATICS Healthcare Informatics Research Pub Date : 2023-04-01 DOI:10.4258/hir.2023.29.2.168

Geun Hyeong Lee, Jonggul Park, Jihyeong Kim, Yeesuk Kim, Byungjin Choi, Rae Woong Park, Sang Youl Rhee, Soo-Yong Shin

{"title":"Feasibility Study of Federated Learning on the Distributed Research Network of OMOP Common Data Model.","authors":"Geun Hyeong Lee, Jonggul Park, Jihyeong Kim, Yeesuk Kim, Byungjin Choi, Rae Woong Park, Sang Youl Rhee, Soo-Yong Shin","doi":"10.4258/hir.2023.29.2.168","DOIUrl":null,"url":null,"abstract":"Objectives: Since protecting patients' privacy is a major concern in clinical research, there has been a growing need for privacy-preserving data analysis platforms. For this purpose, a federated learning (FL) method based on the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) was implemented, and its feasibility was demonstrated.Methods: We implemented an FL platform on FeederNet, which is a distributed clinical data analysis platform based on the OMOP CDM in Korea. We trained it through an artificial neural network (ANN) using data from patients who received steroid prescriptions or injections, with the aim of predicting the occurrence of side effects depending on the prescribed dose. The ANN was trained using the FL platform with the OMOP CDMs of Kyung Hee University Medical Center (KHMC) and Ajou University Hospital (AUH).Results: The area under the receiver operating characteristic curves (AUROCs) for predicting bone fracture, osteonecrosis, and osteoporosis using only data from each hospital were 0.8426, 0.6920, and 0.7727 for KHMC and 0.7891, 0.7049, and 0.7544 for AUH, respectively. In contrast, when using FL, the corresponding AUROCs were 0.8260, 0.7001, and 0.7928 for KHMC and 0.7912, 0.8076, and 0.7441 for AUH, respectively. In particular, FL led to a 14% improvement in performance for osteonecrosis at AUH.Conclusions: FL can be performed with the OMOP CDM, and FL often shows better performance than using only a single institution's data. Therefore, research using OMOP CDM has been expanded from statistical analysis to machine learning so that researchers can conduct more diverse research.","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"29 2","pages":"168-173"},"PeriodicalIF":2.1000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/d8/e2/hir-2023-29-2-168.PMC10209729.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2023.29.2.168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: Since protecting patients' privacy is a major concern in clinical research, there has been a growing need for privacy-preserving data analysis platforms. For this purpose, a federated learning (FL) method based on the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) was implemented, and its feasibility was demonstrated.

Methods: We implemented an FL platform on FeederNet, which is a distributed clinical data analysis platform based on the OMOP CDM in Korea. We trained it through an artificial neural network (ANN) using data from patients who received steroid prescriptions or injections, with the aim of predicting the occurrence of side effects depending on the prescribed dose. The ANN was trained using the FL platform with the OMOP CDMs of Kyung Hee University Medical Center (KHMC) and Ajou University Hospital (AUH).

Results: The area under the receiver operating characteristic curves (AUROCs) for predicting bone fracture, osteonecrosis, and osteoporosis using only data from each hospital were 0.8426, 0.6920, and 0.7727 for KHMC and 0.7891, 0.7049, and 0.7544 for AUH, respectively. In contrast, when using FL, the corresponding AUROCs were 0.8260, 0.7001, and 0.7928 for KHMC and 0.7912, 0.8076, and 0.7441 for AUH, respectively. In particular, FL led to a 14% improvement in performance for osteonecrosis at AUH.

Conclusions: FL can be performed with the OMOP CDM, and FL often shows better performance than using only a single institution's data. Therefore, research using OMOP CDM has been expanded from statistical analysis to machine learning so that researchers can conduct more diverse research.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

OMOP公共数据模型分布式研究网络上联邦学习的可行性研究。

目的:由于保护患者隐私是临床研究中的一个主要问题，因此对保护隐私的数据分析平台的需求日益增长。为此，实现了一种基于OMOP公共数据模型(CDM)的联邦学习(FL)方法，并对其可行性进行了论证。方法:在韩国基于OMOP CDM的分布式临床数据分析平台federnet上实现FL平台。我们通过人工神经网络(ANN)训练它，使用接受类固醇处方或注射的患者的数据，目的是根据处方剂量预测副作用的发生。人工神经网络使用FL平台与庆熙大学医学中心(KHMC)和亚洲大学医院(AUH)的OMOP CDMs进行训练。结果:仅使用各医院数据预测骨折、骨坏死和骨质疏松的受试者工作特征曲线下面积(auroc) KHMC分别为0.8426、0.6920和0.7727,AUH分别为0.7891、0.7049和0.7544。而使用FL时，KHMC的auroc分别为0.8260、0.7001和0.7928,AUH的auroc分别为0.7912、0.8076和0.7441。特别是，FL使AUH骨坏死的治疗效果提高了14%。结论:使用OMOP CDM可以进行FL，并且FL通常比使用单一机构的数据表现更好。因此，使用OMOP CDM的研究已经从统计分析扩展到机器学习，使研究人员可以进行更多样化的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Healthcare Informatics Research MEDICAL INFORMATICS-

CiteScore

4.90

自引率

6.90%

发文量