Khalid Mahmood, Varun Sathyan, H. Kanaan, G. Malik, Hafiz Malik
{"title":"Autonomous Decentralized Privacy-Enabled Data Preparation Architecture for Multicenter Clinical Observational Research","authors":"Khalid Mahmood, Varun Sathyan, H. Kanaan, G. Malik, Hafiz Malik","doi":"10.1109/ISADS.2017.38","DOIUrl":null,"url":null,"abstract":"Tailoring treatment and clinical decision making to a person's unique characteristics is the next milestone for healthcare informatics, but for it to be accomplished, big data analytics for identifying risk factors and other hidden patterns among patients become paramount. In future these analytics will take the form of multicenter observational research, for which data preparation is vital. Specifically, quality data must be obtained in a timely manner while protecting the privacy of patients in the health records shared among researchers. Furthermore, the coordination and cooperation of a fluctuating number of medical data sources containing these records for clinical data distribution is an additional requirement in multicenter studies. Thus, we propose an autonomous decentralized, privacy-enabled data preparation architecture and novel SEDTM algorithm to meet these requirements, censuring sensitive information via filtration, and extracting relevant clinical data with a fully automated approach. Our evaluation demonstrates a 40% - 60% increase in the retrieval of quality patient data, compared to traditional semantic similarity, for our proposed SEDTM algorithm.","PeriodicalId":303882,"journal":{"name":"2017 IEEE 13th International Symposium on Autonomous Decentralized System (ISADS)","volume":"126 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 13th International Symposium on Autonomous Decentralized System (ISADS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISADS.2017.38","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Tailoring treatment and clinical decision making to a person's unique characteristics is the next milestone for healthcare informatics, but for it to be accomplished, big data analytics for identifying risk factors and other hidden patterns among patients become paramount. In future these analytics will take the form of multicenter observational research, for which data preparation is vital. Specifically, quality data must be obtained in a timely manner while protecting the privacy of patients in the health records shared among researchers. Furthermore, the coordination and cooperation of a fluctuating number of medical data sources containing these records for clinical data distribution is an additional requirement in multicenter studies. Thus, we propose an autonomous decentralized, privacy-enabled data preparation architecture and novel SEDTM algorithm to meet these requirements, censuring sensitive information via filtration, and extracting relevant clinical data with a fully automated approach. Our evaluation demonstrates a 40% - 60% increase in the retrieval of quality patient data, compared to traditional semantic similarity, for our proposed SEDTM algorithm.