{"title":"Personalized Treatment through Biosensors and Machine Learning ML","authors":"Anne Thomas Homescu","doi":"10.2139/ssrn.3610445","DOIUrl":null,"url":null,"abstract":"Precision medicine and personalized treatment can be effectively achieved through a combination of machine learning ML techniques and data gathered through specialized biosensors. \n \nHowever, the biggest challenge appears to be need of better data – in terms of both quality and quantity – on which to apply appropriate ML techniques. While ML can uncover deeper dependencies between the data, it requires enough good data to capture these dependencies. No matter how good the ML algorithms are, they cannot find something which is not in the training set. Hence, better (in terms of both quality and quantity) datasets are required to train algorithms. This, in turn, means that improved biosensors are needed to deliver such comprehensive, specialized data. \n \nThe first section of the report provides a comprehensive literature review of personalized medicine applications employing ML methods on sensor data, under categories of Mobile sensing and portable devices, Neuroimaging, Brain Machine Interface, Omics and electronic health records, and Biosensor systems. \n \nWhile the first section is focused on opportunities and successes described in the literature, the second section highlights the key challenges which need to be addressed in order to take full advantage of the benefits ML can offer in the areas of personalized medicine. They also underscore the need for biosensors which provide better quality and quantity of data on which ML may operate. \n \nThe third section describes select “best practices” for ML applications: data quality management and validation, data visualization, and privacy preserving data analysis. Very effective interactive visualization can be delivered through a Shiny framework and R packages, and we present a representative interactive visualizer using real data. \n \nLastly, the appendices provide additional information and references on data quality procedures, data visualization techniques, and topics in ML.","PeriodicalId":433297,"journal":{"name":"EngRN: Signal Processing (Topic)","volume":"18 5","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EngRN: Signal Processing (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3610445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Precision medicine and personalized treatment can be effectively achieved through a combination of machine learning ML techniques and data gathered through specialized biosensors.
However, the biggest challenge appears to be need of better data – in terms of both quality and quantity – on which to apply appropriate ML techniques. While ML can uncover deeper dependencies between the data, it requires enough good data to capture these dependencies. No matter how good the ML algorithms are, they cannot find something which is not in the training set. Hence, better (in terms of both quality and quantity) datasets are required to train algorithms. This, in turn, means that improved biosensors are needed to deliver such comprehensive, specialized data.
The first section of the report provides a comprehensive literature review of personalized medicine applications employing ML methods on sensor data, under categories of Mobile sensing and portable devices, Neuroimaging, Brain Machine Interface, Omics and electronic health records, and Biosensor systems.
While the first section is focused on opportunities and successes described in the literature, the second section highlights the key challenges which need to be addressed in order to take full advantage of the benefits ML can offer in the areas of personalized medicine. They also underscore the need for biosensors which provide better quality and quantity of data on which ML may operate.
The third section describes select “best practices” for ML applications: data quality management and validation, data visualization, and privacy preserving data analysis. Very effective interactive visualization can be delivered through a Shiny framework and R packages, and we present a representative interactive visualizer using real data.
Lastly, the appendices provide additional information and references on data quality procedures, data visualization techniques, and topics in ML.