D. Dicostanzo, A. Ayan, S. Jhawar, T. Allen, E. Patterson
{"title":"Machine Learning Data Pipeline for the Democratization of AI","authors":"D. Dicostanzo, A. Ayan, S. Jhawar, T. Allen, E. Patterson","doi":"10.1177/2327857923121029","DOIUrl":null,"url":null,"abstract":"The use of artificial intelligence continues to increase. In healthcare, there has been a recent increase in AI applications to real-time individual patient clinical care, as opposed to population-based research or quality improvement efforts. However, the expertise to evaluate and implement these solutions is limited and often congregates in academic medical centers, creating barriers to adoption for smaller community and rural centers. Lowering the barrier to entry for innovative tools can help address disparities in patient outcomes due to access and other urban/rural contributors. We describe a strategy for evaluating commercially available machine learning models to disseminate lessons learned from developing, validating, and implementing machine learning-based models in clinical care in radiation therapy. In addition, we share an end-to-end data pipeline as open-source code with the tools necessary to identify, extract, organize, and process the data for use in machine-learning applications. We illustrate the application of this data pipeline to the use of brachytherapy to treat female cervical cancer patients. The example will show how we used the proposed pipeline to extract 708 potential participants and applied the developed methods and visualizations to clean the data providing 144 study participants for inclusion in our study. Finally, we discuss the anticipated challenges in implementing machine learning models in commercially available FDA-approved devices and suggest solutions using discrete tools built in different programming languages.","PeriodicalId":74550,"journal":{"name":"Proceedings of the International Symposium of Human Factors and Ergonomics in Healthcare. International Symposium of Human Factors and Ergonomics in Healthcare","volume":"12 1","pages":"120 - 124"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Symposium of Human Factors and Ergonomics in Healthcare. International Symposium of Human Factors and Ergonomics in Healthcare","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/2327857923121029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The use of artificial intelligence continues to increase. In healthcare, there has been a recent increase in AI applications to real-time individual patient clinical care, as opposed to population-based research or quality improvement efforts. However, the expertise to evaluate and implement these solutions is limited and often congregates in academic medical centers, creating barriers to adoption for smaller community and rural centers. Lowering the barrier to entry for innovative tools can help address disparities in patient outcomes due to access and other urban/rural contributors. We describe a strategy for evaluating commercially available machine learning models to disseminate lessons learned from developing, validating, and implementing machine learning-based models in clinical care in radiation therapy. In addition, we share an end-to-end data pipeline as open-source code with the tools necessary to identify, extract, organize, and process the data for use in machine-learning applications. We illustrate the application of this data pipeline to the use of brachytherapy to treat female cervical cancer patients. The example will show how we used the proposed pipeline to extract 708 potential participants and applied the developed methods and visualizations to clean the data providing 144 study participants for inclusion in our study. Finally, we discuss the anticipated challenges in implementing machine learning models in commercially available FDA-approved devices and suggest solutions using discrete tools built in different programming languages.