Marc Jermaine Pontiveros, Geoffrey A. Solano, C. Tee, M. Tee
{"title":"Explainable Machine Learning applied to Single-Nucleotide Polymorphisms for Systemic Lupus Erythematosus Prediction","authors":"Marc Jermaine Pontiveros, Geoffrey A. Solano, C. Tee, M. Tee","doi":"10.1109/IISA50023.2020.9284372","DOIUrl":null,"url":null,"abstract":"Systemic lupus erythematosus (SLE) is a type of autoimmune disease that affects multiple organ systems. The exact cause is unknown, but it is believed that predisposition to SLE is caused by multiple genetic factors. In this work we explored approaches to exploration and explanation of machine learning models for quantifying the risk of an individual to SLE using single nucleotide polymorphism (SNP) as features. Various model-agnostic explanation techniques were applied to further understand the factors that drive model predictions and allow comparison of the models. A web-based dashboard was developed to facilitate exploration and comparison of the models. The user can identify which features are important for predictions of each model, as well as to understand how a model comes up with a prediction for a given observation. The best performing model is the random forest model with AUC of 92.26% and AUCPR of 93.70g%.","PeriodicalId":109238,"journal":{"name":"2020 11th International Conference on Information, Intelligence, Systems and Applications (IISA","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th International Conference on Information, Intelligence, Systems and Applications (IISA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA50023.2020.9284372","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Systemic lupus erythematosus (SLE) is a type of autoimmune disease that affects multiple organ systems. The exact cause is unknown, but it is believed that predisposition to SLE is caused by multiple genetic factors. In this work we explored approaches to exploration and explanation of machine learning models for quantifying the risk of an individual to SLE using single nucleotide polymorphism (SNP) as features. Various model-agnostic explanation techniques were applied to further understand the factors that drive model predictions and allow comparison of the models. A web-based dashboard was developed to facilitate exploration and comparison of the models. The user can identify which features are important for predictions of each model, as well as to understand how a model comes up with a prediction for a given observation. The best performing model is the random forest model with AUC of 92.26% and AUCPR of 93.70g%.