Interpretable Machine Learning for Self-Service High-Risk Decision-Making

Charles Recaido, B. Kovalerchuk
{"title":"Interpretable Machine Learning for Self-Service High-Risk Decision-Making","authors":"Charles Recaido, B. Kovalerchuk","doi":"10.1109/IV56949.2022.00061","DOIUrl":null,"url":null,"abstract":"This paper contributes to interpretable machine learning via visual knowledge discovery in general line coordinates (GLC). The concepts of hyperblocks as interpretable dataset units and general line coordinates are combined to create a visual self-service machine learning model. Dynamic Scaffolding Coordinates as lossless multidimensional coordinate systems are proposed, and their applications as visual models is shown. DSC1 and DSC2 can map multiple dataset attributes to a single two-dimensional (X, Y) Cartesian plane using a graph construction algorithm. The hyperblock analysis was used to determine visually appealing dataset attribute orders and to reduce line occlusion. It is shown that hyperblocks can generalize decision tree rules and a series of DSC1 or DSC2 plots can visualize a decision tree. The DSC1 and DSC2 plots were tested on benchmark datasets from the UCI ML repository. They allowed for visual classification of data. Additionally, areas of hyperblock impurity were discovered and used to establish dataset splits that highlight the upper estimate of worst-case model accuracy to guide model selection for high-risk decision-making. Major benefits of DSC1 and DSC2 is their highly interpretable nature. They allow domain experts to control or establish new machine learning models through visual pattern discovery.","PeriodicalId":153161,"journal":{"name":"2022 26th International Conference Information Visualisation (IV)","volume":"101 9","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference Information Visualisation (IV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV56949.2022.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This paper contributes to interpretable machine learning via visual knowledge discovery in general line coordinates (GLC). The concepts of hyperblocks as interpretable dataset units and general line coordinates are combined to create a visual self-service machine learning model. Dynamic Scaffolding Coordinates as lossless multidimensional coordinate systems are proposed, and their applications as visual models is shown. DSC1 and DSC2 can map multiple dataset attributes to a single two-dimensional (X, Y) Cartesian plane using a graph construction algorithm. The hyperblock analysis was used to determine visually appealing dataset attribute orders and to reduce line occlusion. It is shown that hyperblocks can generalize decision tree rules and a series of DSC1 or DSC2 plots can visualize a decision tree. The DSC1 and DSC2 plots were tested on benchmark datasets from the UCI ML repository. They allowed for visual classification of data. Additionally, areas of hyperblock impurity were discovered and used to establish dataset splits that highlight the upper estimate of worst-case model accuracy to guide model selection for high-risk decision-making. Major benefits of DSC1 and DSC2 is their highly interpretable nature. They allow domain experts to control or establish new machine learning models through visual pattern discovery.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自助式高风险决策的可解释机器学习
本文通过在一般直线坐标(GLC)中的视觉知识发现为可解释的机器学习做出了贡献。超块作为可解释数据集单元的概念和一般的直线坐标相结合,创建了一个可视化的自助机器学习模型。提出了动态脚手架坐标系作为无损多维坐标系,并给出了动态脚手架坐标系作为可视化模型的应用。DSC1和DSC2可以使用图构建算法将多个数据集属性映射到单个二维(X, Y)笛卡尔平面上。使用超块分析来确定视觉上吸引人的数据集属性顺序并减少线遮挡。结果表明,超块可以泛化决策树规则,一系列DSC1或DSC2图可以可视化决策树。DSC1和DSC2图在来自UCI ML存储库的基准数据集上进行测试。它们允许对数据进行可视化分类。此外,超块杂质区域被发现并用于建立数据集分割,突出最坏情况模型精度的上限估计,以指导高风险决策的模型选择。DSC1和DSC2的主要优点是它们的高度可解释性。它们允许领域专家通过视觉模式发现来控制或建立新的机器学习模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Phrase Features in Essay Report Sentences for Developing Critical Thinking Ability in a Fully Online Course Preoperative Image Segmentation for Organ Visualization Using Augmented Reality Technology During Open Liver Surgery Data. Information and Knowledge Visualization for Frequent Patterns VRGrid: Efficient Transformation of 2D Data into Pixel Grid Layout Augmenting the Reality of Situated Visualization
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1