{"title":"信任问题:可信赖人工智能可靠评估的新旧指标","authors":"A. Campagner, Riccardo Angius, F. Cabitza","doi":"10.5220/0011679600003414","DOIUrl":null,"url":null,"abstract":": This work contributes to the evaluation of the quality of decision support systems constructed with Machine Learning (ML) techniques in Medical Artificial Intelligence (MAI). In particular, we propose and discuss metrics that complement and go beyond traditional assessment practices based on the evaluation of accuracy, by focusing on two different dimensions related to the trustworthiness of a MAI system: reputation/ability, which relates to the accuracy or predictive ability of the system itself; and expertise/source reliability, which relates instead to the trustworthiness of the data which have been used to construct the MAI system. Then, we will discuss some previous, but so far mostly neglected, proposals as well novel metrics, visualizations and procedures for the sound evaluation of a MAI system’s trustworthiness, by focusing on six different concepts: advice accuracy, advice reliability, pragmatic utility, advice value, decision benefit and potential robustness. Finally, we will illustrate the application of the proposed concepts through two realistic medical case studies.","PeriodicalId":20676,"journal":{"name":"Proceedings of the International Conference on Health Informatics and Medical Application Technology","volume":"42 1","pages":"132-143"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Question of Trust: Old and New Metrics for the Reliable Assessment of Trustworthy AI\",\"authors\":\"A. Campagner, Riccardo Angius, F. Cabitza\",\"doi\":\"10.5220/0011679600003414\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": This work contributes to the evaluation of the quality of decision support systems constructed with Machine Learning (ML) techniques in Medical Artificial Intelligence (MAI). In particular, we propose and discuss metrics that complement and go beyond traditional assessment practices based on the evaluation of accuracy, by focusing on two different dimensions related to the trustworthiness of a MAI system: reputation/ability, which relates to the accuracy or predictive ability of the system itself; and expertise/source reliability, which relates instead to the trustworthiness of the data which have been used to construct the MAI system. Then, we will discuss some previous, but so far mostly neglected, proposals as well novel metrics, visualizations and procedures for the sound evaluation of a MAI system’s trustworthiness, by focusing on six different concepts: advice accuracy, advice reliability, pragmatic utility, advice value, decision benefit and potential robustness. Finally, we will illustrate the application of the proposed concepts through two realistic medical case studies.\",\"PeriodicalId\":20676,\"journal\":{\"name\":\"Proceedings of the International Conference on Health Informatics and Medical Application Technology\",\"volume\":\"42 1\",\"pages\":\"132-143\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Conference on Health Informatics and Medical Application Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5220/0011679600003414\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Health Informatics and Medical Application Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0011679600003414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Question of Trust: Old and New Metrics for the Reliable Assessment of Trustworthy AI
: This work contributes to the evaluation of the quality of decision support systems constructed with Machine Learning (ML) techniques in Medical Artificial Intelligence (MAI). In particular, we propose and discuss metrics that complement and go beyond traditional assessment practices based on the evaluation of accuracy, by focusing on two different dimensions related to the trustworthiness of a MAI system: reputation/ability, which relates to the accuracy or predictive ability of the system itself; and expertise/source reliability, which relates instead to the trustworthiness of the data which have been used to construct the MAI system. Then, we will discuss some previous, but so far mostly neglected, proposals as well novel metrics, visualizations and procedures for the sound evaluation of a MAI system’s trustworthiness, by focusing on six different concepts: advice accuracy, advice reliability, pragmatic utility, advice value, decision benefit and potential robustness. Finally, we will illustrate the application of the proposed concepts through two realistic medical case studies.