Sirwe Saeedi, A. Fong, Ajay K. Gupta, Steve M. Carr
{"title":"Improving Healthcare Outcomes with Learning Models for Machine Commonsense Reasoning Systems","authors":"Sirwe Saeedi, A. Fong, Ajay K. Gupta, Steve M. Carr","doi":"10.1109/RASSE54974.2022.10019733","DOIUrl":null,"url":null,"abstract":"Machine commonsense reasoning (MCR) systems can significantly improve the way we interact with machines. MCR systems are therefore an important element in any human-centric applications. Recent advances in machine learning (ML) have enabled breakthroughs in MCR technologies. This paper aims to improve healthcare outcomes by making human-machine interactions more intuitive than before. It presents learning models developed for MCR. Specifically, it presents a critical analysis of state-of-the-art deep learning (DL) models for MCR. These include recurrent neural network (RNN), transfer learning (TL), and transformers. Transformers, in particular, have been found to be effective for a range of natural language processing (NLP) applications, including MCR. Based on the analysis, another contribution of this paper is to assemble useful MCR tools into an adaptable MCR toolbox. To ensure broad applicability, the toolbox can be customizable for different MCR applications. Our research focuses on two specific MCR applications: commonsense validation and commonsense explanation. The former concerns identifying statements that do not make commonsense. The latter aims at explaining the reason why a given statement does not make commonsense. The paper presents some preliminary results of applying elements of the assembled toolbox to the two MCR applications. These results indicate that it is possible to achieve near human performances using finely-tuned state-of-the-art DL methods for the two MCR applications.","PeriodicalId":382440,"journal":{"name":"2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Recent Advances in Systems Science and Engineering (RASSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RASSE54974.2022.10019733","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine commonsense reasoning (MCR) systems can significantly improve the way we interact with machines. MCR systems are therefore an important element in any human-centric applications. Recent advances in machine learning (ML) have enabled breakthroughs in MCR technologies. This paper aims to improve healthcare outcomes by making human-machine interactions more intuitive than before. It presents learning models developed for MCR. Specifically, it presents a critical analysis of state-of-the-art deep learning (DL) models for MCR. These include recurrent neural network (RNN), transfer learning (TL), and transformers. Transformers, in particular, have been found to be effective for a range of natural language processing (NLP) applications, including MCR. Based on the analysis, another contribution of this paper is to assemble useful MCR tools into an adaptable MCR toolbox. To ensure broad applicability, the toolbox can be customizable for different MCR applications. Our research focuses on two specific MCR applications: commonsense validation and commonsense explanation. The former concerns identifying statements that do not make commonsense. The latter aims at explaining the reason why a given statement does not make commonsense. The paper presents some preliminary results of applying elements of the assembled toolbox to the two MCR applications. These results indicate that it is possible to achieve near human performances using finely-tuned state-of-the-art DL methods for the two MCR applications.