{"title":"基于马尔可夫决策过程的人工智能购物辅助机器人","authors":"Rida Gillani, Ali Nasir","doi":"10.1109/INTELSE.2016.7475168","DOIUrl":null,"url":null,"abstract":"There are many challenges involved in the realization of a shopping assistance robot (SAR). The specific challenge addressed in this paper is that of incorporating artificial intelligence or decision making capability in such robot. Markov Decision Process (MDP) based formulation of the problem has been presented for this purpose. The major advantage of the MDP based approach over simple search based artificial intelligence techniques is that it can incorporate uncertainty. The proposed MDP model has been solved for optimal policy using value iteration algorithm. Furthermore, it has been shown how the reward function influences the structure of the resulting policy. The results show encouraging potential in the use of MDP based formulation for SAR.","PeriodicalId":127671,"journal":{"name":"2016 International Conference on Intelligent Systems Engineering (ICISE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Incorporating artificial intelligence in shopping assistance robot using Markov Decision Process\",\"authors\":\"Rida Gillani, Ali Nasir\",\"doi\":\"10.1109/INTELSE.2016.7475168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are many challenges involved in the realization of a shopping assistance robot (SAR). The specific challenge addressed in this paper is that of incorporating artificial intelligence or decision making capability in such robot. Markov Decision Process (MDP) based formulation of the problem has been presented for this purpose. The major advantage of the MDP based approach over simple search based artificial intelligence techniques is that it can incorporate uncertainty. The proposed MDP model has been solved for optimal policy using value iteration algorithm. Furthermore, it has been shown how the reward function influences the structure of the resulting policy. The results show encouraging potential in the use of MDP based formulation for SAR.\",\"PeriodicalId\":127671,\"journal\":{\"name\":\"2016 International Conference on Intelligent Systems Engineering (ICISE)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Intelligent Systems Engineering (ICISE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INTELSE.2016.7475168\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Intelligent Systems Engineering (ICISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INTELSE.2016.7475168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Incorporating artificial intelligence in shopping assistance robot using Markov Decision Process
There are many challenges involved in the realization of a shopping assistance robot (SAR). The specific challenge addressed in this paper is that of incorporating artificial intelligence or decision making capability in such robot. Markov Decision Process (MDP) based formulation of the problem has been presented for this purpose. The major advantage of the MDP based approach over simple search based artificial intelligence techniques is that it can incorporate uncertainty. The proposed MDP model has been solved for optimal policy using value iteration algorithm. Furthermore, it has been shown how the reward function influences the structure of the resulting policy. The results show encouraging potential in the use of MDP based formulation for SAR.