Renjith R J, UmaMaheswari M, N. N, Velmurugan P G S
{"title":"空间调制MIMO系统能量收集的强化学习","authors":"Renjith R J, UmaMaheswari M, N. N, Velmurugan P G S","doi":"10.1109/ICIICT1.2019.8741446","DOIUrl":null,"url":null,"abstract":"To enhance the lifetime of a power constrained relay network, Energy Harvesting (EH) is a promising solution. This paper introduces Reinforcement Learning (RL) algorithm for high data rate bidirectional relay network with the merits of Spatial Modulation (SM) and Decode and Forward (DF) relay protocol. As the relay network is energy constrained, the required power for forwarding the data is harvested from the received radio frequency signals. Based on real-time scenarios, it is assumed that relay has knowledge only about the past and the current state of the EH process. The proposed system is modelled as Markov Decision Process (MDP) and power allocation policy is formulated using RL algorithm. This policy maximizes the overall throughput of the system and reduces the outage. Further, the concept of linear function approximation is proposed to handle the real-time scenarios.","PeriodicalId":118897,"journal":{"name":"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning for Energy Harvesting in Spatial Modulated MIMO Systems\",\"authors\":\"Renjith R J, UmaMaheswari M, N. N, Velmurugan P G S\",\"doi\":\"10.1109/ICIICT1.2019.8741446\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To enhance the lifetime of a power constrained relay network, Energy Harvesting (EH) is a promising solution. This paper introduces Reinforcement Learning (RL) algorithm for high data rate bidirectional relay network with the merits of Spatial Modulation (SM) and Decode and Forward (DF) relay protocol. As the relay network is energy constrained, the required power for forwarding the data is harvested from the received radio frequency signals. Based on real-time scenarios, it is assumed that relay has knowledge only about the past and the current state of the EH process. The proposed system is modelled as Markov Decision Process (MDP) and power allocation policy is formulated using RL algorithm. This policy maximizes the overall throughput of the system and reduces the outage. Further, the concept of linear function approximation is proposed to handle the real-time scenarios.\",\"PeriodicalId\":118897,\"journal\":{\"name\":\"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIICT1.2019.8741446\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIICT1.2019.8741446","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning for Energy Harvesting in Spatial Modulated MIMO Systems
To enhance the lifetime of a power constrained relay network, Energy Harvesting (EH) is a promising solution. This paper introduces Reinforcement Learning (RL) algorithm for high data rate bidirectional relay network with the merits of Spatial Modulation (SM) and Decode and Forward (DF) relay protocol. As the relay network is energy constrained, the required power for forwarding the data is harvested from the received radio frequency signals. Based on real-time scenarios, it is assumed that relay has knowledge only about the past and the current state of the EH process. The proposed system is modelled as Markov Decision Process (MDP) and power allocation policy is formulated using RL algorithm. This policy maximizes the overall throughput of the system and reduces the outage. Further, the concept of linear function approximation is proposed to handle the real-time scenarios.