Maryam Karimi, M. Dehghan, Seyyed Majid Nourhoseini
{"title":"动态优化延迟敏感无线传输的系统框架","authors":"Maryam Karimi, M. Dehghan, Seyyed Majid Nourhoseini","doi":"10.1109/AISP.2015.7123489","DOIUrl":null,"url":null,"abstract":"Delay sensitive applications need to overcome the service problems in dynamic environments with respect to both the multimedia source data (e.g., variable bit-rate) and the wireless channels (e.g., fading channel). This paper considers the problem of point to point transmission of scalable video coding over a fading channel. We formulate the rate adaptation challenge of WLAN multimedia networks as a Markov Decision Process and resolve this problem online based on reinforcement learning. The buffer state, channel state, and video state were considered as a joint state of system to maximize the average Quality of Service under delay constraints. To improve the convergence speed of learning, system's underlying dynamics were partitioned into a priori known and a priori unknown components. The proposed learning algorithm exploits known information about the system, so that less information needs to be learned compared with that in conventional reinforcement learning algorithms.","PeriodicalId":405857,"journal":{"name":"2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A systematic framework for dynamically optimizing delay-sensitive wireless transmission\",\"authors\":\"Maryam Karimi, M. Dehghan, Seyyed Majid Nourhoseini\",\"doi\":\"10.1109/AISP.2015.7123489\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Delay sensitive applications need to overcome the service problems in dynamic environments with respect to both the multimedia source data (e.g., variable bit-rate) and the wireless channels (e.g., fading channel). This paper considers the problem of point to point transmission of scalable video coding over a fading channel. We formulate the rate adaptation challenge of WLAN multimedia networks as a Markov Decision Process and resolve this problem online based on reinforcement learning. The buffer state, channel state, and video state were considered as a joint state of system to maximize the average Quality of Service under delay constraints. To improve the convergence speed of learning, system's underlying dynamics were partitioned into a priori known and a priori unknown components. The proposed learning algorithm exploits known information about the system, so that less information needs to be learned compared with that in conventional reinforcement learning algorithms.\",\"PeriodicalId\":405857,\"journal\":{\"name\":\"2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AISP.2015.7123489\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP.2015.7123489","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A systematic framework for dynamically optimizing delay-sensitive wireless transmission
Delay sensitive applications need to overcome the service problems in dynamic environments with respect to both the multimedia source data (e.g., variable bit-rate) and the wireless channels (e.g., fading channel). This paper considers the problem of point to point transmission of scalable video coding over a fading channel. We formulate the rate adaptation challenge of WLAN multimedia networks as a Markov Decision Process and resolve this problem online based on reinforcement learning. The buffer state, channel state, and video state were considered as a joint state of system to maximize the average Quality of Service under delay constraints. To improve the convergence speed of learning, system's underlying dynamics were partitioned into a priori known and a priori unknown components. The proposed learning algorithm exploits known information about the system, so that less information needs to be learned compared with that in conventional reinforcement learning algorithms.