| IEEE Access | |
| Non-Stationary Bandit Strategy for Rate Adaptation With Delayed Feedback | |
| Yanliang Jin1  Yapeng Zhao1  Kai Kang2  Hua Qian2  | |
| [1] School of Communication and Information Engineering, Shanghai University, Shanghai, China;Shanghai Advanced Research Institute, China Academy of Sciences, Shanghai, China; | |
| 关键词: Throughput; rate adaptation; wireless communication; time-division duplex; multi-armed bandit; | |
| DOI : 10.1109/ACCESS.2020.2988671 | |
| 来源: DOAJ | |
【 摘 要 】
Rate adaptation is an efficient mechanism to utilize the channel capacity by adjusting the modulation and coding scheme in a dynamic wireless environment. The channel feedback, such as acknowledgment/negative acknowledgment (ACK/NACK) messages or the channel measurement such as received signal strength indicator (RSSI) can be applied to the rate adaptation. Existing rate adaptation algorithms are mainly driven by heuristics. They can not achieve satisfactory transmission rates in the time-varying environment. In this paper, we focus on the rate adaptation problem in a time-division duplex (TDD) system. A multi-armed bandit (MAB) strategy is applied to learn the changes of the channel condition from both RSSI and ACK/NACK signals. A discounted upper confidence bound based rate adaptation (DUCB-RA) algorithm is proposed. We show that the performance of the proposed algorithm is converged to the optimal with mathematical proofs. Simulation results demonstrate that the proposed algorithm can adapt to the time-varying channel and achieve better transmission throughput compared to existing rate adaptation algorithms.
【 授权许可】
Unknown