Back-off Improvement By Using Q-learning in IEEE 802.11p Vehicular Network

Dong-jin Lee, Yafeng Deng, Young-June Choi

Published: 2020, Last Modified: 27 Jul 2025, ICTC 2020, CC BY-SA 4.0

Abstract: Vehicular ad-hoc networks (VANETs) support wireless communication among moving vehicles, infrastructure, and other devices. In VANETs, sharing a single channel is a complex problem: unless resource information is consistent across vehicles, resource allocation suffers more packet collisions. Resource allocation among vehicles must therefore be optimized to use the available wireless bandwidth efficiently and to configure VANETs successfully. For efficient resource allocation, we apply Q-learning, which accommodates many vehicles in a network and makes data exchange among them more efficient. The policy for choosing the contention window size is learned, where a hybrid of linear and exponential contention window adjustment is considered. Vehicles learn by maximizing successful data packet transmissions and minimizing bandwidth waste. Furthermore, the proposed algorithm performs better than existing back-off algorithms.
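The idea of learning a contention window policy with hybrid linear and exponential adjustment can be illustrated with a minimal tabular Q-learning sketch. This is not the authors' implementation: the abstract does not specify the state, action, or reward design, so everything below is an assumption made for illustration. The state is taken to be the current contention window, the five actions (`keep`, `lin_up`, `lin_down`, `exp_up`, `exp_down`), the window bounds, the linear step, the hyperparameters, and the toy collision model in `toy_reward` are all hypothetical.

```python
import random
from collections import defaultdict

# Illustrative bounds and step size (assumptions, not taken from the paper).
CW_MIN, CW_MAX = 15, 1023
LINEAR_STEP = 16

# Hybrid action set: linear moves for fine tuning, exponential moves for fast reaction.
ACTIONS = ("keep", "lin_up", "lin_down", "exp_up", "exp_down")


def apply_action(cw, action):
    """Return the new contention window after a linear or exponential adjustment."""
    if action == "lin_up":
        cw += LINEAR_STEP
    elif action == "lin_down":
        cw -= LINEAR_STEP
    elif action == "exp_up":
        cw = 2 * (cw + 1) - 1      # e.g. 15 -> 31 -> 63
    elif action == "exp_down":
        cw = (cw + 1) // 2 - 1     # e.g. 63 -> 31 -> 15
    return max(CW_MIN, min(CW_MAX, cw))


class CwAgent:
    """Tabular Q-learning agent whose state is its current contention window."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.q = defaultdict(float)  # maps (cw, action) to an estimated value
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.rng = random.Random(seed)
        self.cw = CW_MIN

    def choose(self):
        # Epsilon-greedy: explore occasionally, otherwise take the best known action.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: self.q[(self.cw, a)])

    def update(self, action, reward, next_cw):
        # Standard Q-learning update toward reward + gamma * max_a' Q(s', a').
        best_next = max(self.q[(next_cw, a)] for a in ACTIONS)
        key = (self.cw, action)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])
        self.cw = next_cw


def toy_reward(cw, n_contenders, rng):
    """Hypothetical reward: a collision costs -1; a success earns more for smaller windows."""
    # Crude assumption: every other station picks the same slot with probability 1/(cw+1).
    p_success = (1.0 - 1.0 / (cw + 1)) ** (n_contenders - 1)
    if rng.random() < p_success:
        return 1.0 - cw / CW_MAX   # large windows waste idle slots
    return -1.0


def train(steps=5000, n_contenders=20, seed=0):
    """Run one agent against the toy channel model and return it."""
    agent = CwAgent(seed=seed)
    env_rng = random.Random(seed + 1)
    for _ in range(steps):
        action = agent.choose()
        next_cw = apply_action(agent.cw, action)
        reward = toy_reward(next_cw, n_contenders, env_rng)
        agent.update(action, reward, next_cw)
    return agent
```

Calling `train(n_contenders=20)` returns an agent whose `cw` attribute holds the window it ended on and whose `q` table holds the learned values. The reward shape reflects the two goals stated in the abstract (successful transmissions and little wasted bandwidth), but the specific formula here is only one plausible choice.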