Stateless reinforcement learning for multi-agent systems: The case of spectrum allocation in dynamic channel bonding WLANs

Sergio Barrachina-Muñoz, Alessandro Chiumento, Boris Bellalta

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

6 Citations (Scopus)
30 Downloads (Pure)

Abstract

Spectrum allocation in the form of primary channel and bandwidth selection is a key factor for dynamic channel bonding (DCB) wireless local area networks (WLANs). To cope with varying environments, where networks change their configurations on their own, the wireless community is looking towards solutions aided by machine learning (ML), and especially reinforcement learning (RL) given its trial-and-error approach. However, strong assumptions are normally made to let complex RL models converge to near-optimal solutions. Our goal with this paper is two-fold: justify in a comprehensible way why RL should be the approach for wireless networks problems like decentralized spectrum allocation, and call into question whether the use of complex RL algorithms helps the quest of rapid learning in realistic scenarios. We derive that stateless RL in the form of lightweight multi-armed-bandits (MABs) is an efficient solution for rapid adaptation avoiding the definition of extensive or meaningless RL states.

Original languageEnglish
Title of host publicationProceedings - 12th Wireless Days Conference, WD 2021
PublisherIEEE
ISBN (Electronic)9781665425599
DOIs
Publication statusPublished - 10 Aug 2021
Event12th Wireless Days Conference, WD 2021 - Paris, France, Virtual Conference, France
Duration: 30 Jun 20212 Jul 2021
Conference number: 12

Publication series

NameIFIP Wireless Days
Volume2021-June
ISSN (Print)2156-9711
ISSN (Electronic)2156-972X

Conference

Conference12th Wireless Days Conference, WD 2021
Abbreviated titleWD 2021
Country/TerritoryFrance
CityVirtual Conference
Period30/06/212/07/21

Keywords

  • 2024 OA procedure

Fingerprint

Dive into the research topics of 'Stateless reinforcement learning for multi-agent systems: The case of spectrum allocation in dynamic channel bonding WLANs'. Together they form a unique fingerprint.

Cite this