Abstract
Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success. However, in many cases, the complexity of the learning objectives goes beyond the capabilities of the Markovian assumption, necessitating a more sophisticated reward mechanism. Reward machines and ω-regular languages are two formalisms used to express non-Markovian rewards for quantitative and qualitative objectives, respectively. This paper introduces ω-regular reward machines, which integrate reward machines with ω-regular languages to enable an expressive and effective reward mechanism for RL. We present a model-free RL algorithm to compute ϵ-optimal strategies against ω-regular reward machines and evaluate the effectiveness of the proposed algorithm through experiments.
Original language | English |
---|---|
Title of host publication | ECAI 2023 - 26th European Conference on Artificial Intelligence, including 12th Conference on Prestigious Applications of Intelligent Systems, PAIS 2023 - Proceedings |
Editors | Kobi Gal, Kobi Gal, Ann Nowe, Grzegorz J. Nalepa, Roy Fairstein, Roxana Radulescu |
Publisher | IOS |
Pages | 972-979 |
Number of pages | 8 |
ISBN (Electronic) | 9781643684369 |
DOIs | |
Publication status | Published - 28 Sept 2023 |
Event | 26th European Conference on Artificial Intelligence, ECAI 2023 - Kraków, Poland Duration: 30 Sept 2023 → 4 Oct 2023 Conference number: 26 |
Publication series
Name | Frontiers in Artificial Intelligence and Applications |
---|---|
Volume | 372 |
ISSN (Print) | 0922-6389 |
ISSN (Electronic) | 1879-8314 |
Conference
Conference | 26th European Conference on Artificial Intelligence, ECAI 2023 |
---|---|
Abbreviated title | ECAI |
Country/Territory | Poland |
City | Kraków |
Period | 30/09/23 → 4/10/23 |