Omega-Regular Reward Machines

Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi*, Dominik Wojtczak

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)
14 Downloads (Pure)

Abstract

Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success. However, in many cases, the complexity of the learning objectives goes beyond the capabilities of the Markovian assumption, necessitating a more sophisticated reward mechanism. Reward machines and ω-regular languages are two formalisms used to express non-Markovian rewards for quantitative and qualitative objectives, respectively. This paper introduces ω-regular reward machines, which integrate reward machines with ω-regular languages to enable an expressive and effective reward mechanism for RL. We present a model-free RL algorithm to compute ϵ-optimal strategies against ω-regular reward machines and evaluate the effectiveness of the proposed algorithm through experiments.

Original languageEnglish
Title of host publicationECAI 2023 - 26th European Conference on Artificial Intelligence, including 12th Conference on Prestigious Applications of Intelligent Systems, PAIS 2023 - Proceedings
EditorsKobi Gal, Kobi Gal, Ann Nowe, Grzegorz J. Nalepa, Roy Fairstein, Roxana Radulescu
PublisherIOS
Pages972-979
Number of pages8
ISBN (Electronic)9781643684369
DOIs
Publication statusPublished - 28 Sept 2023
Event26th European Conference on Artificial Intelligence, ECAI 2023 - Kraków, Poland
Duration: 30 Sept 20234 Oct 2023
Conference number: 26

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume372
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference26th European Conference on Artificial Intelligence, ECAI 2023
Abbreviated titleECAI
Country/TerritoryPoland
CityKraków
Period30/09/234/10/23

Fingerprint

Dive into the research topics of 'Omega-Regular Reward Machines'. Together they form a unique fingerprint.

Cite this