A Practical Guide to Multi-Objective Reinforcement Learning and Planning

Conor F. Hayes, Roxana Radulescu, Eugenio Bargiacchi, Johan Källström, Matthew Macfarlane, Mathieu Reymond, Timothy Verstraeten, Luisa Zintgraf, Richard Dazeley, Fredrik Heintz, Enda Howley, Athirai A. Irissappane, Patrick Mannion, Ann Nowe, Gabriel De Oliveira Ramos, Marcello Restelli, Peter Vamplew, Diederik M. Roijers

Research output: Contribution to journalArticlepeer-review

43 Citations (Scopus)
253 Downloads (Pure)

Abstract

Real-world sequential decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes only a single objective, or that multiple objectives can be adequately handled via a simple linear combination. Such approaches may oversimplify the underlying problem and hence produce suboptimal results. This paper serves as a guide to the application of multi-objective methods to difficult problems, and is aimed at researchers who are already familiar with single-objective reinforcement learning and planning methods who wish to adopt a multi-objective perspective on their research, as well as practitioners who encounter multi-objective decision problems in practice. It identifies the factors that may influence the nature of the desired solution, and illustrates by example how these influence the design of multi-objective decision-making systems for complex problems.
Original languageEnglish
Article number26
Number of pages59
JournalAutonomous Agents and Multi-Agent Systems
Volume36
Issue number1
DOIs
Publication statusPublished - 13 Apr 2022

Bibliographical note

Funding Information:
This research was supported by funding from the Fonds voor Wetenschappelijk Onderzoek (FWO) through the grant of Eugenio Bargiacchi (#1SA2820N), and by funding from the Flemish Government under the “Onderzoeksprogramma Artificiële Intelligentie (AI) Vlaanderen” for Diederik M. Roijers and Ann Nowé. Roxana Rădulescu was partially supported through the FWO iBOF/21/027 project “DESCARTES”. Conor F. Hayes is funded by the National University of Ireland Galway Hardiman Scholarship. Gabriel Ramos was partially supported by FAPERGS (grant 19/2551-0001277-2) and FAPESP (grant 2020/05165-1). Johan Källström and Fredrik Heintz were partially supported by the Swedish Governmental Agency for Innovation Systems (grant NFFP7/2017-04885), and the Wallenberg Artificial Intelligence, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation. Matthew Macfarlane was funded by LIFT-project 019.011 which is partly financed by the Dutch Research Council (NWO). Luisa Zintgraf is supported by the 2017 Microsoft Research PhD Scholarship Program, and the 2020 Microsoft Research EMEA PhD Award.

Publisher Copyright:
© 2022, The Author(s).

Fingerprint

Dive into the research topics of 'A Practical Guide to Multi-Objective Reinforcement Learning and Planning'. Together they form a unique fingerprint.

Cite this