Abstract
In this paper, we address the problem of ensuring that autonomous learning agents are in alignment with multiple moral values. Specifically, we present the theoretical principles and algorithmic tools necessary for creating an environment where an agent is assured of learning a behaviour (or policy) that corresponds to multiple moral values while striving to achieve its individual objective.
To address this value alignment problem, we adopt the Multi-Objective Reinforcement Learning framework and propose a novel algorithm that combines techniques from Multi-Objective Reinforcement Learning and Linear Programming. In addition to providing theoretical guarantees, we illustrate our value alignment process with an example involving an autonomous vehicle. Here, we demonstrate that the agent learns to behave in alignment with the ethical values of safety, achievement, and comfort. Additionally, we use a synthetic multi-objective environment generator to evaluate the computational costs associated with guaranteeing ethical learning in situations with an increasing number of values.
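As a rough illustration of how linear scalarisation and linear programming can interact in this setting, the sketch below is not the algorithm proposed in the paper. It assumes a simplified case with one individual objective and one ethical objective, where each candidate policy is summarised by a hypothetical value pair `(V_ind, V_eth)`. It then finds the smallest ethical weight `w` for which a chosen target policy is strictly optimal under the scalarised value `V_ind + w * V_eth`. With a single weight, the linear program reduces to a closed-form maximum over the constraints. The function name, the margin `eps`, and all numbers are assumptions made for illustration.

```python
from typing import List, Tuple

Policy = Tuple[float, float]  # hypothetical summary: (V_ind, V_eth)


def min_ethical_weight(target: Policy, others: List[Policy], eps: float = 1e-3) -> float:
    """Smallest w >= 0 such that, for every rival policy,
    target_ind + w * target_eth >= rival_ind + w * rival_eth + eps.

    Assumes the target has the highest ethical value among all policies.
    This is a one-variable linear program, solved here in closed form.
    """
    w = 0.0
    for v_ind, v_eth in others:
        gain = target[1] - v_eth   # ethical advantage of the target
        loss = v_ind - target[0]   # individual-objective advantage of the rival
        if gain < 0:
            raise ValueError("assumption violated: a rival has a higher ethical value")
        if gain == 0:
            if loss + eps > 0:
                raise ValueError("no weight can make the target strictly optimal")
            continue
        # Constraint w * gain >= loss + eps gives a lower bound on w.
        w = max(w, (loss + eps) / gain)
    return w


# Illustrative, made-up policy values.
ethical_policy = (2.0, 10.0)
rival_policies = [(5.0, 4.0), (3.0, 8.0)]
weight = min_ethical_weight(ethical_policy, rival_policies)
print(f"minimal ethical weight: {weight:.4f}")
```

With several ethical values, such as the safety and comfort mentioned above, the weight becomes a vector and the closed form no longer applies, so a general linear programming solver would be needed instead.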
Original language | English |
---|---|
Number of pages | 9 |
Publication status | Published - May 2023 |
Event | 2023 Adaptive and Learning Agents Workshop at AAMAS, London, United Kingdom. Duration: 29 May 2023 → 30 May 2023. https://alaworkshop2023.github.io |
Workshop
Workshop | 2023 Adaptive and Learning Agents Workshop at AAMAS |
---|---|
Abbreviated title | ALA 2023 |
Country/Territory | United Kingdom |
City | London |
Period | 29/05/23 → 30/05/23 |
Internet address |
Fingerprint
Dive into the research topics of 'Multi-objective reinforcement learning for guaranteeing alignment with multiple values'. Together they form a unique fingerprint.
Projects
- 1 Active
- FWOTM1108: Decision-making in team-reward multi-objective multi-agent domains
  1/10/22 → 28/02/27
  Project: Fundamental