Abstract
For effective decision support in scenarios with conflicting objectives, sets of potentially optimal solutions can be presented to the decision maker. We explore both what policies these sets should contain and how such sets can be computed efficiently. With this in mind, we take a distributional approach and introduce a novel dominance criterion relating return distributions of policies directly. Based on this criterion, we present the distributional undominated set and show that it contains optimal policies otherwise ignored by the Pareto front. In addition, we propose the convex distributional undominated set and prove that it comprises all policies that maximise expected utility for multivariate risk-averse decision makers. We propose a novel algorithm to learn the distributional undominated set and further contribute pruning operators to reduce the set to the convex distributional undominated set. Through experiments, we demonstrate the feasibility and effectiveness of these methods, making this a valuable new approach for decision support in real-world problems.
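As a point of reference for the Pareto front that the abstract contrasts against, the following minimal sketch prunes a set of expected-return vectors down to its Pareto-undominated members. It is not the paper's algorithm or its distributional dominance criterion, which compares full return distributions rather than expectations. The function names `pareto_dominates` and `pareto_front` and the sample return vectors are hypothetical and chosen only for illustration; all objectives are assumed to be maximised.

```python
from typing import List, Sequence, Tuple


def pareto_dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if `a` is at least as good as `b` in every objective
    and strictly better in at least one (all objectives maximised)."""
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def pareto_front(points: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Keep only the points that no other point Pareto-dominates."""
    return [p for p in points if not any(pareto_dominates(q, p) for q in points)]


# Hypothetical expected returns of four policies over two objectives.
expected_returns = [(1.0, 5.0), (3.0, 3.0), (2.0, 2.0), (5.0, 1.0)]
front = pareto_front(expected_returns)
print(front)
```

Because this comparison only looks at expected values, two policies with identical expectations but different return distributions are indistinguishable here. That limitation is what motivates the distributional approach the abstract describes.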
Original language | English |
---|---|
Title | Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence |
Subtitle | Main Track |
Publisher | International Joint Conferences on Artificial Intelligence |
Pages | 5711–5719 |
Number of pages | 9 |
ISBN (electronic) | 978-1-956792-03-4 |
DOIs | |
Status | Published - 2023 |
Event | 32nd International Joint Conference on Artificial Intelligence - Sheraton Grand Macao, Macao, China. Duration: 19 Aug 2023 → 25 Aug 2023. https://ijcai-23.org/ |
Conference
Conference | 32nd International Joint Conference on Artificial Intelligence |
---|---|
Abbreviated title | IJCAI |
Country/Territory | China |
City | Macao |
Period | 19/08/23 → 25/08/23 |
Internet address |
Bibliographical note
Publisher Copyright: © 2023 International Joint Conferences on Artificial Intelligence. All rights reserved.
Fingerprint
Dive into the research topics of 'Distributional multi-objective decision making'. Together they form a unique fingerprint.
Projects
- 1 Active
- FWOTM1082: Reinforcement Learning in Multi-Objective Multi-Agent Systems
  1/11/21 → 31/10/25
  Project: Fundamental
Activities
- 1 Research and Teaching at External Organisation
- Research visit at the University of Galway in Ireland
  Willem Röpke (Visitor)
  14 Nov 2022 → 9 Dec 2022
  Activity: Research and Teaching at External Organisation