Projects per year
Abstract
According to the social intelligence hypothesis, cooperation is considered a key
component of intelligence and is required to solve a wide range of problems, from
everyday challenges like scheduling meetings to global challenges like mitigating
climate change and providing humanitarian aid. Extending the ability for artificial
intelligence (AI) to cooperate well is critical as AI becomes more prevalent in our
lives. In recent years, multi-agent reinforcement learning (MARL) has emerged
as a powerful approach to model and analyse the problem of cooperation among
artificial agents. In this paper, we investigate the impact of communication on
cooperation among reinforcement learning agents in social dilemmas. These are
settings in which the short-term individual interests are in conflict with the longterm collective ones, thus each individual profits from defecting, but the overall
group would benefit if everyone cooperates. We particularly focus on a temporally and spatially extended Stag-Hunt-like social dilemma that models animal
foraging behaviour using principles from Optimal Foraging Theory. We propose
a method for communication that combines a graph-based attention mechanism
with deep reinforcement learning methods. Additionally, we examine several
facets of communication, including the effects of the communication topology
and the communication range. We find that greater cooperative behaviour can
be achieved through graph-based communication using reinforcement learning in
social dilemmas. Additionally, we find that during foraging, local communication
promotes better cooperation than long-distance communication. Finally, we visualise and investigate the learned attention weights and explain how agents process
communications from other agents.
component of intelligence and is required to solve a wide range of problems, from
everyday challenges like scheduling meetings to global challenges like mitigating
climate change and providing humanitarian aid. Extending the ability for artificial
intelligence (AI) to cooperate well is critical as AI becomes more prevalent in our
lives. In recent years, multi-agent reinforcement learning (MARL) has emerged
as a powerful approach to model and analyse the problem of cooperation among
artificial agents. In this paper, we investigate the impact of communication on
cooperation among reinforcement learning agents in social dilemmas. These are
settings in which the short-term individual interests are in conflict with the longterm collective ones, thus each individual profits from defecting, but the overall
group would benefit if everyone cooperates. We particularly focus on a temporally and spatially extended Stag-Hunt-like social dilemma that models animal
foraging behaviour using principles from Optimal Foraging Theory. We propose
a method for communication that combines a graph-based attention mechanism
with deep reinforcement learning methods. Additionally, we examine several
facets of communication, including the effects of the communication topology
and the communication range. We find that greater cooperative behaviour can
be achieved through graph-based communication using reinforcement learning in
social dilemmas. Additionally, we find that during foraging, local communication
promotes better cooperation than long-distance communication. Finally, we visualise and investigate the learned attention weights and explain how agents process
communications from other agents.
Original language | English |
---|---|
Number of pages | 17 |
Publication status | Accepted/In press - Aug 2023 |
Event | The 16th European Workshop on Reinforcement Learning - Vrije Universiteit Brussel, Brussels, Belgium Duration: 14 Sep 2023 → 16 Sep 2023 https://ewrl.wordpress.com/ewrl16-2023/ |
Workshop
Workshop | The 16th European Workshop on Reinforcement Learning |
---|---|
Abbreviated title | EWRL 2023 |
Country/Territory | Belgium |
City | Brussels |
Period | 14/09/23 → 16/09/23 |
Internet address |
Fingerprint
Dive into the research topics of 'Cooperative Foraging Behaviour Through Multi-Agent Reinforcement Learning with Graph-Based Communication'. Together they form a unique fingerprint.Projects
- 1 Active
-
FWOTM1108: Decision-making in team-reward multi-objective multi-agent domains
1/10/22 → 28/02/27
Project: Fundamental