Programmatic Reinforcement Learning using Critic-Moderated Evolution

Research output: Unpublished contribution to conference (poster)


Abstract

We propose a new method to generate a program from a Reinforcement Learning policy. Compared to previous methods, we exploit more RL-specific elements such as the critic value-network. Improved actions from the critic are used to steer a Genetic Programming process via a fitness function.
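The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy `critic`, the finite-difference action improvement, the linear program template `action = w * state + b`, and all function names and hyperparameters are assumptions made for illustration. A simple mutation-and-elitism loop over that fixed template also stands in for full Genetic Programming over program trees.

```python
import random

def critic(state, action):
    # Hypothetical stand-in for a learned critic Q(s, a).
    # By construction it peaks at action = -0.5 * state.
    return -(action + 0.5 * state) ** 2

def improved_action(state, policy_action, step=0.1, n_steps=20, eps=1e-4):
    # Nudge the policy's action uphill on the critic using a
    # central finite-difference estimate of dQ/da (an assumed scheme).
    a = policy_action
    for _ in range(n_steps):
        grad = (critic(state, a + eps) - critic(state, a - eps)) / (2 * eps)
        a += step * grad
    return a

def fitness(program, states, targets):
    # Negative mean squared error between the program's actions
    # and the critic-improved target actions (higher is better).
    w, b = program
    errors = [(w * s + b - t) ** 2 for s, t in zip(states, targets)]
    return -sum(errors) / len(errors)

def evolve(states, targets, pop_size=30, generations=100, seed=0):
    # Elitist evolutionary loop: keep the top fifth, refill the
    # population with Gaussian-mutated copies of the elite.
    rng = random.Random(seed)
    pop = [(rng.uniform(-2, 2), rng.uniform(-2, 2)) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda p: fitness(p, states, targets), reverse=True)
        elite = pop[: pop_size // 5]
        children = []
        while len(elite) + len(children) < pop_size:
            w, b = rng.choice(elite)
            children.append((w + rng.gauss(0, 0.1), b + rng.gauss(0, 0.1)))
        pop = elite + children
    return max(pop, key=lambda p: fitness(p, states, targets))

def policy(state):
    # Hypothetical imperfect RL policy to be distilled into a program.
    return -0.3 * state

states = [-1.0, -0.5, 0.5, 1.0]
targets = [improved_action(s, policy(s)) for s in states]
best_program = evolve(states, targets)
print("evolved (w, b):", best_program)
```

In this sketch the critic only enters through `targets`, so the evolutionary search itself never needs environment rollouts; whether the poster's method works this way is not stated in the abstract.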
Original language: English
Publication status: Unpublished - 18 Nov 2024
Event: BNAIC/BeNeLearn 2024: Joint International Scientific Conferences on AI and Machine Learning - Jaarbeurs Supernova, Utrecht, Netherlands
Duration: 18 Nov 2024 to 20 Nov 2024
Conference number: 36
https://bnaic2024.sites.uu.nl/

Conference

Conference: BNAIC/BeNeLearn 2024: Joint International Scientific Conferences on AI and Machine Learning
Abbreviated title: BNAIC/BeNeLearn 2024
Country/Territory: Netherlands
City: Utrecht
Period: 18/11/24 to 20/11/24
Internet address: https://bnaic2024.sites.uu.nl/

Keywords

  • Deep Reinforcement Learning
  • Genetic Programming
  • Explainable AI
