Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration

Matteo Papini, Andrea Battistello, Marcello Restelli; 23rd International Conference on Artificial Intelligence and Statistics, August 26-28, online

  • Paper
  • Talk slides
  • Live sessions: 26 Aug at 4 pm (UTC+2) and 28 Aug at 12 noon (UTC+2)
  • Code


The conference won’t be held in Palermo, Sicily, as originally planned. You are still encouraged to eat cannoli (see picture above).

Written on June 15, 2020