AAAI 2021

Policy Optimization as Online Learning with Mediator Feedback

Alberto Maria Metelli, Matteo Papini, Pierluca D’Oro, Marcello Restelli; 35th AAAI conference on Artificial Intelligence, February 2-9 2021, virtual

  • Preprint
  • Code
  • Poster sessions: 7-Feb 08:45AM-10:30AM PST, 8-Feb 12:45AM-02:30AM PST
  • Poster pdf



image-title-here

Written on December 28, 2020