AAAI 2021

Policy Optimization as Online Learning with Mediator Feedback

Alberto Maria Metelli, Matteo Papini, Pierluca D’Oro, Marcello Restelli; 35th AAAI Conference on Artificial Intelligence, February 2-9, 2021, virtual




Written on December 28, 2020