AAAI 2021
Policy Optimization as Online Learning with Mediator Feedback
Alberto Maria Metelli, Matteo Papini, Pierluca D’Oro, Marcello Restelli; 35th AAAI conference on Artificial Intelligence, February 2-9 2021, virtual
Written on December 28, 2020