ICML 2019

OPTIMIST: Optimistic Policy Optimization via Multiple Importance Sampling

Read More

EWRL 2018

Safely Exploring Policy Gradient

Matteo Papini, Andrea Battistello, Marcello Restelli; 14th European Workshop on Reinforcement Learning, 2018

Read More

ICML 2018

SVRPG: Stochastic Variance-Reduced Policy Gradient

Matteo Papini, Damiano Binaghi, Giuseppe Canonaco, Matteo Pirotta, Marcello Restelli; 35th International Conference on Machine Learning, Stockholm, 2018 [bibtex]

Read More

NIPS 2017

Adaptive Batch Size for Safe Policy Gradients

Matteo Papini, Matteo Pirotta, Marcello Restelli, Advances in Neural Information Processing Systems, Long Beach, 2017 [bibtex]

Read More