Matteo Papini
TitleCited byYear
Stochastic variance-reduced policy gradient
M Papini, D Binaghi, G Canonaco, M Pirotta, M Restelli
arXiv preprint arXiv:1806.05618, 2018
212018
Policy optimization via importance sampling
AM Metelli, M Papini, F Faccio, M Restelli
Advances in Neural Information Processing Systems, 5442-5454, 2018
142018
Adaptive batch size for safe policy gradients
M Papini, M Pirotta, M Restelli
Advances in Neural Information Processing Systems, 3591-3600, 2017
132017
Smoothing Policies and Safe Policy Gradients
M Papini, M Pirotta, M Restelli
arXiv preprint arXiv:1905.03231, 2019
32019
Optimistic Policy Optimization via Multiple Importance Sampling
M Papini, AM Metelli, L Lupo, M Restelli
International Conference on Machine Learning, 4989-4999, 2019
12019
Safely Exploring Policy Gradient
M Papini, A Battistello, M Restelli, A Battistello
12018
Gradient-Aware Model-based Policy Search
P D'Oro, AM Metelli, A Tirinzoni, M Papini, M Restelli
arXiv preprint arXiv:1909.04115, 2019
2019
Feature Selection via Mutual Information: New Theoretical Insights
M Beraha, AM Metelli, M Papini, A Tirinzoni, M Restelli
2019 International Joint Conference on Neural Networks (IJCNN), 1-9, 2019
2019
Adaptive batch size for safe policy gradient methods
M PAPINI
Italy, 2017
2017
The system can't perform the operation now. Try again later.
Articles 1–9