Multi-objective reinforcement learning with continuous pareto frontier approximation M Pirotta, S Parisi, M Restelli Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015 | 26 | 2015 |
Policy gradient approaches for multi-objective sequential decision making S Parisi, M Pirotta, N Smacchia, L Bascetta, M Restelli 2014 International Joint Conference on Neural Networks (IJCNN), 2323-2330, 2014 | 26 | 2014 |
Manifold-based multi-objective policy search with sample reuse S Parisi, M Pirotta, J Peters Neurocomputing, Special Issue on Multi-objective Reinforcement Learning 263 …, 2017 | 8 | 2017 |
Reinforcement learning vs human programming in tetherball robot games S Parisi, H Abdulsamad, A Paraschos, C Daniel, J Peters 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2015 | 8 | 2015 |
Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation S Parisi, M Pirotta, M Restelli Journal of Artificial Intelligence Research 57, 187-227, 2016 | 7 | 2016 |
TD-regularized actor-critic methods S Parisi, V Tangkaratt, J Peters, ME Khan Machine Learning, 1-35, 2019 | 4 | 2019 |
Goal-Driven Dimensionality Reduction for Reinforcement Learning S Parisi, S Ramstedt, J Peters 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017 | 4 | 2017 |
Policy search with high-dimensional context variables V Tangkaratt, H van Hoof, S Parisi, G Neumann, J Peters, M Sugiyama Thirty-First AAAI Conference on Artificial Intelligence, 2017 | 4 | 2017 |
Policy gradient approaches for multi-objective sequential decision making: A comparison S Parisi, M Pirotta, N Smacchia, L Bascetta, M Restelli 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014 | 3 | 2014 |
Local-utopia Policy Selection for Multi-objective Reinforcement Learning S Parisi, A Blank, T Viernickel, J Peters 2016 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2016 | 2 | 2016 |
Studio e analisi di algoritmi di apprendimento per rinforzo policy gradient per la risoluzione di problemi decisionali multiobiettivo S PARISI, N SMACCHIA Italy, 2014 | | 2014 |