Simone Parisi
PhD Student, TU Darmstadt
Verified email at ias.tu-darmstadt.de
Title / Cited by / Year
Multi-objective reinforcement learning with continuous pareto frontier approximation
M Pirotta, S Parisi, M Restelli
Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
Cited by: 26; Year: 2015
Policy gradient approaches for multi-objective sequential decision making
S Parisi, M Pirotta, N Smacchia, L Bascetta, M Restelli
2014 International Joint Conference on Neural Networks (IJCNN), 2323-2330, 2014
Cited by: 26; Year: 2014
Manifold-based multi-objective policy search with sample reuse
S Parisi, M Pirotta, J Peters
Neurocomputing, Special Issue on Multi-objective Reinforcement Learning 263 …, 2017
Cited by: 8; Year: 2017
Reinforcement learning vs human programming in tetherball robot games
S Parisi, H Abdulsamad, A Paraschos, C Daniel, J Peters
2015 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2015
Cited by: 8; Year: 2015
Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation
S Parisi, M Pirotta, M Restelli
Journal of Artificial Intelligence Research 57, 187-227, 2016
Cited by: 7; Year: 2016
TD-regularized actor-critic methods
S Parisi, V Tangkaratt, J Peters, ME Khan
Machine Learning, 1-35, 2019
Cited by: 4; Year: 2019
Goal-Driven Dimensionality Reduction for Reinforcement Learning
S Parisi, S Ramstedt, J Peters
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017
Cited by: 4; Year: 2017
Policy search with high-dimensional context variables
V Tangkaratt, H van Hoof, S Parisi, G Neumann, J Peters, M Sugiyama
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 4; Year: 2017
Policy gradient approaches for multi-objective sequential decision making: A comparison
S Parisi, M Pirotta, N Smacchia, L Bascetta, M Restelli
2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014
Cited by: 3; Year: 2014
Local-utopia Policy Selection for Multi-objective Reinforcement Learning
S Parisi, A Blank, T Viernickel, J Peters
2016 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2016
Cited by: 2; Year: 2016
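Studio e analisi di algoritmi di apprendimento per rinforzo policy gradient per la risoluzione di problemi decisionali multiobiettivo [Study and analysis of policy gradient reinforcement learning algorithms for solving multi-objective decision problems]
S Parisi, N Smacchia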
Italy, 2014
Year: 2014