Seguir
Shie Mannor
Shie Mannor
Professor of Electrical Engineering @ Technion & Researcher @ Nvidia
Dirección de correo verificada de technion.ac.il - Página principal
Título
Citado por
Citado por
Año
A Tutorial on the Cross-Entropy Method
B DE, P KROESE, S MANNOR
Annals of Operations Research 134 (1), 19-67, 2005
3524*2005
The kernel recursive least-squares algorithm
Y Engel, S Mannor, R Meir
IEEE Transactions on signal processing 52 (8), 2275-2285, 2004
12692004
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems.
E Even-Dar, S Mannor, Y Mansour, S Mahadevan
Journal of machine learning research 7 (6), 2006
7922006
Robustness and Regularization of Support Vector Machines.
H Xu, C Caramanis, S Mannor
Journal of machine learning research 10 (7), 2009
6052009
Reward constrained policy optimization
C Tessler, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1805.11074, 2018
6032018
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
5802015
PAC bounds for multi-armed bandit and Markov decision processes
E Even-Dar, S Mannor, Y Mansour
Computational Learning Theory: 15th Annual Conference on Computational …, 2002
5482002
Robustness and generalization
H Xu, S Mannor
Machine learning 86, 391-423, 2012
5242012
Reinforcement learning with Gaussian processes
Y Engel, S Mannor, R Meir
ICML, 201-208, 2005
5072005
The sample complexity of exploration in the multi-armed bandit problem
S Mannor, JN Tsitsiklis
Journal of Machine Learning Research 5 (Jun), 623-648, 2004
4942004
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
4682017
Risk-sensitive and robust decision-making: a cvar optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in neural information processing systems 28, 2015
3922015
The cross entropy method for classification
S Mannor, D Peleg, R Rubinstein
Proceedings of the 22nd international conference on Machine learning, 561-568, 2005
3892005
Robust regression and lasso
H Xu, C Caramanis, S Mannor
Advances in neural information processing systems 21, 2008
3852008
Q-cut—dynamic discovery of sub-goals in reinforcement learning
I Menache, S Mannor, N Shimkin
Machine Learning: ECML 2002: 13th European Conference on Machine Learning …, 2002
3722002
Policy gradients with variance related risk criteria
A Tamar, D Di Castro, S Mannor
Proceedings of the twenty-ninth international conference on machine learning …, 2012
3552012
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International conference on machine learning, 1899-1908, 2016
3472016
Percentile optimization for Markov decision processes with parameter uncertainty
E Delage, S Mannor
Operations research 58 (1), 203-213, 2010
346*2010
Dynamic abstraction in reinforcement learning via clustering
S Mannor, I Menache, A Hoze, U Klein
Proceedings of the twenty-first international conference on Machine learning, 71, 2004
3372004
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
Y Engel, S Mannor, R Meir
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
3122003
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20