Mohammad Ghavamzadeh

Cited by

	All	Since 2019
Citations	14527	11116
h-index	58	46
i10-index	126	118

Citations per year

Year	Citations
2005	36
2006	50
2007	61
2008	69
2009	100
2010	105
2011	180
2012	230
2013	200
2014	268
2015	303
2016	361
2017	414
2018	581
2019	920
2020	1203
2021	1671
2022	2093
2023	2735
2024	2470

Public access

15 articles available
0 articles not available

Based on funding mandates

Co-authors

Yinlam Chow	Research Scientist, Google Research	Verified email at google.com
Alessandro Lazaric	Research Scientist, Facebook Artificial Intelligence Research	Verified email at inria.fr
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia	Verified email at technion.ac.il
Branislav Kveton	Adobe Research	Verified email at adobe.com
Sridhar Mahadevan	Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst	Verified email at cs.umass.edu
Rémi Munos	Google DeepMind	Verified email at inria.fr
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Georgios Theocharous	Adobe Research	Verified email at adobe.com
Craig Boutilier	Principal Scientist, Google	Verified email at google.com
Marek Petrik	University of New Hampshire	Verified email at cs.unh.edu
Amir-massoud Farahmand	University of Toronto	Verified email at cs.toronto.edu
Ofir Nachum	OpenAI	Verified email at openai.com
Philip Thomas	University of Massachusetts Amherst	Verified email at cs.umass.edu
Hung Bui	Research Scientist, Google DeepMind	Verified email at google.com
Zheng Wen	Google DeepMind	Verified email at google.com
Shalabh Bhatnagar	Professor in the Department of Computer Science and Automation, Indian Institute of Science	Verified email at iisc.ac.in
Richard S. Sutton	Keen, Amii, and University of Alberta	Verified email at richsutton.com
Aviv Tamar	Technion	Verified email at technion.ac.il
Yonathan Efroni	Meta, New York	Verified email at fb.com
Manzil Zaheer	Google Research	Verified email at cmu.edu

Mohammad Ghavamzadeh
Amazon
Verified email at amazon.com
Research interests: Reinforcement Learning, Online Learning, Machine Learning, Control, AI


Title	Authors	Venue	Cited by	Year
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges	M Abdar, F Pourpanah, S Hussain, D Rezazadegan, L Liu, ...	Information Fusion, 2021	2137	2021
Natural Actor–critic Algorithms	S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee	Automatica 45 (11), 2471-2482, 2009	1101*	2009
A Lyapunov-based Approach to Safe Reinforcement Learning	Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh	Neural Information Processing Systems, 8103-8112, 2018	611	2018
Risk-constrained Reinforcement Learning with Percentile Risk Criteria	Y Chow, M Ghavamzadeh, L Janson, M Pavone	Journal of Machine Learning Research (JMLR) 18, 6070-6120, 2017	590	2017
Bayesian Reinforcement Learning: A Survey	M Ghavamzadeh, S Mannor, J Pineau, A Tamar	Foundations and Trends in Machine Learning 8 (5-6), 359-483, 2015	562	2015
Algorithms for CVaR Optimization in MDPs	Y Chow, M Ghavamzadeh	Advances in Neural Information Processing Systems, 3509-3517, 2014	386	2014
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence	V Gabillon, M Ghavamzadeh, A Lazaric	Neural Information Processing Systems, 3221-3229, 2012	350	2012
High-confidence Off-policy Evaluation	P Thomas, G Theocharous, M Ghavamzadeh	AAAI, 3000-3006, 2015	329	2015
Actor-Critic Algorithms for Risk-sensitive MDPs	LA Prashanth, M Ghavamzadeh	Neural Information Processing Systems, 252-260, 2013	318*	2013
Safe Policy Learning for Continuous Control	Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh	Conference on Robot Learning (CoRL), 2020	294*	2020
More Robust Doubly Robust Off-policy Evaluation	M Farajtabar, Y Chow, M Ghavamzadeh	ICML, 1447-1456, 2018	283	2018
High Confidence Policy Improvement	P Thomas, G Theocharous, M Ghavamzadeh	ICML, 2380-2388, 2015	224	2015
Speedy Q-learning	M Ghavamzadeh, H Kappen, M Azar, R Munos	Neural Information Processing Systems 24, 2411-2419, 2011	220*	2011
Personalized Ad Recommendation Systems for Life-time Value Optimization with Guarantees	G Theocharous, PS Thomas, M Ghavamzadeh	IJCAI, 1806-1812, 2015	207*	2015
Benchmarking Batch Deep Reinforcement Learning Algorithms	S Fujimoto, E Conti, M Ghavamzadeh, J Pineau	arXiv preprint arXiv:1910.01708, 2019	202	2019
Supervised actor-critic reinforcement learning	MT Rosenstein, AG Barto, J Si, A Barto, W Powell, D Wunsch	Learning and approximate dynamic programming: scaling up to the real world …, 2004	199	2004
Hierarchical Multi-agent Reinforcement Learning	R Makar, S Mahadevan, M Ghavamzadeh	International Conference on Autonomous Agents, 246-253, 2001	196	2001
Aligning text-to-image models using human feedback	K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ...	arXiv preprint arXiv:2302.12192, 2023	182	2023
Finite-Sample Analysis of Proximal Gradient TD Algorithms	B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik	UAI, 504-513, 2015	178*	2015
Hierarchical Multi-agent Reinforcement Learning	M Ghavamzadeh, S Mahadevan, R Makar	Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS) 13 (2), 197-229, 2006	177	2006
