Victor Gabillon

Cited by

	All	Since 2019
Citations	1098	724
h-index	14	12
i10-index	15	13

180

135

201220132014201520162017201820192020202120222023202418 30 50 54 75 72 67 103 128 164 137 139 51

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Mohammad GhavamzadehAmazonVerified email at amazon.com
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchVerified email at inria.fr
Peter BartlettProfessor, EECS and Statistics, UC BerkeleyVerified email at cs.berkeley.edu
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Verified email at univ-lorraine.fr
Brian ErikssonAdobeVerified email at adobe.com
Branislav KvetonAmazonVerified email at amazon.com
Yasin Abbasi YadkoriDeepMindVerified email at google.com
S MuthukrishnanRutgers UnivVerified email at cs.rutgers.edu
Zheng WenGoogle DeepMindVerified email at google.com
Alan MalekMITVerified email at mit.edu
Sebastien BubeckVP GenAI Research, Microsoft AIVerified email at microsoft.com
Fabio Maria CarlucciMetaVerified email at meta.com
Antoine YangGoogle DeepMindVerified email at google.com
Pedro M EsperançaMachine Learning Engineer (London, UK)Verified email at huawei.com
Hang XuResearcher, Huawei Noah's Ark LabVerified email at huawei.com
Ronald OrtnerMontanuniversität LeobenVerified email at unileoben.ac.at
Jun WangProfessor, Computer Science, University College LondonVerified email at cs.ucl.ac.uk
Haitham Bou-AmmarRL-Team Leader, BO-Team Leader, MAS-Team Leader @ Huawei London & H. Assistant Professor @ UCLVerified email at huawei.com
Rasul TutunovMassachusetts Institute of TechnologyVerified email at mit.edu

Victor Gabillon

Unknown affiliation

No verified email - Homepage

machine learning learning theory reinforcement learning online learning multi-armed bandits


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Best arm identification: A unified approach to fixed budget and fixed confidence V Gabillon, M Ghavamzadeh, A Lazaric NIPS, Neural Information Processing Systems, 2012	339	2012
Approximate modified policy iteration and its application to the game of Tetris. B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist JMLR, Journal of Machine Learning Research 16, 2015	149	2015
Multi-bandit best arm identification V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck NIPS, Neural Information Processing Systems, 2011	122	2011
Approximate dynamic programming finally performs well in the game of Tetris V Gabillon, M Ghavamzadeh, B Scherrer NIPS, Neural Information Processing systems, 2013	76	2013
Adaptive submodular maximization in bandit setting V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan NIPS, Neural Information Processing Systems, 2013	64	2013
Approximate modified policy iteration B Scherrer, V Gabillon, M Ghavamzadeh, M Geist ICML, International Conference on Machine Learning, 2012	58	2012
Best of both worlds: Stochastic & adversarial best-arm identification Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko Conference on learning theory, 918-949, 2018	47	2018
Improved learning complexity in combinatorial pure exploration bandits V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett AISTATS, Artificial Intelligence and Statistics, 2016	45	2016
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption PL Bartlett, V Gabillon, M Valko ALT, Algorithmic Learning Theory, 2019	41	2019
Classification-based policy iteration with a critic V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer ICML, International Conference on Machine Learning, 2011	30	2011
MANAS: Multi-agent neural architecture search V Lopes, FM Carlucci, PM Esperança, M Singh, V Gabillon, A Yang, H Xu, ... arXiv preprint arXiv:1909.01051, 2019	25*	2019
Hit-and-Run for Sampling and Planning in Non-Convex Spaces Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek AISTATS, Artificial Intelligence and Statistics, 2017	23	2017
Large-Scale Optimistic Adaptive Submodularity. V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan AAAI, Association for the Advancement of Artificial Intelligence, 2014	17	2014
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem Y Abbasi-Yadkori, PL Bartlett, V Gabillon NIPS, Neural Information Processing Systems, 2017	14	2017
Rollout allocation strategies for classification-based policy iteration V Gabillon, A Lazaric, M Ghavamzadeh Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010	14	2010
Derivative-Free & Order-Robust Optimisation V Gabillon, R Tutunov, M Valko, HB Ammar AISTATS, Artificial Intelligence and Statistics, 2020	8*	2020
Scale-free adaptive planning for deterministic dynamics & discounted rewards P Bartlett, V Gabillon, J Healey, M Valko ICML, International Conference on Machine Learning, 495-504, 2019	7	2019
Adaptive multi-fidelity optimization with fast learning rates C Fiegel, V Gabillon, M Valko International Conference on Artificial Intelligence and Statistics, 3493-3502, 2020	5	2020
Multi-media content-recommender system that learns how to elicit user preferences VF Gabillon, B Kveton, B Eriksson US Patent App. 14/489,703, 2016	5	2016
Machine learning tools for online advertisement V Gabillon Technical report, INRIA Lille, France, 2009	5	2009

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors