Victor Gabillon
TitleCited byYear
Best arm identification: A unified approach to fixed budget and fixed confidence
V Gabillon, M Ghavamzadeh, A Lazaric
NIPS, Neural Information Processing Systems, 2012
1422012
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
NIPS, Neural Information Processing Systems, 2011
732011
Approximate dynamic programming finally performs well in the game of Tetris
V Gabillon, M Ghavamzadeh, B Scherrer
NIPS, Neural Information Processing systems, 2013
542013
Approximate modified policy iteration and its application to the game of Tetris.
B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist
JMLR, Journal of Machine Learning Research 16, 2015
382015
Approximate modified policy iteration
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
ICML, International Conference on Machine Learning, 2012
352012
Adaptive submodular maximization in bandit setting
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
NIPS, Neural Information Processing Systems, 2013
332013
Classification-based policy iteration with a critic
V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer
ICML, International Conference on Machine Learning, 2011
272011
Improved learning complexity in combinatorial pure exploration bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
AISTATS, Artificial Intelligence and Statistics, 2016
222016
Large-Scale Optimistic Adaptive Submodularity.
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
AAAI, Association for the Advancement of Artificial Intelligence, 2014
102014
Rollout allocation strategies for classification-based policy iteration
V Gabillon, A Lazaric, M Ghavamzadeh
Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010
102010
Best of both worlds: Stochastic & adversarial best-arm identification
Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko
COLT, Conference on Learning Theory, 2018
72018
Hit-and-Run for Sampling and Planning in Non-Convex Spaces
Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek
AISTATS, Artificial Intelligence and Statistics, 2017
62017
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption
PL Bartlett, V Gabillon, M Valko
ALT, Algorithmic Learning Theory, 2019
42019
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem
Y Abbasi-Yadkori, PL Bartlett, V Gabillon
NIPS, Neural Information Processing Systems, 2017
32017
Asymptotic performance analysis of PCA algorithms based on the weighted subspace criterion
JP Delmas, V Gabillon
ICASSP, Acoustics, Speech and Signal Processing, 2009
22009
Machine learning tools for online advertisement
V Gabillon
Technical report, INRIA, Lille, France, 2009
22009
Scale-free adaptive planning for deterministic dynamics & discounted rewards
P Bartlett, V Gabillon, J Healey, M Valko
ICML, International Conference on Machine Learning, 495-504, 2019
12019
Approximations de l'Algorithme Itérations sur les Politiques Modifié
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
JFPDA, Journées Francophones sur la planification, la décision et l …, 2012
12012
Derivative-Free & Order-Robust Optimisation
V Gabillon, R Tutunov, M Valko, HB Ammar
arXiv preprint arXiv:1910.04034, 2019
2019
MANAS: Multi-Agent Neural Architecture Search
FM Carlucci, P Esperanca, R Tutunov, M Singh, V Gabillon, A Yang, H Xu, ...
arXiv preprint arXiv:1909.01051, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–20