Shimon Whiteson
Shimon Whiteson
Professor of Computer Science, University of Oxford
Dirección de correo verificada de cs.ox.ac.uk - Página principal
Título
Citado por
Citado por
Año
Learning to communicate with deep multi-agent reinforcement learning
J Foerster, IA Assael, N De Freitas, S Whiteson
Advances in neural information processing systems, 2137-2145, 2016
5322016
Counterfactual multi-agent policy gradients
JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Thirty-second AAAI conference on artificial intelligence, 2018
4092018
A survey of multi-objective sequential decision-making
DM Roijers, P Vamplew, S Whiteson, R Dazeley
Journal of Artificial Intelligence Research 48, 67-113, 2014
3042014
Evolutionary Function Approximation for Reinforcement Learning
S Whiteson, P Stone
Journal of Machine Learning Research 7, 877-917, 2006
3042006
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
arXiv preprint arXiv:1702.08887, 2017
2402017
QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson
arXiv preprint arXiv:1803.11485, 2018
1842018
Learning with opponent-learning awareness
JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch
arXiv preprint arXiv:1709.04326, 2017
1822017
Multiagent reinforcement learning for urban traffic control using coordination graphs
L Kuyer, S Whiteson, B Bakker, N Vlassis
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
1572008
Transfer via inter-task mappings in policy search reinforcement learning
ME Taylor, S Whiteson, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
1232007
Automatic feature selection in neuroevolution
S Whiteson, P Stone, KO Stanley, R Miikkulainen, N Kohl
Proceedings of the 7th annual conference on Genetic and evolutionary …, 2005
1222005
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
K Hofmann, S Whiteson, M de Rijke
Information Retrieval 16 (1), 63-90, 2013
1192013
Lipnet: End-to-end sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599, 2016
1182016
Exploiting locality of interaction in factored Dec-POMDPs
FA Oliehoek, MTJ Spaan, N Vlassis, S Whiteson
Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems, 517-524, 2008
1182008
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
ME Taylor, S Whiteson, P Stone
Proceedings of the 8th annual conference on Genetic and evolutionary …, 2006
1172006
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009
1152009
Evolving soccer keepaway players through task decomposition
S Whiteson, N Kohl, R Miikkulainen, P Stone
Machine Learning 59 (1-2), 5-30, 2005
1152005
A probabilistic method for inferring preferences from clicks
K Hofmann, S Whiteson, M De Rijke
Proceedings of the 20th ACM international conference on Information and …, 2011
1082011
Adaptive tile coding for value function approximation
S Whiteson
1022007
Lipnet: Sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N de Freitas
arXiv preprint arXiv:1611.01599 2 (8), 2016
962016
Reusing historical interaction data for faster online learning to rank for IR
K Hofmann, A Schuth, S Whiteson, M De Rijke
Proceedings of the sixth ACM international conference on Web search and data …, 2013
882013
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20