Shimon Whiteson

Citado por

	Total	Desde 2019
Citas	22725	18287
Índice h	66	54
Índice i10	177	152

4800

2400

1200

3600

200720082009201020112012201320142015201620172018201920202021202220232024100 115 156 167 198 194 279 345 442 476 627 1009 1522 2312 3016 4023 4711 2694

Acceso público

Ver todo

123 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Jakob FoersterAssociate Professor, University of OxfordDirección de correo verificada de eng.ox.ac.uk
Gregory FarquharDeepMindDirección de correo verificada de google.com
Christian Schroeder de WittUniversity of OxfordDirección de correo verificada de robots.ox.ac.uk
Katja HofmannMicrosoft ResearchDirección de correo verificada de microsoft.com
Maximilian IglWaymo ResearchDirección de correo verificada de eng.ox.ac.uk
Wendelin BöhmerSequential Decision Making Group, Delft University of TechnologyDirección de correo verificada de tudelft.nl
Diederik M. RoijersSenior Researcher @ Vrije Universiteit Brussel & Academic Liaison AI Research @ City of AmsterdamDirección de correo verificada de ai.vub.ac.be
Frans A. OliehoekAssociate Professor at Delft University of TechnologyDirección de correo verificada de tudelft.nl
Peter StoneProfessor of Computer Science, The University of Texas at AustinDirección de correo verificada de cs.utexas.edu
Philip TorrProfessor, University of OxfordDirección de correo verificada de eng.ox.ac.uk
Kyriacos ShiarlisWaymoDirección de correo verificada de google.com
Tabish RashidMicrosoft ResearchDirección de correo verificada de microsoft.com
Luisa ZintgrafDeepMindDirección de correo verificada de deepmind.com
Nantas NardelliStealthDirección de correo verificada de arbitrarygravitas.com
Vitaly KurinResearch Scientist at Isomorphic LabsDirección de correo verificada de isomorphiclabs.com
Bei PengLecturer (Assistant Professor), University of LiverpoolDirección de correo verificada de liverpool.ac.uk
Yannis AssaelStaff Research Scientist, Google DeepMindDirección de correo verificada de google.com
Anuj MahajanAppleDirección de correo verificada de cs.ox.ac.uk
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindDirección de correo verificada de cs.ucl.ac.uk
Nando de FreitasCIFAR & DeepMindDirección de correo verificada de google.com

Seguir

Shimon Whiteson

Professor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo

Dirección de correo verificada de cs.ox.ac.uk - Página principal

Artificial Intelligence Machine Learning Reinforcement Learning Decision-Theoretic Planning


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21 (178), 1-51, 2020	2334	2020
Counterfactual multi-agent policy gradients J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2169	2018
Learning to communicate with deep multi-agent reinforcement learning J Foerster, IA Assael, N De Freitas, S Whiteson Advances in neural information processing systems 29, 2016	1999	2016
The starcraft multi-agent challenge M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ... arXiv preprint arXiv:1902.04043, 2019	975	2019
A survey of multi-objective sequential decision-making DM Roijers, P Vamplew, S Whiteson, R Dazeley Journal of Artificial Intelligence Research 48, 67-113, 2014	739	2014
Stabilising experience replay for deep multi-agent reinforcement learning J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ... International conference on machine learning, 1146-1155, 2017	734	2017
Learning with opponent-learning awareness JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch arXiv preprint arXiv:1709.04326, 2017	591	2017
Lipnet: End-to-end sentence-level lipreading YM Assael, B Shillingford, S Whiteson, N De Freitas arXiv preprint arXiv:1611.01599, 2016	453	2016
Fast context adaptation via meta-learning L Zintgraf, K Shiarli, V Kurin, K Hofmann, S Whiteson International Conference on Machine Learning, 7693-7702, 2019	410	2019
Maven: Multi-agent variational exploration A Mahajan, T Rashid, M Samvelyan, S Whiteson Advances in neural information processing systems 32, 2019	389	2019
Evolutionary Function Approximation for Reinforcement Learning S Whiteson, P Stone Journal of Machine Learning Research 7, 877-917, 2006	360	2006
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, G Farquhar, B Peng, S Whiteson Advances in neural information processing systems 33, 10199-10210, 2020	339	2020
Multiagent reinforcement learning for urban traffic control using coordination graphs L Kuyer, S Whiteson, B Bakker, N Vlassis Machine Learning and Knowledge Discovery in Databases: European Conference …, 2008	311	2008
Deep variational reinforcement learning for POMDPs M Igl, L Zintgraf, TA Le, F Wood, S Whiteson International conference on machine learning, 2117-2126, 2018	301	2018
A survey of reinforcement learning informed by natural language J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ... arXiv preprint arXiv:1906.03926, 2019	293	2019
Is independent learning all you need in the starcraft multi-agent challenge? CS De Witt, T Gupta, D Makoviichuk, V Makoviychuk, PHS Torr, M Sun, ... arXiv preprint arXiv:2011.09533, 2020	289	2020
A theoretical and empirical analysis of expected sarsa H Van Seijen, H Van Hasselt, S Whiteson, M Wiering 2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009	281	2009
Varibad: A very good method for bayes-adaptive deep rl via meta-learning L Zintgraf, K Shiarlis, M Igl, S Schulze, Y Gal, K Hofmann, S Whiteson arXiv preprint arXiv:1910.08348, 2019	262	2019
Rode: Learning roles to decompose multi-agent tasks T Wang, T Gupta, A Mahajan, B Peng, S Whiteson, C Zhang arXiv preprint arXiv:2010.01523, 2020	196	2020
Facmac: Factored multi-agent centralised policy gradients B Peng, T Rashid, C Schroeder de Witt, PA Kamienny, P Torr, W Böhmer, ... Advances in Neural Information Processing Systems 34, 12208-12221, 2021	194	2021

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores