Alborz Geramifard

Citado por

	Total	Desde 2019
Citas	1930	1091
Índice h	22	16
Índice i10	36	25

260

130

195

20072008200920102011201220132014201520162017201820192020202120222023202412 12 18 14 46 89 85 100 108 92 104 138 138 157 229 240 252 73

Coautores

Jonathan P. HowRichard C. Maclaurin Professor of Aerospace Engineering, Massachusetts Institute of TechnologyDirección de correo verificada de mit.edu
Nicholas RoyMITDirección de correo verificada de csail.mit.edu
Satwik KotturResearch Scientist, Facebook AIDirección de correo verificada de fb.com
Seungwhan MoonFacebook, Carnegie Mellon UniversityDirección de correo verificada de fb.com
Ahmad BeiramiGoogle ResearchDirección de correo verificada de google.com
Michael BowlingUniversity of AlbertaDirección de correo verificada de ualberta.ca
Paul A CrookResearch Scientist, Meta Platforms, Inc.Dirección de correo verificada de fb.com
Nazim Kemal UreIstanbul Technical UniversityDirección de correo verificada de itu.edu.tr
Richard S. SuttonKeen, Amii, and University of AlbertaDirección de correo verificada de richsutton.com
Rajen SubbaGoogleDirección de correo verificada de google.com
Girish ChowdharyAssociate ProfessorDirección de correo verificada de illinois.edu
Chinnadhurai SankarResearch Lead, SliceX AI | ex-Meta AIDirección de correo verificada de fb.com
Ankita DeFacebookDirección de correo verificada de fb.com
Thomas J. WalshSony AIDirección de correo verificada de sony.com
Csaba SzepesvariDeepMind & University of AlbertaDirección de correo verificada de cs.ualberta.ca
Babak DamavandiMeta Reality LabsDirección de correo verificada de fb.com
David WhitneyMetaDirección de correo verificada de meta.com
Christoph DannResearch Scientist, GoogleDirección de correo verificada de google.com
Stefanie TellexBrown UniversityDirección de correo verificada de cs.brown.edu
Will DabneyDeepMindDirección de correo verificada de google.com

Seguir

Alborz Geramifard

Research Scientist Director at Meta

Dirección de correo verificada de meta.com - Página principal

Reinforcement Learning Conversational AI Planning Brain and Cognitive Sciences


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Dyna-style planning with linear function approximation and prioritized sweeping RS Sutton, C Szepesvári, A Geramifard, MP Bowling arXiv preprint arXiv:1206.3285, 2012	228	2012
A tutorial on linear function approximators for dynamic programming and reinforcement learning A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013	162	2013
Decentralized control of partially observable Markov decision processes C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer 52nd IEEE Conference on Decision and Control, 2398-2405, 2013	148	2013
Cooperative mission planning for multi-UAV teams SS Ponda, LB Johnson, A Geramifard, JP How Handbook of unmanned aerial vehicles 2, 1447-1490, 2015	100	2015
Incremental least-squares temporal difference learning A Geramifard, M Bowling, RS Sutton Proceedings of the 21st national conference on Artificial intelligence …, 2006	91	2006
RLPy: a value-function-based reinforcement learning framework for education and research. A Geramifard, C Dann, RH Klein, W Dabney, JP How J. Mach. Learn. Res. 16 (1), 1573-1578, 2015	86	2015
Online Discovery of Feature Dependencies. A Geramifard, F Doshi, J Redding, N Roy, JP How ICML, 881-888, 2011	81	2011
SIMMC 2.0: A task-oriented dialog dataset for immersive multimodal conversations S Kottur, S Moon, A Geramifard, B Damavandi arXiv preprint arXiv:2104.08667, 2021	75	2021
Situated and interactive multimodal conversations S Moon, S Kottur, PA Crook, A De, S Poddar, T Levin, D Whitney, ... arXiv preprint arXiv:2006.01460, 2020	75	2020
Overview of the ninth dialog system technology challenge: Dstc9 C Gunasekara, S Kim, LF D'Haro, A Rastogi, YN Chen, M Eric, ... arXiv preprint arXiv:2011.06486, 2020	69	2020
iLSTD: Eligibility traces and convergence analysis A Geramifard, M Bowling, M Zinkevich, RS Sutton Advances in Neural Information Processing Systems 19, 2006	62	2006
On the design and use of a micro air vehicle to track and avoid adversaries R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ... The International Journal of Robotics Research 29 (5), 529-546, 2010	54	2010
Intelligent cooperative control architecture: a framework for performance improvement using safe learning A Geramifard, J Redding, JP How Journal of Intelligent & Robotic Systems 72, 83-103, 2013	52	2013
Customized movie trailers A Geramifard US Patent App. 14/105,428, 2015	50	2015
Reinforcement learning with misspecified model classes J Joseph, A Geramifard, JW Roberts, JP How, N Roy 2013 IEEE International Conference on Robotics and Automation, 939-946, 2013	46	2013
UAV cooperative control with stochastic risk models A Geramifard, J Redding, N Roy, JP How Proceedings of the 2011 american control conference, 3393-3398, 2011	46	2011
Biased cost pathfinding A Geramifard, P Chubak, V Bulitko Proceedings of the AAAI Conference on Artificial Intelligence and …, 2006	41	2006
An intelligent cooperative control architecture J Redding, A Geramifard, A Undurti, HL Choi, JP How Proceedings of the 2010 American control conference, 57-62, 2010	37	2010
Adaptive planning for Markov decision processes with uncertain transition models via incremental feature dependency discovery NK Ure, A Geramifard, G Chowdhary, JP How Machine Learning and Knowledge Discovery in Databases: European Conference …, 2012	32	2012
Annotation inconsistency and entity bias in MultiWOZ K Qian, A Beirami, Z Lin, A De, A Geramifard, Z Yu, C Sankar arXiv preprint arXiv:2105.14150, 2021	30	2021

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores