Alborz Geramifard
Alborz Geramifard
Research Manager at Facebook
Dirección de correo verificada de fb.com - Página principal
Título
Citado por
Citado por
Año
Dyna-style planning with linear function approximation and prioritized sweeping
RS Sutton, C Szepesvári, A Geramifard, MP Bowling
arXiv preprint arXiv:1206.3285, 2012
1472012
A tutorial on linear function approximators for dynamic programming and reinforcement learning
A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How
Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013
1012013
Decentralized control of partially observable Markov decision processes
C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer
52nd IEEE Conference on Decision and Control, 2398-2405, 2013
922013
Incremental least-squares temporal difference learning
A Geramifard, M Bowling, RS Sutton
Proceedings of the National Conference on Artificial Intelligence 21 (1), 356, 2006
762006
Online Discovery of Feature Dependencies.
A Geramifard, F Doshi, J Redding, N Roy, JP How
ICML, 881-888, 2011
742011
Cooperative mission planning for multi-UAV teams
SS Ponda, LB Johnson, A Geramifard, JP How
Handbook of unmanned aerial vehicles 2, 1447-1490, 2015
592015
iLSTD: Eligibility traces and convergence analysis
A Geramifard, M Bowling, M Zinkevich, RS Sutton
Advances in Neural Information Processing Systems, 441-448, 2007
542007
On the design and use of a micro air vehicle to track and avoid adversaries
R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ...
The International Journal of Robotics Research 29 (5), 529-546, 2010
512010
Rlpy: a value-function-based reinforcement learning framework for education and research
A Geramifard, C Dann, RH Klein, W Dabney, JP How
MIT Press, 2015
402015
UAV cooperative control with stochastic risk models
A Geramifard, J Redding, N Roy, JP How
Proceedings of the 2011 American Control Conference, 3393-3398, 2011
362011
An intelligent cooperative control architecture
J Redding, A Geramifard, A Undurti, HL Choi, JP How
Proceedings of the 2010 American control conference, 57-62, 2010
332010
Biased Cost Pathfinding.
A Geramifard, P Chubak, V Bulitko
AIIDE, 112-114, 2006
302006
Adaptive planning for Markov decision processes with uncertain transition models via incremental feature dependency discovery
NK Ure, A Geramifard, G Chowdhary, JP How
Joint European conference on machine learning and knowledge discovery in …, 2012
292012
Intelligent cooperative control architecture: a framework for performance improvement using safe learning
A Geramifard, J Redding, JP How
Journal of Intelligent & Robotic Systems 72 (1), 83-103, 2013
242013
Reinforcement learning with misspecified model classes
J Joseph, A Geramifard, JW Roberts, JP How, N Roy
2013 IEEE International Conference on Robotics and Automation, 939-946, 2013
242013
RLPy: The Reinforcement Learning Library for Education and Research
A Geramifard, RH Klein, P Jonathan
192013
Learning robust dialog policies in noisy environments
M Fazel-Zarandi, SW Li, J Cao, J Casale, P Henderson, D Whitney, ...
arXiv preprint arXiv:1712.04034, 2017
162017
Practical reinforcement learning using representation learning and safe exploration for large scale Markov decision processes
A Geramifard
Massachusetts Institute of Technology, 2012
162012
Handbook of Unmanned Aerial Vehicles, chapter Linear Flight Contol Techniques for Unmanned Aerial Vehicles
JP How, E Frazzoli, G Chowdhary
Springer, 2012
152012
Batch iFDD: A scalable matching pursuit algorithm for solving MDPs
A Geramifard, TJ Walsh, N Roy, J How
Proceedings of the 29th Annual Conference on Uncertainty in Artificial …, 2013
132013
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20