Seguir
Claas Voelcker
Claas Voelcker
PhD student at University of Toronto
Dirección de correo verificada de cs.toronto.edu
Título
Citado por
Citado por
Año
Structured object-aware physics prediction for video modeling and planning
J Kossen, K Stelzner, M Hussing, C Voelcker, K Kersting
International Conference on Learning Representations 2020, 2020
722020
Value Gradient weighted Model-Based Reinforcement Learning
C Voelcker, V Liao, A Garg, A Farahmand
International Conference on Learning Representations 2022, 2022
332022
Queer in AI: A case study in community-led participatory AI
OO Queerinai, A Ovalle, A Subramonian, A Singh, C Voelcker, ...
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023
262023
Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence
M Hussing, C Voelcker, I Gilitschenski, A Farahmand, E Eaton
Reinforcement Learning Conference 2024, 2024
52024
VIPer: Iterative Value-Aware Model Learning on the Value Improvement Path
R Abachi, CA Voelcker, A Garg, A Farahmand
Decision Awareness in Reinforcement Learning Workshop at ICML 2022, 2022
32022
DeepNotebooks: Deep Probabilistic Models Construct Python Notebooks for Reporting Datasets
C Voelcker, A Molina, J Neumann, D Westermann, K Kersting
ECMLPKDD Workshop on Automating Data Science, 2019
32019
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
CA Voelcker, M Hussing, E Eaton
arXiv preprint arXiv:2410.08870, 2024
12024
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
C Voelcker, T Kastner, I Gilitschenski, A Farahmand
Reinforcement Learning Conference 2024, 2024
12024
-AC: Learning latent decision-aware models for reinforcement learning in continuous state-spaces
CA Voelcker, A Ahmadian, R Abachi, I Gilitschenski, A Farahmand
arXiv preprint arXiv:2306.17366, 2023
12023
Temporal-Difference Learning Using Distributed Error Signals
J Guan, SE Verch, C Voelcker, EC Jackson, N Papernot, WA Cunningham
arXiv preprint arXiv:2411.03604, 2024
2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
CA Voelcker, M Hussing, E Eaton, A Farahmand, I Gilitschenski
arXiv preprint arXiv:2410.08896, 2024
2024
Local-Forward: Towards Biological Plausibility in Deep Reinforcement Learning
J Guan, SE Verch, CA Voelcker, EC Jackson, N Papernot, ...
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–12