Thiago D. Simão
Title
Cited by
Year
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
Cited by: 149; Year: 2021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
AAMAS, 1226-1235, 2021
Cited by: 52; Year: 2021
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning 112 (3), 859-887, 2023
Cited by: 50; Year: 2023
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
AAAI, 4967-4974, 2019
Cited by: 35; Year: 2019
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
AAMAS, 1269-1277, 2020
Cited by: 34*; Year: 2020
Robust anytime learning of Markov decision processes
M Suilen, TD Simão, D Parker, N Jansen
NeurIPS, 28790-28802, 2022
Cited by: 31; Year: 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
ICLR, 2023
Cited by: 17; Year: 2023
Decision-making under uncertainty: beyond probabilities: Challenges and perspectives
T Badings, TD Simão, M Suilen, N Jansen
International Journal on Software Tools for Technology Transfer 25 (3), 375-391, 2023
Cited by: 16; Year: 2023
Reinforcement Learning by Guided Safe Exploration
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
ECAI, 2858-2865, 2023
Cited by: 14*; Year: 2023
Safe policy improvement for POMDPs via finite-state controllers
TD Simão, M Suilen, N Jansen
AAAI, 15109-15117, 2023
Cited by: 13; Year: 2023
Act-then-measure: reinforcement learning for partially observable environments with active measuring
M Krale, TD Simão, N Jansen
ICAPS, 212-220, 2023
Cited by: 11; Year: 2023
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
IJCAI, 3453-3459, 2019
Cited by: 11; Year: 2019
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
ITSC, 4017-4023, 2022
Cited by: 10; Year: 2022
Scalable Safe Policy Improvement via Monte Carlo Tree Search
A Castellini, F Bianchi, E Zorzi, TD Simão, A Farinelli, MTJ Spaan
ICML, 3732-3756, 2023
Cited by: 8; Year: 2023
Recursive small-step multi-agent A* for dec-POMDPs
W Koops, N Jansen, S Junges, TD Simão
IJCAI, 5402-5410, 2023
Cited by: 6; Year: 2023
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen
IJCAI, 4406-4415, 2023
Cited by: 6; Year: 2023
When a Robot Reaches Out for Human Help
I Andrés, LN de Barros, DD Mauá, TD Simão
Ibero-American Conference on Artificial Intelligence, 277-289, 2018
Cited by: 3; Year: 2018
Maintenance Strategies for Sewer Pipes with Multi-State Degradation and Deep Reinforcement Learning
LA Jimenez-Roa, TD Simão, Z Bukhsh, T Tinga, H Molegraaf, N Jansen, ...
PHM Society European Conference 8 (1), 14-14, 2024
Cited by: 2; Year: 2024
Risk-aware curriculum generation for heavy-tailed task distributions
C Koprulu, TD Simão, N Jansen, U Topcu
UAI, 1132-1142, 2023
Cited by: 2; Year: 2023
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
Cited by: 2; Year: 2017