Seguir
Linrui Zhang
Linrui Zhang
Dirección de correo verificada de mails.tsinghua.edu.cn
Título
Citado por
Citado por
Año
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
L Zhang, L Shen, L Yang, S Chen, B Yuan, X Wang, D Tao
The 31st International Joint Conference on Artificial Intelligence (IJCAI), 2022
602022
Constrained Update Projection Approach to Safe Policy Optimization
L Yang, J Ji, J Dai, L Zhang, B Zhou, P Li, Y Yang, G Pan
36th Conference on Neural Information Processing Systems (NeurIPS 2022), 2022
392022
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks
L Zhang, Q Zhang, L Shen, B Yuan, X Wang, D Tao
37th AAAI Conference on Artificial Intelligence (AAAI-23), 2022
212022
Saformer: A conditional sequence modeling approach to offline safe reinforcement learning
Q Zhang, L Zhang, H Xu, L Shen, B Wang, Y Chang, X Wang, B Yuan, ...
arXiv preprint arXiv:2301.12203, 2023
182023
Are Large Language Models Really Robust to Word-Level Perturbations?
H Wang, G Ma, C Yu, N Gui, L Zhang, Z Huang, S Ma, Y Chang, S Zhang, ...
Socially Responsible Language Modelling Research (SoLaR) Workshop at NeurIPS'23, 2023
152023
CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
L Zhang, Z Peng, Q Li, B Zhou
7th Annual Conference on Robot Learning (CoRL 2023), 2023
112023
SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving
L Zhang, Q Zhang, L Shen, B Yuan, X Wang
Safe Learning for Autonomous Driving Workshop in ICML 2022, 2022
112022
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
G Ma, L Zhang, H Wang, L Li, Z Wang, Z Wang, L Shen, X Wang, D Tao
37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023
72023
Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning
L Zhang, Z Yan, L Shen, S Li, X Wang, D Tao
IROS 2022, 2022
32022
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–9