November 06, 2024
Safe Policy Optimization With Stretchable Penalties
Ning Pang, Longyang Huang, Botao Dong, et al.