Offline Reward Perturbation Boosts Distributional Shift in Online RL
Yu, Z.*, Kang, S.*, & Zhang, X. (*equal contribution). (2024, July). Offline Reward Perturbation Boosts Distributional Shift in Online RL. In The 40th Conference on Uncertainty in Artificial Intelligence.