Offline Reward Perturbation Boosts Distributional Shift in Online RL
Published in the 40th Conference on Uncertainty in Artificial Intelligence (UAI), 2024
We propose a data poisoning attack on offline-to-online reinforcement learning that perturbs rewards in the offline dataset to stealthily promote distributional shift during online learning.
Recommended citation: Yu, Z.*, Kang, S.*, & Zhang, X. (*equal contribution). (2024, July). Offline Reward Perturbation Boosts Distributional Shift in Online RL. In The 40th Conference on Uncertainty in Artificial Intelligence. https://openreview.net/pdf?id=wbwTF909Ve