Adversarial Reinforcement Learning

Adversarial Reinforcement Learning

Research papers covered in this post will showcase the landscape of attacks on RL agents and the optimal attack strategies, which is crucial for understanding security threats against the deployed systems. In particular, the research papers will cover optimal attack strategies for test-time, backdoor, and training-time (environment poisoning) attacks on RL agents. These research papers provides better perspective of important problems for developing robust and secure algorithms in sequential decision-making settings. This repo is result of Course - Adversarial Reinforcement Learning, I followed at Saarland University.

List of research papers covered

Test-time attacks

  1. Tactics of Adversarial Attack on Deep Reinforcement Learning Agents by Y. Lin, Z. Hong, Y. Liao, M. Shih, M. Liu, and M. Sun, at IJCAI 2017.

  2. Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning by J. Sun, T. Zhang, X. Xie, L. Ma, Y. Zheng, K. Chen, and Y. Liu, at AAAI 2020.

  3. Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations by H. Zhang, H. Chen, C. Xiao, B. Li, M. Liu, D. Boning, and C. Hsieh, at NeurIPS 2020.

Backdoor attacks

  1. TrojDRL: Evaluation of Backdoor Attacks on Deep Reinforcement Learning by P. Kiourti, K. Wardega, S. Jha, and W. Li, at DAC 2020.

  2. Temporal Watermarks for Deep Reinforcement Learning Modelss by K. Chen, S. Guo, T. Zhang, S. Li, and Y. Liu, at AAMAS 2021.

Training-time attacks

  1. Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning by A. Rakhsha, G. Radanovic, R. Devidze, X. Zhu, and A. Singla, at ICML 2020.

  2. Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics by Y. Sun, D. Huo, and F. Huang, at ICLR 2021.

  3. Defense Against Reward Poisoning Attacks in Reinforcement Learning by K. Banihashem, A. Singla, and G. Radanovic, at arXiv preprint 2021.

Summary/Overview

Find the summary of above papers Here