Explaining and Harnessing Adversarial Examples, Ian J. Goodfellow, Jonathon Shlens, and Christian Szegedy, 2015International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.1412.6572 - Introduces the Fast Gradient Sign Method (FGSM) and discusses fundamental properties of adversarial examples, laying groundwork for white-box evasion attacks.
Towards Evaluating the Robustness of Neural Networks, Nicholas Carlini and David A. Wagner, 2017IEEE Symposium on Security and Privacy (SP) (IEEE)DOI: 10.1109/SP.2017.49 - Introduces powerful optimization-based white-box adversarial attacks (C&W attacks) and details methods for generating both targeted and untargeted adversarial examples.