GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium, Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, Sepp Hochreiter, 2017Advances in Neural Information Processing Systems 30 (NIPS 2017), Vol. 30DOI: 10.48550/arXiv.1706.08500 - Introduces the Fréchet Inception Distance (FID) as a widely used quantitative metric, providing context for its strengths and limitations, which necessitates complementary qualitative evaluation.
Denoising Diffusion Probabilistic Models, Jonathan Ho, Ajay Jain, Pieter Abbeel, 2020Advances in Neural Information Processing Systems 33 (NeurIPS 2020)DOI: 10.48550/arXiv.2006.11239 - This foundational paper for modern diffusion models presents generated images and discusses their visual characteristics, which is key for qualitative assessment of diffusion model outputs and their common artifacts.
Evaluating Generative Models via the Perceptual Realism of Their Samples, Alireza Poursaeed, Phillip Isola, Bryan Catanzaro, Edward H. Adelson, 2018Advances in Neural Information Processing Systems 31 (NeurIPS 2018) (NeurIPS Foundation)DOI: 10.55917/tgd3.31 - A focused work on systematic human evaluation methods, including the 'real vs. fake' discrimination task and preference judgments, directly addressing the perceptual realism of synthetic data.