Adversarial Audio Synthesis, Chris Donahue, Julian McAuley, Miller Puckette, 2019International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.1802.04208 - 介绍了WaveGAN,这是一种使用一维卷积GAN进行直接原始音频波形合成的开创性模型,通过相位洗牌解决生成挑战。
Improved Training of Wasserstein GANs, Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, Aaron C. Courville, 2017Advances in Neural Information Processing Systems (NeurIPS), Vol. 30DOI: 10.48550/arXiv.1704.00028 - 提出了带有梯度惩罚的Wasserstein GAN,这是一种用于稳定GAN训练的强大且被广泛采用的技术,对WaveGAN等模型尤为重要。