WaveNet: A Generative Model for Raw Audio, Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, Koray Kavukcuoglu, 2016. arXiv preprint arXiv:1609.03499. DOI: 10.48550/arXiv.1609.03499 - The foundational paper introducing the WaveNet architecture, including causal and dilated convolutions, for high-fidelity raw audio generation.
Efficient Neural Audio Synthesis, Nal Kalchbrenner, Erich Elsen, Karen Simonyan, Seb Noury, Norman Casagrande, Edward Lockhart, Florian Stimberg, Aaron van den Oord, Sander Dieleman, Koray Kavukcuoglu, 2018. International Conference on Machine Learning (ICML). DOI: 10.48550/arXiv.1802.08435 - The original paper introducing WaveRNN, an efficient autoregressive model for neural audio synthesis, with a focus on speed optimization.