Transfer Learning from Speaker Verification to Multispeaker Text-to-Speech Synthesis, Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, Yonghui Wu, 2018Advances in Neural Information Processing Systems 31 (NeurIPS 2018) (Neural Information Processing Systems Foundation, Inc. (NeurIPS)) - 本文提出了一种稳健的多说话人表达性文本转语音方法,通过训练风格编码器从参考音频中提取固定维度的嵌入,实现零样本风格迁移。