wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations, Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli, 2020Advances in Neural Information Processing Systems (NeurIPS), Vol. 33 (Neural Information Processing Systems (NeurIPS)) - Introduces the influential wav2vec 2.0 model for self-supervised speech representation learning using masked prediction and a contrastive loss.
Noisy Student Training for Automatic Speech Recognition, Qiantong Xu, Alexis Conneau, Abdelrahman Mohamed, Wei-Ning Hsu, 2020Proceedings of Interspeech 2020 (International Speech Communication Association)DOI: 10.21437/Interspeech.2020-1793 - Demonstrates an effective semi-supervised self-training approach for ASR, achieving state-of-the-art results by leveraging unlabeled data.