Neural Machine Translation of Rare Words with Subword Units, Rico Sennrich, Barry Haddow, Alexandra Birch, 2016Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1 (Association for Computational Linguistics)DOI: 10.18653/v1/P16-1162 - 引入字节对编码(BPE)的原始论文,这是一种将单词分割成子词单元的方法,是许多现代分词器的基础。