Tokenization (from Hugging Face Course), Hugging Face, 2024 (Hugging Face) - Provides a practical and accessible explanation of tokenization, including different types like subword tokenization, relevant for LLMs.
Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Pearson) - A comprehensive textbook covering fundamental NLP concepts including tokenization, word embeddings, and their evolution.
CS224N: Natural Language Processing with Deep Learning, Stanford University, 2024 - Offers comprehensive lecture notes and materials on deep learning methods for NLP, including detailed explanations of word vectors and embeddings.