Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Pearson) - A comprehensive textbook on natural language processing, which covers foundational concepts of text representation and data preparation for machine learning models.
tf.keras.preprocessing.sequence.pad_sequences, TensorFlow Developers, 2024 (Google) - Official documentation for Keras' utility function to pad sequences to a uniform length, detailing its parameters and usage.
torch.nn.utils.rnn.pad_sequence, PyTorch Developers, 2024 (PyTorch) - Official documentation for PyTorch's utility function designed to pad a list of variable-length sequences into a single padded batch tensor.
Stanford CS224N: Natural Language Processing with Deep Learning, Diyi Yang, Tatsunori Hashimoto, 2023 (Stanford University) - Course materials from a highly-regarded university course on deep learning for natural language processing, often including practical data preparation techniques.