GGUF File Format, Georgi Gerganov and llama.cpp contributors, 2024 - Official specification and explanation of the GGUF file format, developed for efficient local LLM inference within the llama.cpp project.
safetensors Documentation, Hugging Face and safetensors contributors, 2024 - Official documentation for the safetensors library, which provides a safe and efficient way to save and load model weights, offering an alternative to pickle-based serialization.
Saving and Loading Models, Matthew Inkawhich (PyTorch Foundation), 2024 - The official PyTorch tutorial on saving and loading models, demonstrating how models trained with PyTorch are typically stored and retrieved using native framework formats.
Save and load Keras models, TensorFlow Team and contributors, 2024 - The official TensorFlow guide for saving and loading models using the Keras API, covering formats such as SavedModel, which are native to the TensorFlow ecosystem.