Hugging Face Transformers: Share and Use Models, Hugging Face, 2024 - A comprehensive guide to saving and loading models, configurations, and tokenizers using the save_pretrained and from_pretrained methods of the Hugging Face Transformers library.
safetensors GitHub Repository, Hugging Face, 2022 - The official repository providing technical specifications, implementations, and background on the safetensors format, emphasizing its security and performance advantages over traditional serialization methods.
Hugging Face PEFT Documentation: Package Structure, Hugging Face, 2024 - Documentation describing the procedures for saving, loading, and integrating Parameter-Efficient Fine-Tuning (PEFT) adapters, such as LoRA, with base language models.
ONNX (Open Neural Network Exchange) Documentation, ONNX Community, 2024 (Linux Foundation AI & Data) - The official resource for understanding the ONNX format, its ecosystem, and how it enables model interoperability and optimized inference across various hardware and runtimes.