The Hugging Face Hub Documentation, Hugging Face team, 2024 (Hugging Face) - Official documentation for the Hugging Face Hub, explaining its features and how to interact with models, datasets, and spaces.
llama.cpp: Port of Facebook's LLaMA model in C/C++, ggml-org and llama.cpp contributors, 2024 (ggml-org) - Official repository for llama.cpp, which implements the GGUF format for efficient local inference of LLMs. This helps understand the 'GGUF' format mentioned.