llama.cpp, Georgi Gerganov and the llama.cpp Contributors, 2023 - Source project for the GGUF format, detailing quantization techniques for efficient LLM inference.
Mistral 7B, Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed, 2023, arXiv, DOI: 10.48550/arXiv.2310.06825 - Introduces the Mistral 7B model, recognized for its strong performance at a smaller size, suitable for local use.