An Empirical Study of Quantization in Transformers, Tim Dettmers, Elias Frantar, Saleh Ashkboos, Torsten Hoefler, Dan Alistarh, 2022, Proceedings of Machine Learning Research, Vol. 162 (Proceedings of Machine Learning Research) - Examines the effects and challenges of quantizing transformer models, offering insights into common accuracy and numerical issues discussed in this section.
bitsandbytes documentation, Hugging Face, 2024 (Hugging Face) - Official documentation for the bitsandbytes library, providing practical details on configuration, usage, and troubleshooting for LLM quantization.