LoRA: Low-Rank Adaptation of Large Language Models, Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen, 2021, arXiv preprint arXiv:2106.09685, DOI: 10.48550/arXiv.2106.09685 - Introduces Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning method that freezes the pretrained weights and injects trainable low-rank update matrices into each layer (see the sketch after this list).
Parameter-Efficient Fine-Tuning (PEFT), Hugging Face, 2024, https://huggingface.co/docs/peft - Official documentation for the Hugging Face PEFT library, detailing how to configure and apply LoRA adapters when fine-tuning (a minimal usage sketch follows this list).
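To make the method in the first reference concrete: LoRA keeps a pretrained weight matrix W0 frozen and learns a rank-r update ΔW = BA, so the adapted layer computes h = W0x + (α/r)BAx. Below is a minimal PyTorch sketch of this reparameterization; the `LoRALinear` wrapper and its initialization scale are illustrative choices, not code from the paper.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen nn.Linear with a trainable low-rank update (hypothetical helper)."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)  # freeze the pretrained weights W0
        # Per the paper: A is Gaussian-initialized, B starts at zero, so BA = 0
        # and training begins from the unmodified pretrained model.
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # h = W0 x + (alpha / r) * B A x
        return self.base(x) + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

# Example: adapt a 768-dimensional projection with a rank-8 update,
# training 2 * 768 * 8 parameters instead of 768 * 768.
layer = LoRALinear(nn.Linear(768, 768), r=8)
out = layer(torch.randn(2, 768))
```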
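And a minimal sketch of the same idea through the PEFT library from the second reference, following the pattern in its documentation. The base model and hyperparameter values here are illustrative assumptions, not settings prescribed by the docs.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Base model chosen purely for illustration.
base_model = AutoModelForCausalLM.from_pretrained("gpt2")

config = LoraConfig(
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling factor; alpha / r multiplies the update
    target_modules=["c_attn"],  # GPT-2's fused QKV attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

# Wrap the base model; only the injected LoRA matrices are trainable.
model = get_peft_model(base_model, config)
model.print_trainable_parameters()
```

The wrapped model can then be trained with an ordinary training loop or the `transformers` Trainer; only the small LoRA matrices receive gradients, which is what makes the approach parameter-efficient.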