Parameter-Efficient Fine-Tuning (PEFT) encompasses a family of approaches for adapting large models without updating all of their weights. Low-Rank Adaptation (LoRA) is a highly effective and popular PEFT technique, but several other methods are available. These techniques differ in their modification strategies: some, like LoRA, alter existing weight matrices, while others add new components to the model or manipulate the input. Understanding these alternatives expands your range of tools for adapting Large Language Models (LLMs) to specific constraints and tasks.
This section provides a brief survey of two significant alternative PEFT families: Adapter-based tuning and methods that learn continuous prompts, like Prefix-Tuning and Prompt-Tuning.
One of the earliest and most intuitive PEFT methods is Adapter Tuning. The core idea is simple: leave all the pre-trained model weights completely frozen and inject new, small, trainable neural network modules inside each transformer block. These modules are called "adapters."
An adapter typically consists of a bottleneck architecture:

- A down-projection layer that compresses the hidden state from the model's hidden dimension to a much smaller bottleneck dimension.
- A nonlinear activation function applied in the bottleneck.
- An up-projection layer that restores the original hidden dimension, combined with a residual connection around the whole module.
During fine-tuning, only the parameters of these newly added adapter modules are updated. Since the bottleneck dimension is very small (e.g., 32 or 64) compared to the model's hidden dimension (e.g., 4096), the number of trainable parameters is a tiny fraction of the total.
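To make this concrete, here is a minimal PyTorch sketch of a bottleneck adapter. The class name, dimensions, and choice of activation are illustrative assumptions rather than a reference implementation.

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter: down-project, nonlinearity, up-project, residual."""
    def __init__(self, hidden_dim: int = 4096, bottleneck_dim: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)  # 4096 -> 64
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck_dim, hidden_dim)    # 64 -> 4096
        # Initialize the up-projection to zero so the adapter starts as an
        # approximate identity and does not disturb the frozen model.
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # Residual connection: the adapter only learns a small correction.
        return hidden_states + self.up(self.act(self.down(hidden_states)))
```

With these dimensions, each adapter contains roughly 2 × 4096 × 64 ≈ 0.5M parameters, a negligible fraction of a multi-billion parameter model. The zero initialization of the up-projection is a common trick that makes the adapter behave as an identity function at the start of training.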
*Figure: An adapter module inserted after the feed-forward network in a transformer block. The original model weights (blue) are frozen, while only the adapter (orange) is trained.*
The primary advantage of adapters is their modularity. You can train separate adapters for different tasks and simply "plug in" the relevant one at inference time without needing to modify the base model. This is highly efficient for serving multiple customized models. However, this approach can introduce a small amount of inference latency because it adds extra network layers that must be traversed.
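The sketch below illustrates this plug-in behavior, reusing the `Adapter` class from the previous example. The `MultiTaskBlock` name and its `set_task` method are hypothetical, purely for illustration.

```python
import torch
import torch.nn as nn

class MultiTaskBlock(nn.Module):
    """A frozen sublayer with one adapter per task, swappable at inference."""
    def __init__(self, frozen_ffn: nn.Module, hidden_dim: int, tasks: list[str]):
        super().__init__()
        self.ffn = frozen_ffn
        for p in self.ffn.parameters():
            p.requires_grad = False            # base weights stay frozen
        # One Adapter (defined in the previous sketch) per task.
        self.adapters = nn.ModuleDict({t: Adapter(hidden_dim) for t in tasks})
        self.active_task = tasks[0]

    def set_task(self, task: str) -> None:
        self.active_task = task                # "plug in" a different adapter

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.adapters[self.active_task](self.ffn(x))
```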
Instead of modifying the model's architecture, another family of methods focuses on manipulating the model's input. Prompt-Tuning and Prefix-Tuning are two leading examples that learn a "soft prompt" to steer the model's behavior.
Unlike the discrete text prompts you use with a model like ChatGPT, a soft prompt is a sequence of continuous numerical vectors. These vectors are prepended to the input embeddings and are trained via backpropagation to optimize the model's output for a specific task. The entire language model remains frozen.
Prompt-Tuning is the simplest form. It adds a sequence of trainable vectors only to the input embedding layer. These vectors are treated like virtual tokens that provide task-specific context. It's extremely parameter-efficient, sometimes requiring only a few thousand trainable parameters even for a multi-billion parameter model.
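A minimal sketch of this mechanism follows, assuming a model that accepts precomputed input embeddings. Only the `prompt` tensor is trainable; everything else stays frozen.

```python
import torch
import torch.nn as nn

class SoftPrompt(nn.Module):
    """Trainable virtual tokens prepended to the frozen input embeddings."""
    def __init__(self, num_virtual_tokens: int = 20, hidden_dim: int = 4096):
        super().__init__()
        # The only trainable parameters: num_virtual_tokens x hidden_dim.
        self.prompt = nn.Parameter(
            torch.randn(num_virtual_tokens, hidden_dim) * 0.02
        )

    def forward(self, input_embeds: torch.Tensor) -> torch.Tensor:
        # input_embeds: (batch, seq_len, hidden_dim) from the frozen embedding layer
        batch_size = input_embeds.size(0)
        prompt = self.prompt.unsqueeze(0).expand(batch_size, -1, -1)
        # The frozen model simply sees a sequence lengthened by the soft prompt.
        return torch.cat([prompt, input_embeds], dim=1)
```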
Prefix-Tuning is a more expressive variant. Instead of just adding a trainable prefix to the input embeddings, it inserts a unique, trainable prefix at the beginning of the key and value vectors in every attention layer of the transformer. This gives the model more direct, layer-by-layer control over its internal representations and generation process.
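The sketch below shows how a trainable prefix enters a single attention layer; one such module exists per layer. The tensor shapes and names are assumptions for illustration, and in a real implementation the attention mask must also be extended to cover the prefix positions.

```python
import torch
import torch.nn as nn

class KVPrefix(nn.Module):
    """Trainable key/value prefix for a single attention layer."""
    def __init__(self, prefix_len: int = 20, num_heads: int = 32,
                 head_dim: int = 128):
        super().__init__()
        self.k_prefix = nn.Parameter(torch.randn(prefix_len, num_heads, head_dim) * 0.02)
        self.v_prefix = nn.Parameter(torch.randn(prefix_len, num_heads, head_dim) * 0.02)

    def forward(self, k: torch.Tensor, v: torch.Tensor):
        # k, v: (batch, num_heads, seq_len, head_dim)
        batch = k.size(0)
        kp = self.k_prefix.permute(1, 0, 2).unsqueeze(0).expand(batch, -1, -1, -1)
        vp = self.v_prefix.permute(1, 0, 2).unsqueeze(0).expand(batch, -1, -1, -1)
        # Queries now attend over the trainable prefix plus the real tokens.
        return torch.cat([kp, k], dim=2), torch.cat([vp, v], dim=2)
```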
*Figure: Comparison of input modification techniques. Prompt-Tuning adds a trainable soft prompt to the input embeddings, while Prefix-Tuning inserts trainable vectors into each attention block.*
These methods are exceptionally lightweight. Since you only train and store the small soft prompt vectors for each task, you can customize a single frozen base model for hundreds of different tasks with minimal storage overhead: a 20-token soft prompt for a model with a hidden dimension of 4096 is only 20 × 4096 = 81,920 parameters, roughly 0.3 MB in 32-bit precision. The main challenge is that their performance can sometimes lag behind more invasive methods like LoRA or full fine-tuning, as they have less influence over the model's internal computations.
The best PEFT method depends on your specific goals, balancing performance requirements with computational and storage constraints. LoRA often provides a strong balance of performance and efficiency, which contributes to its popularity. However, adapters offer superior modularity for multi-task deployments, and prompt-based methods are unmatched in their parameter efficiency.
Here is a summary comparing the techniques discussed:
| Method | Core Idea | Trainable Parameters | Inference Latency | Primary Advantage |
|---|---|---|---|---|
| LoRA | Decomposes weight update matrices into low-rank factors. | Very Low | None (can be merged) | Excellent performance with high parameter efficiency. |
| Adapter Tuning | Injects small, trainable modules between frozen layers. | Very Low | Minor Increase | High modularity; easily swap adapters for different tasks. |
| Prompt-Tuning | Learns a continuous "soft prompt" prepended to the input. | Extremely Low | Negligible | Minimal storage; ideal for customizing one model for many tasks. |
| Prefix-Tuning | Learns continuous prefixes for keys/values in each attention block. | Extremely Low | Negligible | More expressive than Prompt-Tuning while remaining highly efficient. |
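In practice, you rarely implement these methods from scratch. As a sketch, assuming the Hugging Face peft library and a small causal language model, switching between methods largely comes down to which configuration object you pass to `get_peft_model`:

```python
from transformers import AutoModelForCausalLM
from peft import (LoraConfig, PrefixTuningConfig, PromptTuningConfig,
                  TaskType, get_peft_model)

# One configuration object per PEFT method; hyperparameters are illustrative.
configs = {
    "LoRA": LoraConfig(task_type=TaskType.CAUSAL_LM, r=8, lora_alpha=16),
    "Prompt-Tuning": PromptTuningConfig(task_type=TaskType.CAUSAL_LM,
                                        num_virtual_tokens=20),
    "Prefix-Tuning": PrefixTuningConfig(task_type=TaskType.CAUSAL_LM,
                                        num_virtual_tokens=20),
}

for name, cfg in configs.items():
    base = AutoModelForCausalLM.from_pretrained("gpt2")  # small model for demo
    peft_model = get_peft_model(base, cfg)
    print(name)
    peft_model.print_trainable_parameters()  # trainable vs. total parameter count
```

Classic bottleneck adapters are not part of peft; they are provided by the separate AdapterHub `adapters` library instead. Calling `print_trainable_parameters()` on each wrapped model makes the efficiency differences summarized in the table above directly visible.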