In this section, we perform full parameter fine-tuning on a small-scale generative model, adapting a pre-trained model to a specific question-answering style. The entire workflow is covered, from loading the data to generating text with the newly specialized model.
We will use the Qwen/Qwen2.5-0.5B model, a smaller and more manageable member of the Qwen family, which makes it practical to run this example on a single GPU such as those available in Google Colab or Kaggle Kernels. The dataset will be a small subset of SQuAD (the Stanford Question Answering Dataset), which contains questions and answers grounded in reading passages.
Before we write the code, let's outline the steps we will take. The entire process follows a structured pipeline, which is a common pattern in machine learning projects.
The process begins with data and model preparation, moves to the training cycle where the model's weights are updated, and concludes with saving the model for future use.
First, ensure you have the necessary libraries installed. The transformers library provides the models and the Trainer API, datasets helps manage our data, and accelerate handles device placement and speeds up PyTorch training.
pip install datasets tokenizers huggingface-hub transformers accelerate evaluate torch vllm
We will also log in to the Hugging Face Hub if we want to save our model checkpoints online. This step is optional but good practice.
from huggingface_hub import notebook_login
notebook_login()
We will use a small portion of the squad dataset to keep the training time short. Our goal is to format the question-and-answer pairs into a single string that the model will learn to generate.
Let's load the dataset and create a simple formatting function. We'll format each example as question: [QUESTION] answer: [ANSWER]. This structure teaches the model to produce an answer when it sees the "question:" prefix.
from datasets import load_dataset
# Load a small part of the training set
train_dataset = load_dataset("squad", split="train[:5000]")
# Split the 5000 examples into training and validation sets
train_test_split = train_dataset.train_test_split(test_size=0.1)
train_dataset = train_test_split["train"]
eval_dataset = train_test_split["test"]
# Define the formatting function
def format_qa(example):
    # SQuAD stores answers as a list; take the first answer span
    question = example["question"]
    answer = example["answers"]["text"][0]
    return f"question: {question} answer: {answer}"
Data Formatting is Task Definition
The way you structure your data is fundamental. By formatting our data as question: ... answer: ..., we are implicitly teaching the model a specific task: given a string that starts with question:, complete it with a relevant answer:.
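To confirm the formatting behaves as expected, you can print a single formatted example before tokenizing. This is a quick sanity check; the exact text depends on which SQuAD rows ended up in your split.
# Sanity check: inspect one formatted training example
print(format_qa(train_dataset[0]))
# Expected shape: "question: <original question> answer: <first answer span>"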
Now, we need to load the tokenizer and apply it to our formatted dataset. The tokenizer converts our text strings into the integer IDs that the model understands. We will also set a pad_token to handle inputs of varying lengths.
from transformers import AutoTokenizer
# Load the tokenizer for Qwen/Qwen2.5-0.5B
model_name = "Qwen/Qwen2.5-0.5B"
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token # Set padding token
def tokenize_function(examples):
    formatted_examples = [
        format_qa({"question": q, "answers": a})
        for q, a in zip(examples["question"], examples["answers"])
    ]
    tokenized = tokenizer(formatted_examples, truncation=True, padding="max_length", max_length=128)
    tokenized["labels"] = tokenized["input_ids"].copy()  # For causal LM training, labels mirror the inputs
    return tokenized
# Apply tokenization to the datasets
tokenized_train_dataset = train_dataset.map(tokenize_function, batched=True, remove_columns=train_dataset.column_names)
tokenized_eval_dataset = eval_dataset.map(tokenize_function, batched=True, remove_columns=eval_dataset.column_names)
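Because we copied input_ids directly into labels, the padded positions also contribute to the loss. That is acceptable for a first pass, but a common refinement is to mask padded positions in the labels with -100, the index the loss function ignores. A minimal sketch of that variant, using the attention mask to find padding, might look like this (tokenize_function_masked is a hypothetical helper name):
# Optional refinement: ignore padded positions in the loss
def tokenize_function_masked(examples):
    formatted_examples = [
        format_qa({"question": q, "answers": a})
        for q, a in zip(examples["question"], examples["answers"])
    ]
    tokenized = tokenizer(formatted_examples, truncation=True, padding="max_length", max_length=128)
    # Keep real tokens as labels; set padded positions (attention_mask == 0) to -100
    tokenized["labels"] = [
        [tok if mask == 1 else -100 for tok, mask in zip(ids, attn)]
        for ids, attn in zip(tokenized["input_ids"], tokenized["attention_mask"])
    ]
    return tokenized
# To use it, pass tokenize_function_masked to .map() in place of tokenize_function above.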
With our data ready, we can load the Qwen/Qwen2.5-0.5B model. We use AutoModelForCausalLM because our task is text generation, also known as causal language modeling.
from transformers import AutoModelForCausalLM, TrainingArguments, Trainer
# Load the pre-trained model
model = AutoModelForCausalLM.from_pretrained(model_name)
Next, we define the TrainingArguments. This object contains all the hyperparameters for the training run, such as the learning rate, number of epochs, and batch size. These settings directly control the gradient descent update process discussed at the start of the chapter.
training_args = TrainingArguments(
    output_dir="qwen2.5-0.5b-squad-finetuned",
    num_train_epochs=3,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    learning_rate=2e-5,
    weight_decay=0.01,
    eval_strategy="epoch",
    save_strategy="epoch",
    logging_steps=100,
    load_best_model_at_end=True,
    push_to_hub=False,  # Set to True if you are logged in and want to push
)
We are now ready to bring everything together in the Trainer object. It requires the model, training arguments, datasets, and tokenizer.
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_train_dataset,
    eval_dataset=tokenized_eval_dataset,
    tokenizer=tokenizer,
)
# Start fine-tuning
trainer.train()
When you run trainer.train(), you will see a progress bar that reports the training loss and other metrics. This output is your primary tool for monitoring the process, as covered in the "Monitoring Training" section. A steadily decreasing loss indicates that the model is learning from your data. With 4,500 training examples and a per-device batch size of 8, each epoch takes roughly 563 optimizer steps, which is why the first validation loss appears at step 563 in the sample output below.
/opt/conda/lib/python3.10/site-packages/transformers/trainer.py:...
***** Running training *****
Num examples = 4500
Num Epochs = 3
Instantaneous batch size per device = 8
...
Step | Training Loss | Validation Loss
100 | 1.503200 | N/A
200 | 1.354100 | N/A
...
563 | 1.298700 | 1.251142
...
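If you prefer to inspect these numbers programmatically rather than reading the progress table, the Trainer records every logged metric in trainer.state.log_history. A short sketch:
# Print the recorded training and evaluation losses after training
for entry in trainer.state.log_history:
    if "loss" in entry:
        print(f"step {entry['step']}: train loss {entry['loss']:.4f}")
    elif "eval_loss" in entry:
        print(f"step {entry['step']}: eval loss {entry['eval_loss']:.4f}")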
After training completes, the Trainer will have automatically evaluated the model on the validation set at the end of each epoch. You can also trigger a final evaluation manually.
import math
eval_results = trainer.evaluate()
print(f"Perplexity: {math.exp(eval_results['eval_loss']):.2f}")
Perplexity is a common metric for language models: it is the exponential of the average cross-entropy loss and measures how well the model predicts a sample of text, which is why we apply math.exp to eval_loss above. A lower perplexity score indicates better performance.
The Trainer saves the best model checkpoint in the output_dir specified in TrainingArguments. You can also save it manually to a different location.
# Save the model and tokenizer
trainer.save_model("my_finetuned_qwen2.5-0.5b")
tokenizer.save_pretrained("my_finetuned_qwen2.5-0.5b")
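If you logged in to the Hugging Face Hub earlier, you can also upload the checkpoint so it is available from anywhere. This is optional and assumes a valid access token; by default the repository name is derived from output_dir.
# Optional: push the fine-tuned model and tokenizer to the Hugging Face Hub
trainer.push_to_hub()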
The final step is to use your model for inference. Let's see if it has learned to answer questions in the desired format. We can use the pipeline utility for a straightforward test.
from transformers import pipeline
# Load the fine-tuned model for inference
finetuned_model = AutoModelForCausalLM.from_pretrained("my_finetuned_qwen2.5-0.5b")
finetuned_tokenizer = AutoTokenizer.from_pretrained("my_finetuned_qwen2.5-0.5b")
# Create a text generation pipeline
generator = pipeline("text-generation", model=finetuned_model, tokenizer=finetuned_tokenizer)
# A new question in the format our model expects
prompt = "question: What is the main purpose of the immune system?"
# Generate an answer
result = generator(prompt, max_length=100, num_return_sequences=1)
print(result[0]['generated_text'])
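Generation behavior can be tuned through the standard generate keyword arguments that the pipeline passes through. For example, capping the number of newly generated tokens and disabling sampling tends to produce shorter, more deterministic answers; the values below are illustrative.
# A more controlled generation call: limit new tokens and use greedy decoding
result = generator(prompt, max_new_tokens=40, do_sample=False)
print(result[0]["generated_text"])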
You should see an output that starts with your prompt and continues with a generated answer, following the style of the SQuAD dataset. The model has successfully adapted its behavior from a general-purpose text generator to a more specialized question-answerer.
This hands-on exercise demonstrates the end-to-end process of full parameter fine-tuning. While effective, updating every single weight is computationally expensive. In the next chapter, we will look at more efficient techniques that can achieve comparable results with a fraction of the computational cost.