BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova, 2019. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (Association for Computational Linguistics). DOI: 10.18653/v1/N19-1423 - This paper introduces BERT, a transformer-based model that significantly advanced the pre-training paradigm for natural language understanding tasks. It describes how a model acquires general language knowledge by pre-training on large amounts of unlabeled text and can then be fine-tuned for downstream tasks.
Natural Language Processing with Transformers: Building Language Applications with Hugging Face, Lewis Tunstall, Leandro von Werra, and Thomas Wolf, 2022 (O'Reilly Media) - This book offers practical guidance on using the Hugging Face Transformers library, which provides access to a wide range of pre-trained models. It covers how to apply these models to common NLP tasks such as text classification, named entity recognition, question answering, and summarization, demonstrating their accessibility and utility.
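As a brief illustration of the accessibility these references describe, the following sketch loads a pre-trained BERT checkpoint through the Hugging Face Transformers library and queries it with its masked-language-modeling objective. The checkpoint name and example sentence are illustrative assumptions, not taken from either reference.

```python
# A minimal sketch: load a pre-trained BERT model via the Transformers library
# and ask it to fill in a masked token (the objective used during pre-training).
from transformers import pipeline

# "bert-base-uncased" is an assumed, commonly available checkpoint.
unmasker = pipeline("fill-mask", model="bert-base-uncased")

# The [MASK] token marks the position the model should predict.
predictions = unmasker("The capital of France is [MASK].")
for p in predictions:
    print(f"{p['token_str']}: {p['score']:.3f}")
```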