How LLMs Learn from Text Data

References

Attention Is All You Need, Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, 2017, Advances in Neural Information Processing Systems (NeurIPS 2017), DOI: 10.48550/arXiv.1706.03762 - Introduces the Transformer architecture, the foundation of modern large language models and of how they learn.
Language Models are Few-Shot Learners, Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei, 2020, Advances in Neural Information Processing Systems (NeurIPS 2020), DOI: 10.48550/arXiv.2005.14165 - Describes the training and capabilities of GPT-3 in detail, showing how scaling up next-word prediction on massive datasets yields strong language understanding and generation.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova, 2019, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), DOI: 10.48550/arXiv.1810.04805 - Describes pre-training large Transformer models on large text corpora with self-supervised tasks such as masked language modeling, an approach that is central to learning broad language representations.
Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Stanford University) - A comprehensive academic introduction to natural language processing, including the foundational theory of language models and neural network training.

© 2025 ApX Machine Learning. Made with care.