Part I: Foundations
Mathematical prerequisites, neural network fundamentals, and the path to Transformers
1. Mathematical Foundations: Linear algebra, calculus, probability, and information theory for deep learning (58 KB)
2. Neural Networks Deep Dive: Perceptrons to deep networks, backpropagation, activation functions, optimization (73 KB)
3. Sequence Modeling: RNNs, LSTMs, GRUs, seq2seq, and the attention revolution (32 KB)
4. The Transformer Architecture: Self-attention, multi-head attention, positional encoding, encoder-decoder (80 KB)
Part II: Language Model Training
From raw text to pretrained language models
5. Tokenization: BPE, WordPiece, Unigram, SentencePiece, multilingual tokenization (60 KB)
6. Language Modeling: Autoregressive models, masked LMs, training objectives, perplexity (70 KB)
7. Training Infrastructure: Distributed training, data/tensor/pipeline parallelism, ZeRO, mixed precision (67 KB)
8. Scaling Laws: Kaplan/Chinchilla laws, compute-optimal training, data scaling, inference scaling (60 KB)
Part III: Reinforcement Learning & Alignment
From RL fundamentals to RLHF, DPO, and modern alignment techniques
9. Reinforcement Learning Foundations: MDPs, value functions, Bellman equations, policy gradient, REINFORCE (39 KB)
10. Policy Optimization: TRPO, PPO, clipping, GAE, GRPO, reward shaping (58 KB)
11. RLHF: Reward modeling, the RLHF pipeline, KL penalties, InstructGPT, process rewards (47 KB)
12. DPO and Alignment Alternatives: DPO derivation, GRPO, Constitutional AI, RLAIF, KTO, IPO (75 KB)
Part IV: Model Families
The evolution of frontier LLMs from GPT to DeepSeek
13. The GPT Series: GPT-1 through GPT-4, architectural evolution, scaling milestones (82 KB)
14. LLaMA and Open-Source LLMs: LLaMA 1-3, Mistral, Qwen, DeepSeek, the open-source ecosystem (45 KB)
15. Frontier Models: GPT-4, Claude, Gemini, o-series reasoning models, DeepSeek V3/R1 (67 KB)
Part V: Efficiency & Optimization
Making large models practical: attention, quantization, sparsity, and adaptation
16. Attention Optimization: Flash Attention, GQA, MQA, KV cache, PagedAttention, linear attention (77 KB)
17. Quantization and Efficiency: INT8/INT4, GPTQ, AWQ, GGUF, pruning, distillation, speculative decoding (46 KB)
18. Mixture of Experts: Sparse MoE, routing algorithms, Mixtral, Switch Transformer, load balancing (40 KB)
19. Fine-Tuning and LoRA: Full fine-tuning, LoRA, QLoRA, adapters, prompt tuning, PEFT methods (68 KB)
Part VI: Advanced Capabilities
Multimodal understanding, reasoning, agents, and tool use
20. Multimodal Models: Vision-language models, CLIP, LLaVA, audio, video, Sora (64 KB)
21. Reasoning and Chain-of-Thought: CoT prompting, reasoning models (o1, R1), test-time compute, verification (65 KB)
22. Agents and Tool Use: ReAct, RAG, multi-agent systems, MCP, computer use, memory systems (144 KB)
Part VII: Safety, Interpretability & the Future
Understanding, aligning, and evolving language models
23. Interpretability and Safety: Mechanistic interpretability, probing, alignment, red-teaming, governance (79 KB)
24. Continual Learning Foundations: Catastrophic forgetting, EWC, progressive networks, replay methods (34 KB)
25. Continual Learning for LLMs: Knowledge editing, model merging, online learning, temporal adaptation (43 KB)
Part VIII: Practice & Production
Data pipelines, inference systems, evaluation, prompt engineering, and synthetic data
26. Pre-training Data Pipelines: Web crawling, deduplication, PII scrubbing, quality filtering, decontamination (50 KB)
27. Inference Systems and Serving: KV cache, PagedAttention, speculative decoding, vLLM, production deployment (50 KB)
28. Evaluation and Benchmarks: MMLU, Chatbot Arena, LLM-as-judge, safety evaluation, leaderboards (50 KB)
29. Prompt Engineering: Few-shot, chain-of-thought, structured prompting, system prompts, best practices (46 KB)
30. Synthetic Data and Self-Improvement: Self-Instruct, distillation, RLAIF, self-play, model collapse, data augmentation (50 KB)