Epoch v0 - Kickoff Meeting
EPOCH v0 - Kickoff Meeting
The Course
Stanford CS336: Language Modelling From Scratch
Topics covered:
- Transformer architectures and their implementations in PyTorch
- Distributed training
- GPU kernels for optimizing inference algorithms
- Data wrangling techniques for LLM training
- RL algorithms for post-training alignment and reasoning
Prerequisites
Fair Warning: This is a hard course
The website lists prerequisites that may even be understating the difficulty
- Proficiency in Python
- Experience with deep learning and systems optimization
- College Calculus and Linear Algebra
- Basic Probability and Statistics
- Foundational Machine Learning knowledge
Prerequisites (cont.)
Learning under pressure leads to the best outcomes.
Resources available:
- Documentation
- Papers
- Communities
- Each other
Course Structure
5 Assignments Total
Meetings every Wednesday to discuss the previous assignment.
Accountability
A public list of assignment completions will be maintained and published on the website.
Compute Requirements
Some assignments require paid compute (GPU access) for full implementation.
GPU: Ideal, but costs money
CPU: Possible for some work if you have enough memory — slower, but workable for certain tasks.
Assignment 1
- Follow lectures through Lecture 5
- Implement the core components: tokenizer, model architecture, and optimizer
- Train a minimal language model
See you on 10th December!