Transformers 5
- Introduction to Mechanistic Interpretability, Superposition and Sparse Autoencoders
- Representation Engineering - I: Steering Language Models With Activation Engineering
- Building GPT-2 From Scratch: Mechanistic Interpretability View
- Visualizing Attention: See what an LLM sees.
- Supervised Finetuning in LLM training workflow