Lucas Beyer (Twitter: @lucasbey)
- Transformer tutorial slides (must-read): Link
- Talks Collection
- Understanding and Coding the Self-Attention Mechanism of Large Language Models From Scratch from Sebastian Raschka
- Transformer Puzzles from Professor Alexander Rush. Also check out his Puzzle collection.