Featured Leanpub Book
My Adventures with Large Language Models
Build foundational LLMs from Transformers to DeepSeek, from scratch, in PyTorch.
Build GPT-2, Llama 3, and DeepSeek from scratch in PyTorch. Every chapter has runnable end-to-end code and loads real pretrained weights. Goes well past where most LLM tutorials stop.










































