AI Notebook
A Series · The Notebook

LLM

Transformer internals and the ideas that make large language models work. Attention, positional encoding, normalization, distributed training, and mixture of experts — each post picks one mechanism and goes all the way down.

6 Stories
~116m Total Read
2026 Last Updated
Nothing matched. Try a shorter query.
Sort

End of series.

Back to AI Notebook