2 items with this tag.
lanes
The lane for understanding what actually dominates cost and learning dynamics when training compact language models.
notes
Concept note on why tokenization changes not just sequence length but the whole byte and compute budget of compact language models.