5 items tagged "lanes".
When spending extra compute at evaluation time can beat storing more parameters.
The lane focused on closing the gap between train-time weights and the final compressed artifact.
Why parameter sharing may be the cleanest way to buy width, extra compute, or light specialization under a hard artifact cap.
Tokenization is part of the budget story, not just a preprocessing detail.
The lane for understanding what actually dominates cost and learning dynamics when training compact language models.