4 items with this tag.
- papers: Paper note on preserving a tiny set of outlier-sensitive weight columns in high precision while quantizing the rest of the model aggressively.
- ideas: Hypothesis that most head-side quantization damage is concentrated in a tiny set of difficult token rows, making row-level protection a better byte trade than uniform head precision.
- papers: Paper note on decoupled low-bit training with a tiny high-precision branch for the parameters that matter most.
- notes: Synthesis note on the recurring idea that a small subset of sensitive parameters deserves better precision than the rest.