7 items with this tag.
moonshots
Moonshot hypothesis that the shape of protected exceptions may matter more than the exact saliency ranking, because structured exception maps can compress better than irregular ones.
papers
Paper note on preserving a tiny set of outlier-sensitive weight columns in high precision while quantizing the rest of the model aggressively.
hypotheses
Hypothesis that protecting a tiny subset of highly sensitive parameters buys disproportionately large quality gains under a strict artifact cap.
notes
Concept note on why outliers dominate low-bit failure and why most serious compression methods end up treating them specially.
papers
Paper note on activation-aware weight quantization and the claim that a tiny set of salient channels dominates low-bit error.
papers
Paper note on hardware-aware outlier-preserving quantization and why selective protection must still respect deployment efficiency.
papers
Paper note on pushing post-training quantization below 2 bits by preserving salient structure with unusually low overhead.
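The idea shared by several of these notes, keeping a tiny set of sensitive columns in high precision while quantizing everything else aggressively, can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from any of the listed papers: the weights are synthetic, the quantizer uses one shared symmetric scale for the whole matrix, saliency is approximated by peak column magnitude, and the helper names (`quantize_matrix`, `top_columns_by_peak`, `mean_squared_error`) are hypothetical.

```python
import random


def quantize_matrix(weights, bits, protected_cols=frozenset()):
    """Round-trip a matrix through symmetric uniform quantization.

    One scale is shared by all unprotected entries (an assumed, simplified
    scheme). Columns listed in protected_cols are returned unchanged.
    """
    if bits < 2:
        raise ValueError("this sketch needs at least 2 bits")
    levels = 2 ** (bits - 1) - 1  # largest positive integer code
    rest = [v for row in weights
            for j, v in enumerate(row) if j not in protected_cols]
    peak = max((abs(v) for v in rest), default=0.0)
    scale = peak / levels if peak > 0 else 1.0
    return [
        [v if j in protected_cols else round(v / scale) * scale
         for j, v in enumerate(row)]
        for row in weights
    ]


def top_columns_by_peak(weights, k):
    """Pick the k columns with the largest absolute entry (a crude saliency proxy)."""
    n_cols = len(weights[0])
    peaks = [max(abs(row[j]) for row in weights) for j in range(n_cols)]
    ranked = sorted(range(n_cols), key=lambda j: peaks[j], reverse=True)
    return frozenset(ranked[:k])


def mean_squared_error(a, b):
    """Mean squared difference over all entries of two equally shaped matrices."""
    total = sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    return total / (len(a) * len(a[0]))


# Synthetic weights: small Gaussian entries plus one inflated outlier column.
random.seed(0)
rows, cols = 64, 32
weights = [[random.gauss(0.0, 0.02) for _ in range(cols)] for _ in range(rows)]
for row in weights:
    row[5] *= 50.0  # column 5 plays the role of the outlier column

# Baseline: everything at 3 bits, so the outlier column sets the shared scale.
baseline = quantize_matrix(weights, bits=3)

# Mixed precision: keep 1 of 32 columns exact, quantize the rest at 3 bits.
protected = top_columns_by_peak(weights, k=1)
mixed = quantize_matrix(weights, bits=3, protected_cols=protected)

err_baseline = mean_squared_error(weights, baseline)
err_mixed = mean_squared_error(weights, mixed)
print(f"protected columns: {sorted(protected)}")
print(f"MSE all 3-bit: {err_baseline:.3e}   MSE with protection: {err_mixed:.3e}")
```

In this toy setting the outlier column inflates the shared scale so much that ordinary weights collapse toward zero, and excluding that one column lets the remaining weights use the full code range. Real methods differ in how they choose what to protect and how they store the exceptions, which is where the overhead and hardware-efficiency concerns raised in the notes above come in.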