1 item with this tag.
Mar 19, 2026
papers
Paper note on making LLM training explicitly produce lower-rank, more compressible weights by constraining Muon updates with a nuclear-norm budget.