1 item with this tag.
Mar 19, 2026
papers
Paper note on cross-layer parameter sharing and factorized embeddings as two clean ways to reduce stored parameters without simply shrinking hidden capacity.