2 items with this tag.
papers
Paper note on activation outliers in recurrent state-space language models and why quantization difficulty survives architectural changes.
papers
Paper note on structured state space duality and why transformer intuition can transfer into linear-time sequence models.