HN
New
Show
Ask
Job
Built with Remix
Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention
(substack.com)
2 points | by
eigenBasis
9 hours ago ago
No comments yet.
No comments yet.