GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

(twitter.com)

31 points | by laxmena 3 hours ago

9 comments