T-Mac: Low-bit LLM inference on CPU/NPU with lookup table

(github.com)

5 points | by nateb2022 13 hours ago

No comments yet.