Writing an LLM from scratch, part 20 – starting training, and cross entropy loss

(gilesthomas.com)

39 points | by gpjt 2 days ago ago

3 comments