Knowledge Infusion Scaling Law for Pre-Training Large Language Models

(arxiv.org)

26 points | by PaulHoule 3 hours ago ago

2 comments