Language Model Contains Personality Subnetworks

(arxiv.org)

41 points | by PaulHoule 7 hours ago ago

26 comments