Gwern – HackTech.info

I think this is missing a major piece of the self-play scaling paradigm, one which has been weirdly absent in most discussions of o1 as well: much of the point of a model like o1 is not to deploy it, but to generate training data for the next model. It was cool that

By HackTech, January 22, 2025
