It's because there is nothing novel here from an architectural point of view. Ag... | Hacker News

lossolo on Sept 12, 2024 | on: Learning to Reason with LLMs

It's because there is nothing novel here from an architectural point of view. Again, the secret sauce is only in the training data. O1 seems like a variant of RLRF: https://arxiv.org/abs/2403.14238

Soon you will see similar models from competitors.
