I tested o1-preview on some coding stuff I've been using gpt-4o for. I am not impressed. The new, more intentional chain-of-thought logic is apparently not something it can meaningfully apply to a non-trivial codebase.
Sadly, I think this OpenAI announcement is hot air, and I am now (unfortunately) much less enthusiastic about upcoming OpenAI announcements. This is the first one that has been extremely underwhelming, though in hindsight the big announcement about structured responses (months after nearly identical behavior was already available via JSON Schema) was also hot air.
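On the structured-responses point: schema-constrained output was already achievable through function calling, where the `parameters` field is itself a JSON Schema. Here is a rough sketch of the two routes side by side using the openai Node SDK (the model names and the toy schema are placeholders of mine, not anything from the announcement):

```typescript
import OpenAI from "openai";

const client = new OpenAI();

// A plain JSON Schema describing the shape we want back; both routes use it.
const schema = {
  type: "object",
  properties: { answer: { type: "string" } },
  required: ["answer"],
  additionalProperties: false,
};

async function main() {
  // Older route: function calling. `parameters` is a JSON Schema, and forcing
  // the tool choice makes the model return arguments matching that schema.
  const viaTools = await client.chat.completions.create({
    model: "gpt-4o",
    messages: [{ role: "user", content: "What is 2 + 2?" }],
    tools: [
      { type: "function", function: { name: "report_answer", parameters: schema } },
    ],
    tool_choice: { type: "function", function: { name: "report_answer" } },
  });
  console.log(viaTools.choices[0].message.tool_calls?.[0].function.arguments);

  // Newer route: the "structured outputs" launch. Same schema, different knob.
  const viaResponseFormat = await client.chat.completions.create({
    model: "gpt-4o-2024-08-06",
    messages: [{ role: "user", content: "What is 2 + 2?" }],
    response_format: {
      type: "json_schema",
      json_schema: { name: "report_answer", strict: true, schema },
    },
  });
  console.log(viaResponseFormat.choices[0].message.content);
}

main();
```

The one genuinely new piece was `strict: true`, which guarantees schema-conformant output rather than merely steering toward it. Hence "nearly identically" rather than "identically".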
I think OpenAI is making the same mistake Google made with the search interface. Rather than treating it as a command line to be mastered, Google optimized for better results for someone who had no idea how to phrase a search.
Similarly, OpenAI is optimizing for someone who doesn't know how to interact with a context-limited LLM. Sure, that helps the low end, but based on my initial testing it is not going to help anyone who has already learned how to write good prompts.
What is needed is the ability for the LLM to create a useful, ongoing meta-context for the conversation so that it doesn't make stupid mistakes and omissions. I was really hoping OpenAI would have something like this ready for use.
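To make that concrete, here is roughly what I mean, as a minimal sketch (my own hypothetical scaffolding, not an existing OpenAI feature; only the openai Node SDK calls are real, and the digest-update prompt is something you'd have to tune):

```typescript
import OpenAI from "openai";

const client = new OpenAI();
const MODEL = "gpt-4o"; // placeholder model

// The "meta-context": a running digest of constraints, decisions, and gotchas
// that must survive even after the raw turns fall out of the context window.
let metaContext = "(nothing yet)";

type Turn = { role: "user" | "assistant"; content: string };

async function chat(history: Turn[], userMessage: string): Promise<string> {
  history.push({ role: "user", content: userMessage });

  const reply = await client.chat.completions.create({
    model: MODEL,
    messages: [
      {
        role: "system",
        content:
          "You are a coding assistant. Durable facts about this conversation:\n" +
          metaContext,
      },
      ...history.slice(-10), // only recent turns travel verbatim
    ],
  });
  const answer = reply.choices[0].message.content ?? "";
  history.push({ role: "assistant", content: answer });

  // After each turn, fold any new durable facts into the digest.
  const update = await client.chat.completions.create({
    model: MODEL,
    messages: [
      {
        role: "system",
        content:
          "Update this digest of durable facts (requirements, file names, " +
          "decisions) with anything new from the exchange. Return only the digest.",
      },
      {
        role: "user",
        content: `Digest:\n${metaContext}\n\nExchange:\nUser: ${userMessage}\nAssistant: ${answer}`,
      },
    ],
  });
  metaContext = update.choices[0].message.content ?? metaContext;

  return answer;
}
```

The point is that mistakes of omission drop when durable constraints are restated on every turn instead of having to survive verbatim in a shrinking window.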
I have tested o1-preview on a couple of coding tasks and I am impressed.
I am looking at a TypeScript project with a fair amount of type gymnastics, and one particular line of code would not validate with tsc no matter what I tried. I copy-pasted the whole context into o1-preview, and it told me the error I was likely seeing (a spot-on, letter-by-letter correct error message, including my variable names), explained the problem, and provided two solutions, both of which worked immediately.
In another test, I pasted a smart contract written in Solidity and naively asked it to identify vulnerabilities. It thought for more than a minute and then produced a detailed report of what could go wrong, much, much deeper than any previous model could do. (It found no vulnerabilities, because my code is perfect, but that's another story.)