That’s only tangentially related but I have a very hard time using Opus for anyt...

That’s only tangentially related but I have a very hard time using Opus for anything serious. Sonnet is still much more useful to me thanks to the context window size. By the moment Opus actually understands what’s needed, I’m n compactions deep and pretty much hoping for the best.

That’s a reason why I can’t believe the benchmarks and why I also believe open source models (claiming 200 but realistically struggling past 40k) aren’t only a bit but very far behind SOTA in actual software dev.

This is not true for all software, but there are types of systems or environments where it’s abundantly clear that Opus (or anything with a sub 1m window) won’t cut it, unless it has a very efficient agentic system to help.

I’m not talking about dumping an entire code base in the context, I’m talking about clear specs, some code, library guidelines, and a few elements to allow the LLM to be better than a glorified autocomplete that lives in an electron fork.

Sonnet still wins easily.