Is Opus 4.5 really 'the best model in the world for coding'? It just failed half my tests
zdnet.com
Follow ZDNET: Add us as a preferred source on Google.
ZDNET's key takeaways
- Opus 4.5 failed half my coding tests, despite bold claims
- File handling glitches made basic plugin testing nearly impossible
- Two tests passed, but reliability issues still dominate the story
I've got to tell you: I've had fairly okay coding results with Claude's lower-end Sonnet AI model. But for whatever reason, its high-end Opus model has never done well on my tests.
Usually, you expect the super-duper coding model to code better than the cheap seats, but with Opus, not so much.
Also: Google's Antigravity puts coding productivity before AI hype - and the result is astonishing
Now, we're back with Opus 4.5. Anthropic, the company behind Claude claims, and I quote, "Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient ...
Copyright of this story solely belongs to zdnet.com . To see the full text click HERE

