Claude 4: The "World's Best Coding Model"
Anthropic launched Claude Opus 4 and Sonnet 4, which are their next-generation models that can think through problems step-by-step while using external tools.
both models can switch between instant responses and extended reasoning, with visible summaries showing their thought processes.
Opus 4 achieved 72.5% on SWE-bench (a benchmark used to measure software engineering capabilites) and can code autonomously for hours.