Anthropic Says Claude Is Now More Powerful Than GPT-4o

It’s only been a few months since Anthropic debuted Claude 3, but the company is ready to take the next step, at least for one of its models. Enter Claude 3.5 Sonnet. As the mid-tier model among Anthropic’s large language models (LLMs), Claude Sonnet is a good option for those who want access to a powerful but affordable AI chatbot, and with Claude 3.5 Sonnet, the company says it’s making its middle offering even better.

According to the announcement, Claude 3.5 Sonnet is up to two times faster at processing than Claude 3 Opus, previously the most powerful model the company offered (Opus will be getting an update to take back its top spot). Anthropic claims that Claude 3.5 Sonnet is “ideal for complex tasks,” and that it shows improvement in grasping nuance, humor, and complex instructions, as well as in writing. In an internal agentic coding evaluation run by Anthropic, Claude 3.5 Sonnet reportedly solved 64 percent of the problems it was given, outperforming the 38 percent solved by Claude 3 Opus. That is, indeed, a marked improvement.

You can see Anthropic’s full list of how Claude 3.5 Sonnet compares to other LLMs across different areas in the image above. Based on the data shown, it appears to outperform OpenAI’s newest model, GPT-4o, in almost every category. However, exactly how well those benchmarks will play out in real-world usage remains to be seen.

Coinciding with the launch of Claude 3.5 Sonnet is Artifacts, a new feature that opens a dedicated window alongside your Claude conversation, where documents, code, and other AI-generated content appear in a visual space in real time. Anthropic says that this will make collaborating through Claude much easier for teams. Eventually, it hopes to allow entire organizations to use Claude to securely centralize their knowledge in one shared space and then access it through the chatbot. This will likely be similar to what Google has been doing with its Gemini AI offerings in Google Workspace.

Anthropic isn’t stopping here, either. The AI company says it plans to release updated versions of the Opus and Haiku LLMs later this year. The company also noted that it is exploring features like Memory, which would allow Claude to remember a user’s specific preferences and interaction history to help make their experiences even more personal. ChatGPT already utilizes a memory system, so it isn’t surprising to see Anthropic leaning that way with Claude, too.

If you’re interested in trying Claude 3.5 Sonnet for yourself, you can access it directly from the Claude website or through the Claude iOS app. Claude Pro and Team subscribers will still get higher rate limits with the new model. Developers can also use Anthropic’s API directly, though they’ll need to pay for tokens to do so.

