It has strong reasoning, but it sometimes answers questions you didn't ask. Formatting and image generation lag behind the text quality. It's a new month, and a new AI version number. It's called ...
Stop Googling. The answer is staring you right in the face—you just have to read it.
Anthropic researchers say Claude Opus 4.6 showed unusual behaviour during a BrowseComp evaluation. The model suspected it was being tested, identified the benchmark online, and wrote code to decrypt ...