Opus 4.6: Anthropic's Most Capable Model Shows Why We Need Independent Verification


Feb 10, 2026

Anthropic released its newest and most capable model yet: Opus 4.6. The new Opus is genuinely impressive. Its performance is state of the art on the leading benchmarks, with the highest score on the agentic coding evaluation Terminal-Bench 2.0, the multidisciplinary reasoning test Humanity’s Last Exam, the leading test for knowledge work GDPval-AA, and the agentic search test BrowseComp.

Opus 4.6 performs better on long-context retrieval and long-context reasoning, helping avoid “context rot,” and also features a 1M-token context window, a first for Opus-class models. Anthropic has upgraded Claude’s integration in applications like PowerPoint and Excel and stresses the new model’s usefulness for everyday tasks like financial analyses and presentations.

Anthropic has also bulked up its safety protections. In the model release announcement, the company writes that “these intelligence gains do not come at the cost of safety.” Its system card states that Opus 4.6 is as well-aligned as its predecessor, Opus 4.5, with the lowest rate of excessive refusals of any recent Claude model. Anthropic presents results from a range of new and upgraded safety tests covering user well-being, refusals, interpretability, and misaligned behaviors such as deception, sycophancy, encouragement of user delusions, and cooperation with misuse. The company has also published new safeguards in areas with dual-use concerns, like cyber and bio-risks, and argues that Claude should be used to “level the playing field” against bad actors by accelerating cyber-defensive applications like finding and patching vulnerabilities in open-source software.

Independent.
Nonpartisan.
Nonprofit.

Fathom is a 501(c)(3) organization funded by philanthropists. We do not take donations from corporations, including frontier labs and the FAANG companies, or foreign entities associated with countries of concern.
