Loading the catalogue…
Loading the catalogue…
Anthropic's instruct-tuned Claude Sonnet 4.5 for agentic coding, computer use, and long-context reasoning — now on a 200K context window and heading toward retirement.
Anthropic's hybrid reasoning Sonnet model — frontier-grade GPQA scores, API-controlled extended thinking, and a 1M-token context window in beta, delivered at interactive speeds.
Each axis is the mean score across the family’s variants that have been scored on that dimension. Per-axis sample size is shown next to each label — the family currently aggregates up to 3 variants per axis.
Values aggregated across the family’s variants: any variant supporting a capability resolves the family to Supported; flag-driven support resolves to Optional; only when every variant explicitly denies a capability does the family render as Not supported. 15 of 15 capabilities have variant data so far.