Loading the catalogue…
Loading the catalogue…
Anthropic's most agentic Sonnet-class model, featuring adaptive reasoning with selectable effort levels, a one-million-token context window, and strong coding and multi-step workflow performance.
Anthropic's first Mythos-class frontier model with always-on adaptive reasoning, a one-million-token context window, and multimodal input — built for deep reasoning across code, research, and analytical workloads.
Google DeepMind's frontier-tier multimodal Flash model with a 1M-token context window, optional extended thinking, and native support for text, image, video, audio, and file inputs — built for complex interactive workloads.
OpenAI's frontier agentic instruct model with a 1M+ token context window, built for complex multi-step reasoning, long-document analysis, and autonomous tool orchestration.
Alibaba's 27.8B open-weight multimodal reasoning model with near-frontier math, science, and code performance. Ships switchable thinking and non-thinking modes in a single checkpoint, across a 262,144-token context window.
Anthropic's iterative frontier flagship with a one-million-token context window, graduate-level reasoning (GPQA 0.914), enhanced vision, and optional extended thinking — built for complex long-horizon tasks.
Google DeepMind's instruction-tuned Mixture-of-Experts model with 25.2B total parameters and only 3.8B active per token, delivering high-tier reasoning and vision understanding across a 256K-token context window — available as open weights under Apache 2.0.
Google DeepMind's open-weights 30.7B dense multimodal model with a 256K-token context window, optional chain-of-thought thinking, and native function calling — instruction-tuned for complex, long-context, and multilingual tasks.
OpenAI's efficient GPT-5-generation model with a 400K context window, tool calling, structured output, and optional reasoning traces — optimised for high-throughput agentic and coding workloads.
Mistral AI's 119B open-weights MoE model unifying instruct, on-demand reasoning, vision, and coding in a single Apache 2.0 checkpoint, with a 262K-token context window and interactive-class latency.
Google's frontier reasoning model with a 1M-token context window, dynamic chain-of-thought thinking, and full multimodal input — built for complex, long-horizon inference tasks.
Anthropic's hybrid reasoning Sonnet model — frontier-grade GPQA scores, API-controlled extended thinking, and a 1M-token context window in beta, delivered at interactive speeds.
Anthropic's flagship reasoning model with a 1M-token context window (beta), adaptive thinking via /effort controls, and top benchmark results on Humanity's Last Exam, Terminal-Bench 2.0, BrowseComp, and GDPval-AA. Optimised for complex agentic and long-context workloads.
OpenAI's frontier agentic coding model combining top-tier software-engineering performance with broad professional reasoning, a 400K-token context window, and native support for reasoning traces and tool orchestration.
Google's frontier-tier Flash model with a 1M-token context, native multimodal inputs, and opt-in extended thinking — built for fast agentic workflows that need near-Pro reasoning.
Anthropic's frontier reasoning model with an 80.9% SWE-bench Verified score, 200K context window, and optional extended thinking — built for complex coding, research, and long-document tasks where inference depth takes priority over latency.
Anthropic's fastest Claude 4 model — built for high-throughput agentic and coding workloads, with a 200K-token context window, extended thinking support, vision, and strong safety benchmarks.
OpenAI's compact GPT-5 variant — multimodal input, 400K context, optional extended thinking, and tool calling — tuned for interactive, medium-complexity workloads at a lighter weight than full GPT-5.
OpenAI's fastest GPT-5 tier — optimised for high-volume classification, summarisation, and short-completion tasks with a 400K context window, vision and file input, and optional reasoning traces.
Anthropic's agentic coding and research model with 74.5% SWE-bench Verified, 200K context, and extended thinking up to 64K tokens.
Google's flagship reasoning model with a one-million-token context window, adaptive chain-of-thought thinking, and native support for multimodal inputs, tool calling, web search grounding, and sandboxed code execution.
Anthropic's flagship hybrid reasoning model from the Claude 4 generation — leading SWE-bench Verified at launch and built for complex coding, agentic, and research workflows with extended thinking up to 64K tokens.
Google DeepMind's efficiency-tier multimodal model with a 1M-token context window, optional chain-of-thought thinking, and native support for text, image, audio, video, and file inputs.
No models match your search.
Anthropic's first Mythos-class frontier model with always-on adaptive reasoning, a one-million-token context window, and multimodal input — built for deep reasoning across code, research, and analytical workloads.