Model Catalogue — Stav AI

Qwen · qwen3.6

Qwen3.6 27B

Alibaba's 27.8B open-weight multimodal reasoning model with near-frontier math, science, and code performance. Ships switchable thinking and non-thinking modes in a single checkpoint, across a 262,144-token context window.

Google · gemma

Gemma 4 26B A4B IT

Google DeepMind's instruction-tuned Mixture-of-Experts model with 25.2B total parameters and only 3.8B active per token, delivering high-tier reasoning and vision understanding across a 256K-token context window — available as open weights under Apache 2.0.

€0.06/€0.33/M in·out

Google · gemma

Gemma 4 31B IT

Google DeepMind's open-weights 30.7B dense multimodal model with a 256K-token context window, optional chain-of-thought thinking, and native function calling — instruction-tuned for complex, long-context, and multilingual tasks.

€0.12/€0.35/M in·out

Mistral Ai · mistral-small

mistral-small-2603

Mistral AI's 119B open-weights MoE model unifying instruct, on-demand reasoning, vision, and coding in a single Apache 2.0 checkpoint, with a 262K-token context window and interactive-class latency.

€0.09/€0.28/M in·out

Every model worth running, side by side.

Qwen3.6 27B — Alibaba's 27.