CATALOGUE · CAPABILITIES

Capabilities

What a model can do at the API boundary — tool calling, vision input, extended thinking, context caching. Fifteen first-class capabilities across input, output, serving, and tools categories. Stav grades every model in the catalogue against this vocabulary.

Input

Input capabilities

What the model can ingest — text, images, audio, video, and structured data.

System Prompt

system_prompt

Model accepts a separate system message distinct from the user message.

See models supporting this →

Vision Input

vision_input

Model accepts image inputs alongside text.

See models supporting this →

Audio Input

audio_input

Model accepts audio input.

See models supporting this →

Document Understanding

document_understanding

Model accepts native document files (e.g. PDF) end-to-end.

See models supporting this →

Output

Output capabilities

What the model can return — streaming, structured output, tool calls, audio.

Tool Calling

tool_calling

Model emits structured function calls per OpenAI tool-use schema.

See models supporting this →

Parallel Tool Calls

parallel_tool_calls

Model can emit ≥2 tool calls in one response. Modifier on tool_calling.

See models supporting this →

Structured Output

structured_output

Model returns JSON conforming to a supplied JSON Schema (constrained decoding).

See models supporting this →

JSON Mode

json_mode

Model returns syntactically valid JSON when asked. Looser legacy capability.

Serving

Serving capabilities

Operational features — prompt caching, long context, batch inference.

Streaming

streaming

Model emits tokens via SSE as they're generated.

See models supporting this →

Context Caching

context_caching

Provider exposes explicit prompt-caching primitives (`cache_control`) with read-discount pricing.

See models supporting this →

Tools

Tools capabilities

Agentic features — function calling, code execution, retrieval, browsing.

Native Web Search

web_search_native

Provider offers a native web-search tool exposed through the same endpoint.

See models supporting this →

Native Code Execution

code_execution_native

Provider offers a native code-execution tool.

See models supporting this →

Native File Search

file_search_native

Provider offers a native file-search / RAG primitive.

See models supporting this →

Source · capability registry. Stav’s capability vocabulary is locked at v3.6 and shared across the public site, customer portal, and admin curation tooling. New capability slugs require a registry write and a curator-pipeline conformance pass before they land here.

See models supporting this →

Extended Thinking

extended_thinking

Model produces a visible reasoning trace before its final answer.

See models supporting this →

Audio Output

audio_output

Model produces audio output natively (TTS).

See models supporting this →