Skip to content

Models

The Models page is where you register, edit, and remove model definitions. Each model maps a model ID to a provider and defines its capabilities and pricing, which the gateway uses for routing, cost tracking, and budget enforcement.

Model List

Models are displayed in a paginated, searchable data table with the following columns:

ColumnDescription
Model IDThe identifier clients use in requests (e.g. gpt-4-turbo, claude-3-opus)
ProviderThe provider that serves this model
CapabilitiesTags indicating what the model supports
Context WindowMaximum token context length
PricingInput and output cost per 1,000 tokens
Statusactive, inactive, or deprecated

Use the search bar to filter by model ID or provider name. The table supports sorting on Model ID, Provider, Context Window, and Status.

Adding a Model

Click Add Model and fill in the form:

  • Model ID — The identifier that clients will reference (e.g. gpt-4-turbo). This cannot be changed after creation.
  • Provider — Select from the list of configured providers.
  • Context Window — Maximum token context length (defaults to 4096).
  • Status — Set to active to make the model available immediately.
  • Input Price — Cost per 1,000 input tokens (4 decimal places).
  • Output Price — Cost per 1,000 output tokens (4 decimal places).
  • Capabilities — Check all that apply.

Capabilities

Each model can declare one or more capabilities, which the gateway uses for routing decisions:

CapabilityDescription
chatConversational completions (messages API)
completionText completions
embeddingVector embeddings
imageImage generation
audioAudio transcription, translation, or speech
function_callingTool/function call support
visionImage understanding in prompts
streamingServer-sent event streaming

Routing rules filter models by capability, so accurate tagging is important. A routing rule targeting the embedding capability will only consider models with that capability enabled.

Editing a Model

Click Edit on any model row to open the edit modal. All fields except Model ID can be changed. Common operations include:

  • Updating pricing when providers change their rates
  • Marking a model as deprecated to stop new routing while honoring in-flight requests
  • Adding or removing capabilities

Deleting a Model

Click Delete to remove a model. A confirmation dialog warns that existing routing rules referencing this model may be affected. Deletion is permanent.

Pricing and Cost Tracking

Model pricing feeds directly into the gateway’s cost calculations. When a request completes, the gateway multiplies the token counts by the model’s configured input and output prices to compute the cost. This data appears in:

  • Dashboard cost breakdowns
  • Budget enforcement checks
  • Usage reports

Keep pricing current to ensure accurate budget tracking and cost-optimized routing decisions.

Model Statuses

StatusEffect
activeModel is available for routing
inactiveModel is hidden from routing but retained in configuration
deprecatedModel is marked for phase-out; routing strategies may deprioritize it