Models
The Models page is where you register, edit, and remove model definitions. Each model maps a model ID to a provider and defines its capabilities and pricing, which the gateway uses for routing, cost tracking, and budget enforcement.
Model List
Models are displayed in a paginated, searchable data table with the following columns:
| Column | Description |
|---|---|
| Model ID | The identifier clients use in requests (e.g. gpt-4-turbo, claude-3-opus) |
| Provider | The provider that serves this model |
| Capabilities | Tags indicating what the model supports |
| Context Window | Maximum token context length |
| Pricing | Input and output cost per 1,000 tokens |
| Status | active, inactive, or deprecated |
Use the search bar to filter by model ID or provider name. The table supports sorting on Model ID, Provider, Context Window, and Status.
Adding a Model
Click Add Model and fill in the form:
- Model ID — The identifier that clients will reference (e.g.
gpt-4-turbo). This cannot be changed after creation. - Provider — Select from the list of configured providers.
- Context Window — Maximum token context length (defaults to 4096).
- Status — Set to
activeto make the model available immediately. - Input Price — Cost per 1,000 input tokens (4 decimal places).
- Output Price — Cost per 1,000 output tokens (4 decimal places).
- Capabilities — Check all that apply.
Capabilities
Each model can declare one or more capabilities, which the gateway uses for routing decisions:
| Capability | Description |
|---|---|
chat | Conversational completions (messages API) |
completion | Text completions |
embedding | Vector embeddings |
image | Image generation |
audio | Audio transcription, translation, or speech |
function_calling | Tool/function call support |
vision | Image understanding in prompts |
streaming | Server-sent event streaming |
Routing rules filter models by capability, so accurate tagging is important. A routing rule targeting the embedding capability will only consider models with that capability enabled.
Editing a Model
Click Edit on any model row to open the edit modal. All fields except Model ID can be changed. Common operations include:
- Updating pricing when providers change their rates
- Marking a model as
deprecatedto stop new routing while honoring in-flight requests - Adding or removing capabilities
Deleting a Model
Click Delete to remove a model. A confirmation dialog warns that existing routing rules referencing this model may be affected. Deletion is permanent.
Pricing and Cost Tracking
Model pricing feeds directly into the gateway’s cost calculations. When a request completes, the gateway multiplies the token counts by the model’s configured input and output prices to compute the cost. This data appears in:
- Dashboard cost breakdowns
- Budget enforcement checks
- Usage reports
Keep pricing current to ensure accurate budget tracking and cost-optimized routing decisions.
Model Statuses
| Status | Effect |
|---|---|
active | Model is available for routing |
inactive | Model is hidden from routing but retained in configuration |
deprecated | Model is marked for phase-out; routing strategies may deprioritize it |