Skip to content

Manage LLM Provider

After deploying your LLM Provider, manage its configuration through the provider details page. This guide covers all management operations organized by tab.

On This Page

Provider Configuration:

Lifecycle Operations:


Access Provider Details

  1. Navigate to AI Workspace > LLM Providers.

  2. Click on a provider name to open its details page.

The provider details page shows:

  • Provider name, version, and template type
  • Creation and last updated timestamps
  • Six management tabs (Connection, Access Control, Security, Rate Limiting, Guardrails, Models)
  • Deploy to Gateway button in the top right

Connection

Manage upstream endpoint configuration and authentication credentials for the LLM provider.

Provider Endpoint

Upstream URL - Where API requests are forwarded to:

Provider Type URL Configuration
OpenAI, Anthropic, Gemini, Mistral AI Pre-configured (read-only)
Azure OpenAI, Azure AI Foundry Custom URL (editable)

To update:

  1. Modify the Provider Endpoint field
  2. Click Save

Authentication

Configure how the gateway authenticates with the upstream provider:

  • Authentication Type: Usually api-key (set by template)
  • Credentials: Your provider API key
    • Click the 👁️ icon to show/hide the value
    • Update and click Save to change credentials

OpenAPI Specification

Spec URL - The OpenAPI specification for importing API resources:

  • Can be a URL or file upload
  • Automatically populates the Access Control resources list
  • Re-import to refresh available endpoints

Access Control

Control which API endpoints are accessible through this provider.

Mode

Select the access control mode at the top of the tab:

Mode Behavior
Allow all All resources are accessible by default. Move specific resources to Denied Resources to block them.
Deny all All resources are blocked by default. Move specific resources to Allowed Resources to permit them.

Use the arrow buttons between the two panels to move resources:

  • >> — Move all resources to the other panel
  • > — Move selected resource(s) to the other panel
  • < — Move selected resource(s) back
  • << — Move all resources back

Import Resources

Click Import resources from specification to load or refresh the resource list from the provider's OpenAPI specification.


Security

Configure how applications authenticate when accessing this provider through the gateway.

API Key Authentication

Set up the API key that client applications must provide:

Field Description Example
Authentication Type Authentication method apiKey (default)
API Key Name Header or query parameter name x-api-key, apikey, Authorization
Key Location Where clients send the key Header (recommended) or Query

To configure:

  1. Select Authentication Type: apiKey
  2. Enter API Key Name that applications will use
  3. Choose Key Location:
    • Header - Sent in HTTP request header (most secure)
    • Query - Sent as URL query parameter (less secure)
  4. Click Save

Rate Limiting

Control request and token consumption to prevent cost overruns and ensure fair usage across all consumers.

The Rate Limiting tab provides two independent sections: Backend and Per Consumer.

Section Controls Protects Status
Backend Requests from the gateway to the upstream LLM provider Your provider API credentials and total spend Available
Per Consumer Requests from client applications to the gateway Fair usage across all consumers Coming soon

Per Consumer Rate Limiting — Coming Soon

Per consumer rate limiting is not yet available. Only the Backend section is currently configurable.

Both sections support two configuration modes:

  • Provider-wide — A single limit applied across all API endpoints.
  • Per Resource — Individual limits per API endpoint (e.g., chat completions vs. embeddings).

Note

Provider-wide and Per Resource modes are mutually exclusive per section. Clear existing limits before switching modes.

Limit Criteria

Each section lets you configure:

Criterion Description
Request Count Maximum number of requests within the reset duration
Token Count Maximum number of tokens (prompt + completion) within the reset duration
Cost Cost-based limiting (Coming soon)

For each enabled criterion, set the Quota and Reset Duration (second, minute, or hour).

Provider-wide Configuration

  1. Select Provider-wide in the Backend section.
  2. Enable Request Count and/or Token Count.
  3. Enter the Quota and select the Reset Duration for each criterion.
  4. Click Save.

Per Resource Configuration

  1. Select Per Resource in the Backend section.
  2. Expand Limit per Resource to set default limits for all endpoints:
    • Enable the criteria and configure Quota and Reset Duration.
  3. To override limits for a specific endpoint, expand that resource row and configure it separately.
  4. Click Save.

Cost Control Best Practices

Set conservative backend limits first to protect your provider credentials. Monitor actual usage via the Insights dashboard before increasing limits. Use Per Resource mode only when endpoints have significantly different usage patterns.

Learn more: Token-Based Rate Limiting


Guardrails

Attach guardrails to enforce content safety, compliance, and quality standards. Guardrails on a provider can be applied globally (all endpoints) or per resource (specific endpoints), and affect all proxies that use this provider.

View Attached Guardrails

The tab displays all guardrails currently attached to the provider:

  • Guardrail name and type
  • Configuration status and parameters
  • Enable/disable toggles for quick activation control

Add a Guardrail

Guardrails can be added globally (applying to all endpoints) or per resource (applying to a specific endpoint).

To add a global guardrail:

  1. In the Global Guardrails section, click + Add Guardrail.
  2. A sidebar opens showing available guardrail types.
  3. Select a guardrail and configure its settings:
    • Fill in required parameters
    • Expand Advanced Settings for additional options
  4. Click Add to attach it to the provider.

To add a resource-level guardrail:

  1. Find the resource you want to protect and expand its card.
  2. Click + Add Guardrail within that resource.
  3. Select and configure the guardrail (same process as global guardrails).
  4. Click Add to attach it to the resource.

Configure Guardrails

Guardrail parameters cannot be edited in place. To change a guardrail's configuration, you must delete it and add it again with the updated settings.

To update a guardrail:

  1. Delete the existing guardrail.
  2. Click + Add Guardrail and re-add it with the updated configuration.
  3. Redeploy the provider to apply the changes.

Advanced Settings

Each guardrail includes advanced configuration options such as custom thresholds, severity levels, and execution phases. Click Advanced Settings when adding a guardrail.

Production Impact

Guardrail changes require a manual redeploy to take effect on deployed gateways. Test thoroughly in a non-production environment before enabling strict guardrails.

Learn more: Guardrails Overview. For the full policy catalog, visit the Policy Hub.


Models

Configure which AI models are accessible through this provider.

Add Models

To add new models:

  1. Click Add model provider
  2. Select a provider from the list to import its model catalog
  3. Click Add
  4. The models are added and enabled by default

Enable/Disable Models

Control which models applications can access:

  • Toggle switches: Enable/disable individual models
  • Effect: Disabled models are blocked at the gateway
  • Applications attempting to use disabled models will receive an error

Lifecycle Operations


Redeploy Provider

Push configuration changes to deployed gateways.

When to redeploy:

  • After updating connection settings
  • After modifying rate limits or guardrails
  • After enabling/disabling resources or models

To redeploy:

  1. Click Deploy to Gateway (top right corner)
  2. Select the gateway(s) to deploy to
  3. Review the changes summary
  4. Click Deploy

Deployment Status

Monitor deployment progress in the notification panel. Changes take effect within seconds of successful deployment.


Delete Provider

Permanently remove the provider and all its configurations.

Prerequisite

A provider cannot be deleted if any LLM Proxy is currently using it. Delete or reassign all dependent proxies before proceeding.

To delete:

  1. Navigate to AI Workspace > LLM Providers
  2. Find the provider in the list
  3. Click the Delete icon
  4. Review the warning and confirm deletion

Warning: Irreversible Action

Deleting a provider will:

  • ❌ Remove it from all deployed gateways immediately
  • ❌ Break applications consuming this provider
  • ❌ Delete all configuration (guardrails, rate limits, models)
  • Cannot be undone

Next Steps