Manage LLM Provider¶

After deploying your LLM Provider, manage its configuration through the provider details page. This guide covers all management operations organized by tab.

On This Page

Provider Configuration:

Connection - Endpoint and authentication settings
Access Control - Enable/disable API endpoints
Security - Application authentication setup
Rate Limiting - Token usage controls
Guardrails - Content safety and compliance
Models - Available AI models

Lifecycle Operations:

Redeploy Provider - Push configuration updates
Delete Provider - Remove provider permanently

Access Provider Details¶

Navigate to AI Workspace > LLM Providers.
Click on a provider name to open its details page.

The provider details page shows:

Provider name, version, and template type
Creation and last updated timestamps
Six management tabs (Connection, Access Control, Security, Rate Limiting, Guardrails, Models)
Deploy to Gateway button in the top right

Connection¶

Manage upstream endpoint configuration and authentication credentials for the LLM provider.

Provider Endpoint¶

Upstream URL - Where API requests are forwarded to:

Provider Type	URL Configuration
OpenAI, Anthropic, Gemini, Mistral AI	Pre-configured (read-only)
Azure OpenAI, Azure AI Foundry	Custom URL (editable)

To update:

Modify the Provider Endpoint field
Click Save

Authentication¶

Configure how the gateway authenticates with the upstream provider:

Authentication Type: Usually api-key (set by template)
Credentials: Your provider API key
- Click the 👁️ icon to show/hide the value
- Update and click Save to change credentials

OpenAPI Specification¶

Spec URL - The OpenAPI specification for importing API resources:

Can be a URL or file upload
Automatically populates the Access Control resources list
Re-import to refresh available endpoints

Access Control¶

Control which API endpoints are accessible through this provider.

Mode¶

Select the access control mode at the top of the tab:

Mode	Behavior
Allow all	All resources are accessible by default. Move specific resources to Denied Resources to block them.
Deny all	All resources are blocked by default. Move specific resources to Allowed Resources to permit them.

Use the arrow buttons between the two panels to move resources:

>> — Move all resources to the other panel
> — Move selected resource(s) to the other panel
< — Move selected resource(s) back
<< — Move all resources back

Import Resources¶

Click Import resources from specification to load or refresh the resource list from the provider's OpenAPI specification.

Security¶

Configure how applications authenticate when accessing this provider through the gateway.

API Key Authentication¶

Set up the API key that client applications must provide:

Field	Description	Example
Authentication Type	Authentication method	`apiKey` (default)
API Key Name	Header or query parameter name	`x-api-key`, `apikey`, `Authorization`
Key Location	Where clients send the key	`Header` (recommended) or `Query`

To configure:

Select Authentication Type: apiKey
Enter API Key Name that applications will use
Choose Key Location:
- Header - Sent in HTTP request header (most secure)
- Query - Sent as URL query parameter (less secure)
Click Save

Rate Limiting¶

Control request and token consumption to prevent cost overruns and ensure fair usage across all consumers.

The Rate Limiting tab provides two independent sections: Backend and Per Consumer.

Section	Controls	Protects	Status
Backend	Requests from the gateway to the upstream LLM provider	Your provider API credentials and total spend	Available
Per Consumer	Requests from client applications to the gateway	Fair usage across all consumers	Coming soon

Per Consumer Rate Limiting — Coming Soon

Per consumer rate limiting is not yet available. Only the Backend section is currently configurable.

Both sections support two configuration modes:

Provider-wide — A single limit applied across all API endpoints.
Per Resource — Individual limits per API endpoint (e.g., chat completions vs. embeddings).

Note

Provider-wide and Per Resource modes are mutually exclusive per section. Clear existing limits before switching modes.

Limit Criteria¶

Each section lets you configure:

Criterion	Description
Request Count	Maximum number of requests within the reset duration
Token Count	Maximum number of tokens (prompt + completion) within the reset duration
Cost	Cost-based limiting (Coming soon)

For each enabled criterion, set the Quota and Reset Duration (second, minute, or hour).

Provider-wide Configuration¶

Select Provider-wide in the Backend section.
Enable Request Count and/or Token Count.
Enter the Quota and select the Reset Duration for each criterion.
Click Save.

Per Resource Configuration¶

Select Per Resource in the Backend section.
Expand Limit per Resource to set default limits for all endpoints:
- Enable the criteria and configure Quota and Reset Duration.
To override limits for a specific endpoint, expand that resource row and configure it separately.
Click Save.

Cost Control Best Practices

Set conservative backend limits first to protect your provider credentials. Monitor actual usage via the Insights dashboard before increasing limits. Use Per Resource mode only when endpoints have significantly different usage patterns.

Learn more: Token-Based Rate Limiting

Guardrails¶

Attach guardrails to enforce content safety, compliance, and quality standards. Guardrails on a provider can be applied globally (all endpoints) or per resource (specific endpoints), and affect all proxies that use this provider.

View Attached Guardrails¶

The tab displays all guardrails currently attached to the provider:

Guardrail name and type
Configuration status and parameters
Enable/disable toggles for quick activation control

Add a Guardrail¶

Guardrails can be added globally (applying to all endpoints) or per resource (applying to a specific endpoint).

To add a global guardrail:

In the Global Guardrails section, click + Add Guardrail.
A sidebar opens showing available guardrail types.
Select a guardrail and configure its settings:
- Fill in required parameters
- Expand Advanced Settings for additional options
Click Add to attach it to the provider.

To add a resource-level guardrail:

Find the resource you want to protect and expand its card.
Click + Add Guardrail within that resource.
Select and configure the guardrail (same process as global guardrails).
Click Add to attach it to the resource.

Configure Guardrails¶

Guardrail parameters cannot be edited in place. To change a guardrail's configuration, you must delete it and add it again with the updated settings.

To update a guardrail:

Delete the existing guardrail.
Click + Add Guardrail and re-add it with the updated configuration.
Redeploy the provider to apply the changes.

Advanced Settings

Each guardrail includes advanced configuration options such as custom thresholds, severity levels, and execution phases. Click Advanced Settings when adding a guardrail.

Production Impact

Guardrail changes require a manual redeploy to take effect on deployed gateways. Test thoroughly in a non-production environment before enabling strict guardrails.

Learn more: Guardrails Overview. For the full policy catalog, visit the Policy Hub.

Models¶

Configure which AI models are accessible through this provider.

Add Models¶

To add new models:

Click Add model provider
Select a provider from the list to import its model catalog
Click Add
The models are added and enabled by default

Enable/Disable Models¶

Control which models applications can access:

Toggle switches: Enable/disable individual models
Effect: Disabled models are blocked at the gateway
Applications attempting to use disabled models will receive an error

Lifecycle Operations¶

Redeploy Provider¶

Push configuration changes to deployed gateways.

When to redeploy:

After updating connection settings
After modifying rate limits or guardrails
After enabling/disabling resources or models

To redeploy:

Click Deploy to Gateway (top right corner)
Select the gateway(s) to deploy to
Review the changes summary
Click Deploy

Deployment Status

Monitor deployment progress in the notification panel. Changes take effect within seconds of successful deployment.

Delete Provider¶

Permanently remove the provider and all its configurations.

Prerequisite

A provider cannot be deleted if any LLM Proxy is currently using it. Delete or reassign all dependent proxies before proceeding.

To delete:

Navigate to AI Workspace > LLM Providers
Find the provider in the list
Click the Delete icon
Review the warning and confirm deletion

Warning: Irreversible Action

Deleting a provider will:

❌ Remove it from all deployed gateways immediately
❌ Break applications consuming this provider
❌ Delete all configuration (guardrails, rate limits, models)
❌ Cannot be undone

Next Steps¶

Configure LLM Proxy - Configure and deploy proxy endpoints using your provider
Policies Overview - Explore all available guardrails and policies