Manage LLM Provider¶
After deploying your LLM Provider, manage its configuration through the provider details page. This guide covers all management operations organized by tab.
On This Page
Provider Configuration:
- Connection - Endpoint and authentication settings
- Access Control - Enable/disable API endpoints
- Security - Application authentication setup
- Rate Limiting - Token usage controls
- Guardrails - Content safety and compliance
- Models - Available AI models
Lifecycle Operations:
- Redeploy Provider - Push configuration updates
- Delete Provider - Remove provider permanently
Access Provider Details¶
-
Navigate to AI Workspace > LLM Providers.
-
Click on a provider name to open its details page.
The provider details page shows:
- Provider name, version, and template type
- Creation and last updated timestamps
- Six management tabs (Connection, Access Control, Security, Rate Limiting, Guardrails, Models)
- Deploy to Gateway button in the top right
Connection¶
Provider Endpoint¶
Upstream URL - Where API requests are forwarded to:
| Provider Type | URL Configuration |
|---|---|
| OpenAI, Anthropic, Gemini, Mistral AI | Pre-configured (read-only) |
| Azure OpenAI, Azure AI Foundry | Custom URL (editable) |
To update:
- Modify the Provider Endpoint field
- Click Save
Authentication¶
Configure how the gateway authenticates with the upstream provider:
- Authentication Type: Usually
api-key(set by template) - Credentials: Your provider API key
- Click the 👁️ icon to show/hide the value
- Update and click Save to change credentials
OpenAPI Specification¶
Spec URL - The OpenAPI specification for importing API resources:
- Can be a URL or file upload
- Automatically populates the Access Control resources list
- Re-import to refresh available endpoints
Access Control¶
Mode¶
Select the access control mode at the top of the tab:
| Mode | Behavior |
|---|---|
| Allow all | All resources are accessible by default. Move specific resources to Denied Resources to block them. |
| Deny all | All resources are blocked by default. Move specific resources to Allowed Resources to permit them. |
Use the arrow buttons between the two panels to move resources:
>>— Move all resources to the other panel>— Move selected resource(s) to the other panel<— Move selected resource(s) back<<— Move all resources back
Import Resources¶
Click Import resources from specification to load or refresh the resource list from the provider's OpenAPI specification.
Security¶
API Key Authentication¶
Set up the API key that client applications must provide:
| Field | Description | Example |
|---|---|---|
| Authentication Type | Authentication method | apiKey (default) |
| API Key Name | Header or query parameter name | x-api-key, apikey, Authorization |
| Key Location | Where clients send the key | Header (recommended) or Query |
To configure:
- Select Authentication Type:
apiKey - Enter API Key Name that applications will use
- Choose Key Location:
- Header - Sent in HTTP request header (most secure)
- Query - Sent as URL query parameter (less secure)
- Click Save
Rate Limiting¶
The Rate Limiting tab provides two independent sections: Backend and Per Consumer.
| Section | Controls | Protects | Status |
|---|---|---|---|
| Backend | Requests from the gateway to the upstream LLM provider | Your provider API credentials and total spend | Available |
| Per Consumer | Requests from client applications to the gateway | Fair usage across all consumers | Coming soon |
Per Consumer Rate Limiting — Coming Soon
Per consumer rate limiting is not yet available. Only the Backend section is currently configurable.
Both sections support two configuration modes:
- Provider-wide — A single limit applied across all API endpoints.
- Per Resource — Individual limits per API endpoint (e.g., chat completions vs. embeddings).
Note
Provider-wide and Per Resource modes are mutually exclusive per section. Clear existing limits before switching modes.
Limit Criteria¶
Each section lets you configure:
| Criterion | Description |
|---|---|
| Request Count | Maximum number of requests within the reset duration |
| Token Count | Maximum number of tokens (prompt + completion) within the reset duration |
| Cost | Cost-based limiting (Coming soon) |
For each enabled criterion, set the Quota and Reset Duration (second, minute, or hour).
Provider-wide Configuration¶
- Select Provider-wide in the Backend section.
- Enable Request Count and/or Token Count.
- Enter the Quota and select the Reset Duration for each criterion.
- Click Save.
Per Resource Configuration¶
- Select Per Resource in the Backend section.
- Expand Limit per Resource to set default limits for all endpoints:
- Enable the criteria and configure Quota and Reset Duration.
- To override limits for a specific endpoint, expand that resource row and configure it separately.
- Click Save.
Cost Control Best Practices
Set conservative backend limits first to protect your provider credentials. Monitor actual usage via the Insights dashboard before increasing limits. Use Per Resource mode only when endpoints have significantly different usage patterns.
Learn more: Token-Based Rate Limiting
Guardrails¶
View Attached Guardrails¶
The tab displays all guardrails currently attached to the provider:
- Guardrail name and type
- Configuration status and parameters
- Enable/disable toggles for quick activation control
Add a Guardrail¶
Guardrails can be added globally (applying to all endpoints) or per resource (applying to a specific endpoint).
To add a global guardrail:
- In the Global Guardrails section, click + Add Guardrail.
- A sidebar opens showing available guardrail types.
- Select a guardrail and configure its settings:
- Fill in required parameters
- Expand Advanced Settings for additional options
- Click Add to attach it to the provider.
To add a resource-level guardrail:
- Find the resource you want to protect and expand its card.
- Click + Add Guardrail within that resource.
- Select and configure the guardrail (same process as global guardrails).
- Click Add to attach it to the resource.
Configure Guardrails¶
Guardrail parameters cannot be edited in place. To change a guardrail's configuration, you must delete it and add it again with the updated settings.
To update a guardrail:
- Delete the existing guardrail.
- Click + Add Guardrail and re-add it with the updated configuration.
- Redeploy the provider to apply the changes.
Advanced Settings
Each guardrail includes advanced configuration options such as custom thresholds, severity levels, and execution phases. Click Advanced Settings when adding a guardrail.
Production Impact
Guardrail changes require a manual redeploy to take effect on deployed gateways. Test thoroughly in a non-production environment before enabling strict guardrails.
Learn more: Guardrails Overview. For the full policy catalog, visit the Policy Hub.
Models¶
Add Models¶
To add new models:
- Click Add model provider
- Select a provider from the list to import its model catalog
- Click Add
- The models are added and enabled by default
Enable/Disable Models¶
Control which models applications can access:
- Toggle switches: Enable/disable individual models
- Effect: Disabled models are blocked at the gateway
- Applications attempting to use disabled models will receive an error
Lifecycle Operations¶
Redeploy Provider¶
Push configuration changes to deployed gateways.
When to redeploy:
- After updating connection settings
- After modifying rate limits or guardrails
- After enabling/disabling resources or models
To redeploy:
- Click Deploy to Gateway (top right corner)
- Select the gateway(s) to deploy to
- Review the changes summary
- Click Deploy
Deployment Status
Monitor deployment progress in the notification panel. Changes take effect within seconds of successful deployment.
Delete Provider¶
Permanently remove the provider and all its configurations.
Prerequisite
A provider cannot be deleted if any LLM Proxy is currently using it. Delete or reassign all dependent proxies before proceeding.
To delete:
- Navigate to AI Workspace > LLM Providers
- Find the provider in the list
- Click the Delete icon
- Review the warning and confirm deletion
Warning: Irreversible Action
Deleting a provider will:
- ❌ Remove it from all deployed gateways immediately
- ❌ Break applications consuming this provider
- ❌ Delete all configuration (guardrails, rate limits, models)
- ❌ Cannot be undone
Next Steps¶
- Configure LLM Proxy - Configure and deploy proxy endpoints using your provider
- Policies Overview - Explore all available guardrails and policies