diff --git a/assets/agw-docs/snippets/llm-comparison.md b/assets/agw-docs/snippets/llm-comparison.md
index ebc3257f4..e01c60dcc 100644
--- a/assets/agw-docs/snippets/llm-comparison.md
+++ b/assets/agw-docs/snippets/llm-comparison.md
@@ -1,18 +1,36 @@
 Review the following table to compare agentgateway's support of different LLM provider APIs.
 
-| API | OpenAI | Anthropic | Amazon Bedrock | Azure | Google Gemini | Google Vertex AI | GitHub Copilot |
-|-----|:------:|:---------:|:--------------:|:------------:|:-------------:|:----------------:|:---------------:|
-| Completions<br>`/v1/chat/completions` | ✅ Native | ✅ Translation | ✅ Translation| ✅ Native | ✅ Native`*`| ✅ Native`†` | ✅ Native |
-| Responses<br>`/v1/responses` | ✅ Native  | ❌ No |  ✅ Translation| ✅ Native| ❌ No | ❌ No | ❌ No |
-| Messages<br>`/v1/messages` |  ✅ Translation  | ✅ Native |  ✅ Translation | ✅ Translation | ✅ Translation | ✅ Native`†` | ✅ Translation |
-| Embeddings<br>`/v1/embeddings` | ✅ Native | ❌ No |  ✅ Translation | ✅ Native | ❌ No | ✅ Translation | ❌ No |
-| Realtime<br>`/v1/realtime` | ✅ Native  | ❌ No | ❌ No | ❌ No | ❌ No | ❌ No | ❌ No |
-| Token Count<br>`/v1/messages/count_tokens` | ❌ No | ✅ Native|  ✅ Translation | ❌ No| ❌ No | ✅ Translation | ❌ No |
+| Provider | Chat Completions | Responses | Messages | Embeddings | Realtime | Count Tokens | Rerank |
+|---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
+| <img src="/integrations/providers/openai.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> OpenAI | ✅ | ✅ | ✅¹ | ✅ | ✅ | ✅² | - |
+| <img src="/integrations/providers/anthropic.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Anthropic | ✅¹ | ◇ | ✅ | - | - | ✅ | - |
+| <img src="/integrations/providers/bedrock.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Bedrock | ✅¹ | ✅¹ | ✅¹ | ✅¹ | - | ✅⁴ | ✅¹ |
+| <img src="/integrations/providers/azure.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Azure | ✅ | ✅ | ✅¹ | ✅ | - | ✅² | ⚠️³ |
+| <img src="/integrations/providers/gemini.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Gemini | ✅ | ✅¹ | ✅¹ | ✅ | - | ✅² | - |
+| <img src="/integrations/providers/vertex.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Vertex AI | ✅⁴ | ◇ | ✅⁴ | ✅¹ | - | ✅⁴ | ✅¹ |
+| <img src="/integrations/providers/copilot.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Copilot | ✅ | ✅ | ✅¹ | ◇ | - | ✅² | ⚠️³ |
+| <img src="/integrations/providers/cohere.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Cohere | ✅ | ✅¹ | ✅¹ | ✅ | - | ✅² | ✅ |
+| <img src="/integrations/providers/ollama.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Ollama | ✅ | ✅ | ✅¹ | ✅ | - | ✅² | - |
+| <img src="/integrations/providers/baseten.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Baseten | ✅ | ✅¹ | ✅ | - | - | ✅² | - |
+| <img src="/integrations/providers/cerebras.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Cerebras | ✅ | ✅¹ | ✅¹ | - | - | ✅² | - |
+| <img src="/integrations/providers/deepinfra.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Deepinfra | ✅ | ✅¹ | ✅ | ✅ | - | ✅² | - |
+| <img src="/integrations/providers/deepseek.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Deepseek | ✅ | ✅¹ | ✅ | - | - | ✅² | - |
+| <img src="/integrations/providers/groq.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Groq | ✅ | ✅ | ✅¹ | - | - | ✅² | - |
+| <img src="/integrations/providers/huggingface.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Hugging Face | ✅ | ✅ | ✅¹ | - | - | ✅² | - |
+| <img src="/integrations/providers/mistral.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Mistral | ✅ | ✅¹ | ✅¹ | ✅ | - | ✅² | - |
+| <img src="/integrations/providers/openrouter.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> OpenRouter | ✅ | ✅ | ✅ | ✅ | - | ✅² | ✅ |
+| <img src="/integrations/providers/togetherai.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Together AI | ✅ | ✅¹ | ✅¹ | ✅ | - | ✅² | ✅ |
+| <img src="/integrations/providers/xai.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> xAI | ✅ | ✅ | ✅¹ | - | ✅ | ✅² | - |
+| <img src="/integrations/providers/fireworks.svg" alt="" width="20" height="20" style="vertical-align:middle;margin-right:0.4rem;"> Fireworks | ✅ | ✅ | ✅ | ✅ | - | ✅² | ✅ |
 
-**Notes**:
-- **✅ Native**: Agentgateway has complete support for the API, and the provider supports the API natively. This allows Agentgateway to passthrough unknown fields without change. As such, even if you use extra fields or new models, the proxying likely works.
-- **✅ Translation**: Agentgateway translates from one API to another. As such, agentgateway only supports fields that it is aware of. New models or LLM APIs might require code changes before they are fully supported.
-- **❌ No**: Agentgateway does not currently support the API for this provider.
-- `*`: Agentgateway supports the API natively via a compatibility endpoint. Note that Google Gemini does a translation for their Completions API support.
-- `†`: Agentgateway supports the API natively via translation to Anthropic. Support in Vertex AI differs depending on the model type.
-- Both streaming and non-streaming options for the Completions, Responses, and Messages APIs are supported.
+Legend:
+
+| Symbol | Meaning                                                                        |
+|--------|--------------------------------------------------------------------------------|
+| ✅      | Supported natively                                                             |
+| ✅¹     | Supported via Agentgateway translation                                         |
+| ✅²     | Supported via a local estimate by Agentgateway                                 |
+| ⚠️³    | Passthrough/provider-dependent; works only with a compatible upstream endpoint |
+| ✅⁴     | Supported, but behavior depends on model family or provider route              |
+| ◇      | Not currently implemented in Agentgateway                                      |
+| -      | Provider does not offer this capability                                        |
diff --git a/assets/agw-docs/standalone/deployment/binary.md b/assets/agw-docs/standalone/deployment/binary.md
index 15ab40fd6..f3e7a77bc 100644
--- a/assets/agw-docs/standalone/deployment/binary.md
+++ b/assets/agw-docs/standalone/deployment/binary.md
@@ -4,7 +4,7 @@ To run agentgateway as a standalone binary, follow the steps to download, instal
 
 {{% steps %}}
 
-### Step 1: Download and install
+### Download and install
 
 Download and install the agentgateway binary. Alternatively, you can manually download the binary from the [agentgateway releases page](https://github.com/agentgateway/agentgateway/releases/latest).
 
@@ -79,7 +79,7 @@ Password:
 agentgateway installed into /usr/local/bin/agentgateway
 ```
 
-### Step 2: Verify the installation
+### Verify the installation
 
 Verify that the `agentgateway` binary is installed.
 
@@ -99,26 +99,22 @@ Example output with the latest version, {{< reuse "agw-docs/versions/n-patch.md"
 }
 ```
 
-### Step 3: Create a configuration file
+### Run agentgateway
 
-Create a [configuration file]({{< link-hextra path="/configuration/" >}}) for agentgateway. In this example, `config.yaml` is used. You might start with [this simple example configuration file](https://agentgateway.dev/examples/basic/config.yaml).
+To run agentgateway, execute the binary. Configuration is stored in `~/.config/agentgateway`.
 
-```yaml
-{{< github url="https://agentgateway.dev/examples/basic/config.yaml" >}}
+```sh
+agentgateway
 ```
 
-### Step 4: Run agentgateway
+To specify an explicit configuration file, use `-f`:
 
 ```sh
 agentgateway -f config.yaml
 ```
 
-Example output:
+You might start with [this simple example configuration file](https://agentgateway.dev/examples/basic/config.yaml).
 
-```
-info  state_manager  loaded config from File("config.yaml")
-info  app            serving UI at http://localhost:15000/ui
-info  proxy::gateway started bind  bind="bind/3000"
-```
+Open <http://localhost:15000/ui> to get started!
 
 {{% /steps %}}
diff --git a/assets/agw-docs/standalone/virtual-keys.md b/assets/agw-docs/standalone/virtual-keys.md
index 2b585bee7..bfdff2603 100644
--- a/assets/agw-docs/standalone/virtual-keys.md
+++ b/assets/agw-docs/standalone/virtual-keys.md
@@ -198,6 +198,10 @@ EOF
 
 LLMs typically charge per input and output token. Without spending control, users can quickly generate large bills by submitting long prompts, streaming or retrying requests, or running recursive agent loops. To protect against unexpected bills, scaling surprises, and abuse, use token-based rate limits to cap the number of tokens that can be used.
 
+{{< callout type="warning" >}}
+`localRateLimit` is a **gateway-wide** limit, not a per-key limit. It enforces a single shared token budget across **all** requests and API keys.
+{{< /callout >}}
+
 ### How rate limiting works
 
 Agentgateway checks token-based rate limits in two phases:
@@ -352,61 +356,6 @@ EOF
 
 With this setting, requests are denied immediately if the estimated prompt token count exceeds the available budget.
 
-## Add a global token budget
-
-{{< callout type="warning" >}}
-`localRateLimit` is a **gateway-wide** limit, not a per-key limit. It enforces a single shared token budget across **all** requests and API keys.
-{{< /callout >}}
-
-To add a token budget that limits total token usage across all requests using more advanced routing options, use the routing-based configuration format with `localRateLimit`.
-
-{{< callout type="info" >}}
-Rate limiting requires the `binds/listeners/routes` configuration format because `localRateLimit` is an HTTP-level policy. For more information, see the [Routing-based configuration guide]({{< link-hextra path="/llm/configuration-modes/" >}}).
-{{< /callout >}}
-
-```yaml
-cat <<'EOF' > config.yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-
-binds:
-- port: 4000
-  listeners:
-  - routes:
-    - backends:
-      - ai:
-          name: openai
-          provider:
-            openAI:
-              model: gpt-3.5-turbo
-      policies:
-        apiKey:
-          mode: strict
-          keys:
-          - key: sk-alice-abc123def456
-            metadata:
-              user: alice
-          - key: sk-bob-xyz789uvw012
-            metadata:
-              user: bob
-        backendAuth:
-          key: "$OPENAI_API_KEY"
-        localRateLimit:
-        - maxTokens: 100000
-          tokensPerFill: 100000
-          fillInterval: 86400s
-          type: tokens
-EOF
-```
-
-| Setting | Description |
-| -- | -- |
-| `backendAuth` | The API key used to authenticate with the LLM provider backend. For configuration options, see [Manage API keys]({{< link-hextra path="/llm/api-keys/" >}}). |
-| `localRateLimit` | Token-based rate limiting applied globally to **all** requests through this route, regardless of which API key is used. |
-| `maxTokens` | The maximum number of tokens available in the shared budget. |
-| `tokensPerFill` | The number of tokens added during each refill. |
-| `fillInterval` | The interval between refills. Use `86400s` for a daily budget. |
-| `type` | Set to `tokens` for token-based limits. Use `requests` for request-based limits. |
-
 For more information about rate limiting configuration options, see [Rate limits]({{< link-hextra path="/configuration/resiliency/rate-limits/" >}}).
 
 ## Monitor per-key spending
diff --git a/content/docs/standalone/main/deployment/docker/_index.md b/content/docs/standalone/main/deployment/docker/_index.md
index 07c55f8f6..76ee64e1c 100644
--- a/content/docs/standalone/main/deployment/docker/_index.md
+++ b/content/docs/standalone/main/deployment/docker/_index.md
@@ -6,32 +6,47 @@ description: Overview of how to deploy agentgateway with Docker.
 
 To run agentgateway as a Docker container, agentgateway publishes official Docker images at `cr.agentgateway.dev/agentgateway`.
 
-Before you begin, create a [configuration file]({{< link-hextra path="/configuration/" >}}) for agentgateway. In this example, `config.yaml` is used.
-You might start with [this simple example configuration file](https://agentgateway.dev/examples/basic/config.yaml).
 
 ## Docker
 
-To run agentgateway with Docker, mount your configuration file into the container and expose any necessary ports.
+To run agentgateway with Docker, you may either mount your [configuration file]({{< link-hextra path="/configuration/" >}}) directly, or mount a directory
+and create the configuration in the UI:
 
 ```sh
-docker run -v ./config.yaml:/config.yaml -p 3000:3000 \
-  cr.agentgateway.dev/agentgateway:v{{< reuse "agw-docs/versions/n-patch.md" >}} \
-  -f /config.yaml
+mkdir agentgateway-config
+docker run \
+  --user "$(id -u):$(id -g)" \
+  -v ./agentgateway-config:/config \
+  -p 3000:3000 -p 4000:4000 -p 127.0.0.1:15000:15000 \
+  cr.agentgateway.dev/agentgateway:v{{< reuse "agw-docs/versions/n-patch.md" >}}
 ```
 
-By default, the agentgateway admin UI listens on localhost, which is not exposed outside of the container.
-To access the UI, you can change the bind address and expose the port.
+In this mode, agentgateway automatically creates a configuration file that sets up logging and exposes the admin UI.
+The `--user` flag runs the container as the current user so that it can read and write the configuration.
+
+Alternatively, you can provide an explicit configuration file. By default, the agentgateway admin UI listens on localhost, which is not exposed outside of the container;
+the optional `ADMIN_ADDR` setting below changes the bind address so that the UI is reachable.
 
 ```sh
-docker run -v ./config.yaml:/config.yaml -p 3000:3000 \
-  -p 127.0.0.1:15000:15000 -e ADMIN_ADDR=0.0.0.0:15000 \
+docker run \
+  --user "$(id -u):$(id -g)" \
+  -v ./config.yaml:/config.yaml \
+  -p 3000:3000 -p 4000:4000 -p 127.0.0.1:15000:15000 \
+  -e ADMIN_ADDR=0.0.0.0:15000 \
   cr.agentgateway.dev/agentgateway:v{{< reuse "agw-docs/versions/n-patch.md" >}} \
   -f /config.yaml
 ```
 
+Open <http://localhost:15000/ui> to get started!
+
 ## Docker Compose
 
-To run agentgateway in Docker Compose, follow a similar approach to mount the configuration file and expose the ports.
+To run agentgateway in Docker Compose, follow the same approach as above. Create a directory for the configuration and start the service.
+
+```sh
+mkdir agentgateway-config
+docker compose up
+```
 
 ```yaml
 services:
@@ -39,12 +54,14 @@ services:
     container_name: agentgateway
     restart: unless-stopped
     image: cr.agentgateway.dev/agentgateway:v{{< reuse "agw-docs/versions/n-patch.md" >}}
+    # Replace with your user and group IDs, such as the output of: id -u && id -g
+    user: "1000:1000"
     ports:
       - "3000:3000"
+      - "4000:4000"
       - "127.0.0.1:15000:15000"
     volumes:
-      - ./config.yaml:/config.yaml
-    environment:
-      - ADMIN_ADDR=0.0.0.0:15000
-    command: ["-f", "/config.yaml"]
+      - ./agentgateway-config:/config
 ```
+
+Open <http://localhost:15000/ui> to get started!
diff --git a/content/docs/standalone/main/llm/_index.md b/content/docs/standalone/main/llm/_index.md
index 43344887f..86d044a73 100644
--- a/content/docs/standalone/main/llm/_index.md
+++ b/content/docs/standalone/main/llm/_index.md
@@ -8,4 +8,5 @@ next: /reference/observability
 test: skip
 ---
 
-Consume LLM services by setting up AI backends for your LLM providers.
+Agentgateway can act as a feature-rich AI/LLM gateway that proxies requests between your applications and LLM providers.
+This lets you connect to thousands of LLM models through a unified interface that provides governance, observability, and reliability controls.
diff --git a/content/docs/standalone/main/llm/about.md b/content/docs/standalone/main/llm/about.md
index 25c303ded..8c20d9ff8 100644
--- a/content/docs/standalone/main/llm/about.md
+++ b/content/docs/standalone/main/llm/about.md
@@ -35,10 +35,6 @@ Many providers now have dedicated integrations with preconfigured base URLs and
 - [OpenRouter]({{< link-hextra path="/llm/providers/openrouter/" >}})
 - [Fireworks AI]({{< link-hextra path="/llm/providers/fireworks/" >}})
 
-### OpenAI-compatible fallback
-
-Use [OpenAI-compatible]({{< link-hextra path="/llm/providers/openai-compatible/" >}}) for Perplexity, vLLM, LM Studio, or another provider without built-in support.
-
 ### Self-hosted solutions
 
 Run models locally or in your own infrastructure:
@@ -46,14 +42,19 @@ Run models locally or in your own infrastructure:
 - [vLLM]({{< link-hextra path="/llm/providers/openai-compatible/#vllm" >}})
 - [LM Studio]({{< link-hextra path="/llm/providers/openai-compatible/#lm-studio" >}})
 
+### Custom providers
+
+Use a [custom provider]({{< link-hextra path="/llm/providers/openai-compatible/" >}}) for providers without direct support, such as Perplexity, vLLM, or LM Studio.
+Agentgateway supports all of the common LLM formats and can generally integrate with any provider ([file an issue](https://github.com/agentgateway/agentgateway/issues/new) if one is missing!).
+
 ## Using the API
 
-By default, requests to agentgateway use the [OpenAI Chat Completions](https://developers.openai.com/api/reference/chat-completions/overview) API.
-These requests are translated to the upstream provider's API.
+Agentgateway exposes multiple API endpoints, including [OpenAI Chat Completions](https://developers.openai.com/api/reference/chat-completions/overview), [Anthropic Messages](https://platform.claude.com/docs/en/api/messages), and more.
+Depending on the API used in the request and the provider selected, agentgateway either passes the request through or translates it as needed.
 
-Using the Chat Completions API works exactly the same as consuming OpenAI, with a change to the base URL.
-This allows you to continue using existing code and SDKs.
+This provides a unified API regardless of the provider, so that clients can connect to any provider no matter which API they use.
 
+The following basic examples use the Chat Completions API.
 {{< callout type="info" >}}
 For detailed configuration of specific API endpoint types, including Chat Completions and the OpenAI Realtime API, see [API types]({{< link-hextra path="/llm/api-types/" >}}).
 {{< /callout >}}
@@ -62,7 +63,7 @@ For detailed configuration of specific API endpoint types, including Chat Comple
 {{% tab %}}
 
 ```shell
-curl 'http://localhost:4000/' \
+curl 'http://localhost:4000/v1/chat/completions' \
 --header 'Content-Type: application/json' \
 --data ' {
   "model": "gpt-3.5-turbo",
@@ -89,7 +90,7 @@ import openai
 
 client = openai.OpenAI(
     api_key="anything",
-    base_url="http://localhost:4000"
+    base_url="http://localhost:4000/v1"
 )
 
 response = client.chat.completions.create(model="gpt-4o-mini", messages = [
@@ -110,7 +111,7 @@ import OpenAI from "openai";
 
 const openai = new OpenAI({
   apiKey: "anything",
-  baseURL: "http://localhost:4000",
+  baseURL: "http://localhost:4000/v1",
 });
 const response = await openai.chat.completions.create({
   model: "gpt-4o-mini",
@@ -125,16 +126,25 @@ console.log(response);
 
 ## Model routing and aliases
 
-Model routing is configured within the `llm` section of your agentgateway configuration file. The top-level configuration file is organized into sections such as `config`, `binds`, `llm`, `mcp`, `services`, and `workloads`; for a complete overview, see [Configuration overview]({{< link-hextra path="/configuration/overview/" >}}). The `llm` section offers a simplified, model-centric approach compared to the traditional `binds/listeners/routes` model; for more details on the two approaches, see [LLM configuration modes]({{< link-hextra path="/llm/configuration-modes/" >}}). The model configurations shown in this section live under the `llm.models` key.
+Model routing is configured within the `llm` section of your agentgateway configuration file. 
+The `llm` section offers a simplified, model-centric approach compared to the traditional `binds/listeners/routes` model; for more details on the two approaches, see [LLM configuration modes]({{< link-hextra path="/llm/configuration-modes/" >}}).
+The model configurations shown in this section live under the `llm.models` key.
+
+Agentgateway routes requests by matching the incoming model name and then sending the request to the configured model.
+The outgoing model name can be passed through unchanged from the request, transformed, or set to a static value.
 
-When you configure a model in the `llm` section, two fields control how requests are routed, as shown in the following table.
+Some examples:
+
+* Match `fast` and send to `gpt-mini`.
+* Match `*` and forward the model as-is.
+* Match `openai/*` and strip the `openai/` prefix, forwarding the remaining model as-is.
 
 | Field | Purpose |
 |-------|---------|
 | `models.name` | The model name to match in incoming client requests. Agentgateway compares this value against the `model` field in the request body. Use a wildcard `*` to match any model name. |
 | `params.model` | The model name sent to the upstream provider. If set, this overrides the model from the request. If not set, the model from the request is passed through. |
 
-### Pass-through mode
+### Passthrough
 
 Use `name: "*"` without setting `params.model` to accept any model name and pass it directly to the provider. This is the simplest configuration for single-provider setups.
 
@@ -142,16 +152,35 @@ Use `name: "*"` without setting `params.model` to accept any model name and pass
 llm:
   models:
   - name: "*"
-    provider: openAI
+    provider: openai
     params:
       apiKey: "$OPENAI_API_KEY"
 ```
 
 Clients specify the actual model in their requests, such as `"model": "gpt-4o-mini"`, and agentgateway forwards it to the provider as-is.
 
+### Prefixed passthrough
+
+Use `name: "openai/*"` without setting `params.model`, and add a `transformation` that strips the prefix, to accept model names like `openai/gpt-4o-mini` and forward them to OpenAI as `gpt-4o-mini`.
+This is the recommended approach when you want to expose all models from multiple providers.
+
+```yaml
+llm:
+  models:
+  - name: "openai/*"
+    provider: openai
+    params:
+      apiKey: "$OPENAI_API_KEY"
+    transformation:
+      model: llmRequest.model.stripPrefix("openai/")
+```
+
+Clients specify the provider and model in their requests, such as `"model": "openai/gpt-4o-mini"`, and agentgateway forwards the request to OpenAI as `gpt-4o-mini`.
+
 ### Model aliases
 
-Set `name` to a user-friendly alias and `params.model` to the actual provider model. This lets you decouple client-facing model names from provider-specific identifiers, making it easier to swap models without updating client code.
+Set `name` to a user-friendly alias and `params.model` to the actual provider model.
+This lets you decouple client-facing model names from provider-specific identifiers, making it easier to swap models without updating client code.
 
 ```yaml
 llm:
@@ -172,17 +201,13 @@ Clients send `"model": "fast"` or `"model": "smart"`, and agentgateway translate
 
 ### Route priority
 
-When multiple models match a request, agentgateway selects the best match by using the following priority order:
-
-1. **Match specificity**: Routes with more match criteria take priority. For example, a route with two header matchers ranks higher than a route with one.
-2. **Config order**: When two routes have equal specificity, the route listed first in the configuration file takes priority.
-
-This means you can control tie-breaking behavior by ordering your models in the config. Place more specific routes before generic or wildcard routes to ensure they match first.
+When multiple models match a request, the more specific match takes precedence.
+For example, with the following configuration, requests for models that match `accounts/fireworks/*` are routed to the `fireworks` provider first:
 
 ```yaml
 llm:
   models:
-  # Specific route — listed first, wins ties against the wildcard
+  # Specific route: wins ties against the wildcard
   - name: "accounts/fireworks/*"
     provider: fireworks
     matches:
@@ -192,9 +217,7 @@ llm:
           exact: "eng"
     params:
       apiKey: "$FIREWORKS_API_KEY"
-      # Optional. Override the default Fireworks endpoint:
-      # baseUrl: "https://api.fireworks.ai/inference/v1"
-  # Catch-all route — matches anything, but lower priority
+  # Catch-all route: matches anything, but lower priority
   - name: "*"
     provider: openAI
     matches:
diff --git a/content/docs/standalone/main/llm/api-keys.md b/content/docs/standalone/main/llm/api-keys.md
deleted file mode 100644
index 95e6da66a..000000000
--- a/content/docs/standalone/main/llm/api-keys.md
+++ /dev/null
@@ -1,125 +0,0 @@
----
-title: Manage API keys
-weight: 40
-description: Manage API keys for LLM provider authentication.
-prev: /llm/providers
----
-
-Managing API keys is an important security mechanism to prevent unauthorized access to your LLM provider. If API keys are compromised, attackers can deliberately run expensive queries, such as large and recursive prompts, at your expense.
-
-You can choose between the following options to provide an API key to agentgateway: 
-* Inline
-* Environment variable
-* File
-* Kubernetes secret or passthrough token
-
-Follow the instructions in this guide to learn how to use these different methods. 
-
-## Before you begin
-
-{{< reuse "agw-docs/snippets/prereq-agentgateway.md" >}}
-
-## Configure your agentgateway proxy
-
-Browse through the tabs to learn about different ways for how to provide your API key to agentgateway. 
-
-{{< tabs items="Inline,Environment variable,File,Kubernetes secret or passthrough token" >}}
-
-{{% tab %}}
-
-You can provide your API key directly in the agentgateway configuration. This option is the least secure. Only use this option for quick tests.
-
-1. Configure the agentgateway proxy and enter your key in the `params.apiKey` field directly.
-   ```yaml
-   cat <<EOF > config.yaml
-   # yaml-language-server: $schema=https://agentgateway.dev/schema/config
-   llm:
-     models:
-     - name: "*"
-       provider: openAI
-       params:
-         apiKey: "sk-proj...."
-   EOF
-   ```
-
-{{% /tab %}}
-{{% tab %}}
-
-1. Get the token from your LLM provider, such as an API key to OpenAI and save it as an environment variable.
-   ```sh
-   export OPENAI_API_KEY=<your-api-key>
-   ```
-
-2. Configure the agentgateway proxy to refer to that environment variable. Agentgateway automatically replaces the value of the variable with the value that is stored in the environment.
-   ```yaml
-   cat <<'EOF' > config.yaml
-   # yaml-language-server: $schema=https://agentgateway.dev/schema/config
-   llm:
-     models:
-     - name: "*"
-       provider: openAI
-       params:
-         apiKey: "$OPENAI_API_KEY"
-   EOF
-   ```
-   
-{{% /tab %}}
-{{% tab %}}
-
-You can store your API key in a file and load the file into agentgateway during startup.
-
-1. Save your API key in a file, such as `key.txt`.
-   ```sh
-   echo "<your-apikey>" >> key.txt
-   ```
-
-2. Load the key from the file into an environment variable and configure the agentgateway proxy.
-   ```sh
-   export OPENAI_API_KEY=$(cat key.txt)
-   ```
-
-   ```yaml
-   cat <<'EOF' > config.yaml
-   # yaml-language-server: $schema=https://agentgateway.dev/schema/config
-   llm:
-     models:
-     - name: "*"
-       provider: openAI
-       params:
-         apiKey: "$OPENAI_API_KEY"
-   EOF
-   ```
-{{% /tab %}}
-{{% tab %}}
-
-When deploying agentgateway on Kubernetes, you can leverage Kubernetes secrets to store your API key or pass through a token by using an `Authorization` or other custom header. 
-
-For more information, see the [agentgateway on Kubernetes docs](https://agentgateway.dev/docs/kubernetes/latest/llm/api-keys/). 
-
-{{% /tab %}}
-{{< /tabs >}}
-
-## Authenticate incoming LLM API calls
-
-In addition to sending provider API keys upstream, you can authenticate incoming requests on the local LLM listener with `llm.policies.apiKey`.
-
-Set `llm.policies.apiKey.mode: permissive` when you want to populate API key metadata for later policies (for example, authorization or logging), without rejecting requests based on authentication.
-
-```yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-llm:
-  policies:
-    apiKey:
-      mode: permissive
-      keys:
-      - key: sk-team-engineering
-        metadata:
-          team: engineering
-  models:
-  - name: "*"
-    provider: openAI
-    params:
-      apiKey: "$OPENAI_API_KEY"
-```
-
-For the authentication mode semantics (`strict`, `optional`, and `permissive`), see [API Key authentication]({{< link-hextra path="/configuration/security/apikey-authn/" >}}).
diff --git a/content/docs/standalone/main/llm/api-types/_index.md b/content/docs/standalone/main/llm/api-types/_index.md
index b39d4af96..6ff6eccb4 100644
--- a/content/docs/standalone/main/llm/api-types/_index.md
+++ b/content/docs/standalone/main/llm/api-types/_index.md
@@ -5,14 +5,17 @@ description: Supported LLM API endpoint types and route configurations
 test: skip
 ---
 
-Agentgateway supports multiple LLM API endpoint types, called *route types*, that determine how clients interact with the gateway and how requests are routed to backends. In the simplified `llm` configuration, agentgateway maps standard endpoint paths to these route types automatically. In the `binds/listeners/routes` configuration, you set the route type explicitly in the `policies.ai.routes` map.
+Agentgateway natively supports multiple LLM API endpoint types.
+These endpoints are automatically exposed on the gateway and translated as needed for the selected provider.
 
 The following API types have dedicated guides:
 
-- **[Chat completions]({{< link-hextra path="/llm/api-types/completions/" >}})** — The OpenAI `/v1/chat/completions` endpoint. This is the most widely used API type for text generation and chat applications.
-- **[Responses]({{< link-hextra path="/llm/api-types/responses/" >}})** — The OpenAI `/v1/responses` endpoint for stateful, multi-step model interactions.
-- **[Messages]({{< link-hextra path="/llm/api-types/messages/" >}})** — The Anthropic `/v1/messages` endpoint for Claude models.
-- **[Realtime]({{< link-hextra path="/llm/api-types/realtime/" >}})** — The OpenAI Realtime API for low-latency, streaming voice and text interactions over WebSockets.
-- **[Passthrough]({{< link-hextra path="/llm/api-types/passthrough/" >}})** — Forwards requests directly to the backend provider without transformation.
-
-Agentgateway also recognizes additional route types for specific endpoints, including `embeddings` (`/v1/embeddings`), `models` (`/v1/models`), and `anthropicTokenCount` (`/v1/messages/count_tokens`).
+- **[Chat completions]({{< link-hextra path="/llm/api-types/completions/" >}})**: The OpenAI `/v1/chat/completions` endpoint. This is the most widely used API type for text generation and chat applications.
+- **[Responses]({{< link-hextra path="/llm/api-types/responses/" >}})**: The OpenAI `/v1/responses` endpoint for stateful, multi-step model interactions.
+- **[Messages]({{< link-hextra path="/llm/api-types/messages/" >}})**: The Anthropic `/v1/messages` endpoint for Claude models.
+- **[Embeddings]({{< link-hextra path="/llm/api-types/embeddings/" >}})**: The OpenAI-compatible `/v1/embeddings` endpoint for creating vector representations of text.
+- **[Realtime]({{< link-hextra path="/llm/api-types/realtime/" >}})**: The OpenAI Realtime API for low-latency, streaming voice and text interactions over WebSockets.
+- **[Rerank]({{< link-hextra path="/llm/api-types/rerank/" >}})**: The Cohere-compatible `/v2/rerank` endpoint for ranking documents by relevance to a query.
+- **[Models]({{< link-hextra path="/llm/api-types/models/" >}})**: The OpenAI-compatible `/v1/models` endpoint for listing available models.
+- **[Token count]({{< link-hextra path="/llm/api-types/token-count/" >}})**: The Anthropic `/v1/messages/count_tokens` endpoint for estimating input tokens.
+- **[Passthrough]({{< link-hextra path="/llm/api-types/passthrough/" >}})**: Forwards requests directly to the backend provider without transformation.
diff --git a/content/docs/standalone/main/llm/api-types/completions.md b/content/docs/standalone/main/llm/api-types/completions.md
index d75c75a13..660b63bc3 100644
--- a/content/docs/standalone/main/llm/api-types/completions.md
+++ b/content/docs/standalone/main/llm/api-types/completions.md
@@ -11,8 +11,6 @@ The OpenAI Chat Completions API (`/v1/chat/completions`) is the primary interfac
 
 The [OpenAI Chat Completions API](https://developers.openai.com/api/docs/guides/text) is the most widely used LLM endpoint. Agentgateway proxies these requests to your configured providers while providing token usage tracking, observability metrics, and policy enforcement.
 
-By default, requests to agentgateway use the Chat Completions API. These requests are translated to the upstream provider's native API format when necessary.
-
 ## Route type configuration
 
 In the simplified `llm` configuration, agentgateway automatically maps `/v1/chat/completions` requests to the `completions` route type, so no explicit route configuration is required.
@@ -56,7 +54,7 @@ For detailed information about model routing and configuration modes, see [Model
 
 Using the Chat Completions API works exactly the same as consuming OpenAI directly, with only a change to the base URL. This allows you to continue using existing code and SDKs.
 
-{{< tabs items="Curl,Python,JavaScript" >}}
+{{< tabs items="Curl,Python,JavaScript,Other" >}}
 {{% tab %}}
 
 ```shell
@@ -85,7 +83,7 @@ import openai
 
 client = openai.OpenAI(
     api_key="anything",
-    base_url="http://localhost:4000"
+    base_url="http://localhost:4000/v1"
 )
 
 response = client.chat.completions.create(
@@ -109,7 +107,7 @@ import OpenAI from "openai";
 
 const openai = new OpenAI({
   apiKey: "anything",
-  baseURL: "http://localhost:4000",
+  baseURL: "http://localhost:4000/v1",
 });
 
 const response = await openai.chat.completions.create({
@@ -121,13 +119,9 @@ console.log(response);
 ```
 
 {{% /tab %}}
-{{< /tabs >}}
-
-## Token usage tracking
-
-After sending Chat Completions requests, verify that agentgateway recorded token usage metrics.
+{{% tab %}}
 
-1. Open the agentgateway [metrics endpoint](http://localhost:15020/metrics).
-2. Look for the `agentgateway_gen_ai_client_token_usage` metric. The metric includes labels for the token type (`input` or `output`) and the model used.
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
 
-For more information about LLM metrics and observability, see [Observe traffic]({{< link-hextra path="/llm/observability/" >}}).
+{{% /tab %}}
+{{< /tabs >}}
diff --git a/content/docs/standalone/main/llm/api-types/embeddings.md b/content/docs/standalone/main/llm/api-types/embeddings.md
new file mode 100644
index 000000000..3352851de
--- /dev/null
+++ b/content/docs/standalone/main/llm/api-types/embeddings.md
@@ -0,0 +1,78 @@
+---
+title: Embeddings
+weight: 35
+description: Send embedding requests through agentgateway using the OpenAI-compatible Embeddings API.
+test: skip
+---
+
+The Embeddings API (`/v1/embeddings`) creates vector representations of text that you can use for search, retrieval, clustering, and other semantic workflows.
+
+## About
+
+Agentgateway supports the OpenAI-compatible Embeddings API. Requests to `/v1/embeddings` are routed to your configured provider while agentgateway applies the same routing, authentication, observability, and policy framework that you use for other LLM traffic.
+
+## Route type configuration
+
+In the simplified `llm` configuration, agentgateway automatically maps `/v1/embeddings` requests to the `embeddings` route type, so no explicit route configuration is required.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+llm:
+  models:
+  - name: "*"
+    provider: openAI
+    params:
+      apiKey: "$OPENAI_API_KEY"
+```
+
+To configure the route type explicitly, use the `binds/listeners/routes` format and set the `embeddings` route type in the `policies.ai.routes` map.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+binds:
+- port: 4000
+  listeners:
+  - routes:
+    - backends:
+      - ai:
+          name: openai
+          provider:
+            openAI: {}
+      policies:
+        ai:
+          routes:
+            "/v1/embeddings": "embeddings"
+        backendAuth:
+          key: "$OPENAI_API_KEY"
+```
+
+{{< callout type="info" >}}
+For detailed information about model routing and configuration modes, see [Model routing and aliases]({{< link-hextra path="/llm/about/" >}}).
+{{< /callout >}}
+
+## Using the API
+
+Send a request to the `/v1/embeddings` endpoint. The response includes an embedding vector for each input item.
+
+{{< tabs items="Curl,Other" >}}
+{{% tab %}}
+
+```shell
+curl 'http://localhost:4000/v1/embeddings' \
+--header 'Content-Type: application/json' \
+--data '{
+  "model": "text-embedding-3-small",
+  "input": [
+    "agentgateway routes LLM traffic",
+    "embeddings turn text into vectors"
+  ]
+}'
+```
+
+{{% /tab %}}
+{{% tab %}}
+
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
+
+{{% /tab %}}
+{{< /tabs >}}
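Note on `embeddings.md`: the page says the response includes an embedding vector for each input item. A minimal Python sketch of how a client might consume that response is shown here, assuming the gateway returns the standard OpenAI Embeddings response shape (`data[i].index` and `data[i].embedding`) and listens on `http://localhost:4000` as in the curl example. The helper names are hypothetical and only illustrate the flow; they are not part of agentgateway.

```python
import json
import math
import urllib.request

# Assumed local gateway address, taken from the curl example in the page.
GATEWAY_URL = "http://localhost:4000/v1/embeddings"


def build_embeddings_request(model, inputs):
    """Build (but do not send) a POST request equivalent to the curl example."""
    body = json.dumps({"model": model, "input": list(inputs)}).encode("utf-8")
    return urllib.request.Request(
        GATEWAY_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_vectors(response_json):
    """Return the embedding vectors ordered by their `index` field."""
    items = sorted(response_json["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

To send the request against a running gateway, pass the result of `build_embeddings_request(...)` to `urllib.request.urlopen`, parse the body with `json.load`, and hand the parsed object to `extract_vectors`.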
diff --git a/content/docs/standalone/main/llm/api-types/messages.md b/content/docs/standalone/main/llm/api-types/messages.md
index a5c9923af..690f50e43 100644
--- a/content/docs/standalone/main/llm/api-types/messages.md
+++ b/content/docs/standalone/main/llm/api-types/messages.md
@@ -9,9 +9,12 @@ The Anthropic Messages API (`/v1/messages`) is the native interface for Anthropi
 
 ## About
 
-The [Anthropic Messages API](https://platform.claude.com/docs/en/api/messages) is the primary endpoint for Claude models. Agentgateway proxies these requests to your configured providers while providing token usage tracking, observability metrics, and policy enforcement. Agentgateway automatically adds the `x-api-key` and `anthropic-version` headers that the Anthropic API requires.
+The [Anthropic Messages API](https://platform.claude.com/docs/en/api/messages) is the primary endpoint for Claude models.
+Agentgateway proxies these requests to your configured providers while providing token usage tracking, observability metrics, and policy enforcement.
 
-The related `/v1/messages/count_tokens` endpoint, which estimates token usage before sending a request, is handled by the `anthropicTokenCount` route type.
+When you use the Anthropic provider, agentgateway automatically handles additional requirements, such as adding the `x-api-key` and `anthropic-version` headers that the Anthropic API requires.
+
+The related [`/v1/messages/count_tokens`]({{< link-hextra path="/llm/api-types/token-count/" >}}) endpoint estimates token usage before sending a request and is handled by the `anthropicTokenCount` route type.
 
 ## Route type configuration
 
@@ -57,6 +60,9 @@ For detailed information about model routing and configuration modes, see [Model
 
 Send a request to the `/v1/messages` endpoint. The request is forwarded to the Anthropic API and the response is returned to the client.
 
+{{< tabs items="Curl,Other" >}}
+{{% tab %}}
+
 ```shell
 curl -X POST http://localhost:4000/v1/messages \
   -H "Content-Type: application/json" \
@@ -67,13 +73,12 @@ curl -X POST http://localhost:4000/v1/messages \
   }'
 ```
 
-For Anthropic-specific features such as token counting, extended thinking, and structured outputs, see the [Anthropic provider]({{< link-hextra path="/llm/providers/anthropic/" >}}) guide.
+{{% /tab %}}
+{{% tab %}}
 
-## Token usage tracking
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
 
-After sending Messages requests, verify that agentgateway recorded token usage metrics.
+{{% /tab %}}
+{{< /tabs >}}
 
-1. Open the agentgateway [metrics endpoint](http://localhost:15020/metrics).
-2. Look for the `agentgateway_gen_ai_client_token_usage` metric. The metric includes labels for the token type (`input` or `output`) and the model used.
-
-For more information about LLM metrics and observability, see [Observe traffic]({{< link-hextra path="/llm/observability/" >}}).
+For Anthropic-specific features such as token counting, extended thinking, and structured outputs, see the [Anthropic provider]({{< link-hextra path="/llm/providers/anthropic/" >}}) guide.
diff --git a/content/docs/standalone/main/llm/api-types/models.md b/content/docs/standalone/main/llm/api-types/models.md
new file mode 100644
index 000000000..377050e69
--- /dev/null
+++ b/content/docs/standalone/main/llm/api-types/models.md
@@ -0,0 +1,49 @@
+---
+title: Models
+weight: 55
+description: List available models through agentgateway using the OpenAI-compatible Models API.
+test: skip
+---
+
+The Models API (`/v1/models`) lists the models that are available through the configured LLM provider.
+
+## About
+
+Agentgateway supports the OpenAI-compatible Models API. Use this endpoint when clients, such as web UIs, SDKs, or developer tools that populate model selectors from `/v1/models`, need to discover available model IDs.
+
+## Route type configuration
+
+In the simplified `llm` configuration, agentgateway automatically maps `/v1/models` requests to the `models` route type, so no explicit route configuration is required.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+llm:
+  models:
+  - name: "*"
+    provider: openAI
+    params:
+      apiKey: "$OPENAI_API_KEY"
+```
+
+{{< callout type="info" >}}
+For detailed information about model routing and configuration modes, see [Model routing and aliases]({{< link-hextra path="/llm/about/" >}}).
+{{< /callout >}}
+
+## Using the API
+
+Send a request to the `/v1/models` endpoint to list models from the upstream provider.
+
+{{< tabs items="Curl,Other" >}}
+{{% tab %}}
+
+```shell
+curl 'http://localhost:4000/v1/models'
+```
+
+{{% /tab %}}
+{{% tab %}}
+
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
+
+{{% /tab %}}
+{{< /tabs >}}
diff --git a/content/docs/standalone/main/llm/api-types/rerank.md b/content/docs/standalone/main/llm/api-types/rerank.md
new file mode 100644
index 000000000..a1471cde0
--- /dev/null
+++ b/content/docs/standalone/main/llm/api-types/rerank.md
@@ -0,0 +1,85 @@
+---
+title: Rerank
+weight: 45
+description: Send rerank requests through agentgateway using the Cohere-compatible Rerank API.
+test: skip
+---
+
+The Rerank API (`/v2/rerank`) scores a list of documents against a query and returns the most relevant results in ranked order.
+
+## About
+
+Agentgateway supports the Cohere-compatible Rerank API. Use rerank when you already have a candidate set of documents, such as from keyword search or vector search, and want a model to reorder those documents by relevance to a query.
+
+Agentgateway also recognizes `/v1/rerank` as a rerank route, but `/v2/rerank` is the Cohere-compatible endpoint.
+
+## Route type configuration
+
+In the simplified `llm` configuration, agentgateway automatically maps `/v2/rerank` requests to the `rerank` route type, so no explicit route configuration is required.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+llm:
+  models:
+  - name: "*"
+    provider: cohere
+    params:
+      apiKey: "$COHERE_API_KEY"
+```
+
+To configure the route type explicitly, use the `binds/listeners/routes` format and set the `rerank` route type in the `policies.ai.routes` map.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+binds:
+- port: 4000
+  listeners:
+  - routes:
+    - backends:
+      - ai:
+          name: cohere
+          provider:
+            cohere: {}
+      policies:
+        ai:
+          routes:
+            "/v2/rerank": "rerank"
+        backendAuth:
+          key: "$COHERE_API_KEY"
+```
+
+{{< callout type="info" >}}
+For detailed information about model routing and configuration modes, see [Model routing and aliases]({{< link-hextra path="/llm/about/" >}}).
+{{< /callout >}}
+
+## Using the API
+
+Send a request to the `/v2/rerank` endpoint with a query and candidate documents. The response ranks the documents by relevance.
+
+{{< tabs items="Curl,Other" >}}
+{{% tab %}}
+
+```shell
+curl 'http://localhost:4000/v2/rerank' \
+--header 'Content-Type: application/json' \
+--data '{
+  "model": "rerank-v3.5",
+  "query": "What does agentgateway do?",
+  "documents": [
+    "agentgateway routes, secures, and observes agent and LLM traffic.",
+    "A bicycle drivetrain transfers power from pedals to wheels.",
+    "Vector databases store embeddings for semantic search."
+  ],
+  "top_n": 2
+}'
+```
+
+{{% /tab %}}
+{{% tab %}}
+
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
+
+{{% /tab %}}
+{{< /tabs >}}
+
+For more information about configuring Cohere, see the [Cohere provider]({{< link-hextra path="/llm/providers/cohere/" >}}) guide.
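Note on `rerank.md`: the page says the response ranks the documents by relevance. A minimal Python sketch of mapping ranked results back to the original documents follows, assuming the Cohere v2 rerank response shape in which each entry of `results` carries an `index` into the request's `documents` list and a `relevance_score`. The function name and the sample scores are hypothetical.

```python
def top_documents(documents, rerank_response):
    """Pair each ranked result with its original document text.

    `documents` is the list sent in the request; `rerank_response` is the
    parsed JSON body. Results are sorted by score, highest first, in case
    the caller merged or reordered them.
    """
    ranked = sorted(
        rerank_response["results"],
        key=lambda result: result["relevance_score"],
        reverse=True,
    )
    return [(documents[r["index"]], r["relevance_score"]) for r in ranked]


if __name__ == "__main__":
    docs = [
        "agentgateway routes, secures, and observes agent and LLM traffic.",
        "A bicycle drivetrain transfers power from pedals to wheels.",
        "Vector databases store embeddings for semantic search.",
    ]
    # Illustrative response only; real scores depend on the model.
    sample = {
        "results": [
            {"index": 0, "relevance_score": 0.9},
            {"index": 2, "relevance_score": 0.2},
        ]
    }
    for text, score in top_documents(docs, sample):
        print(f"{score:.2f}  {text}")
```

Because the response refers to documents by position, the client has to keep the request's `documents` list in the same order until the response is processed.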
diff --git a/content/docs/standalone/main/llm/api-types/responses.md b/content/docs/standalone/main/llm/api-types/responses.md
index db5bf0cdb..30a1c8c90 100644
--- a/content/docs/standalone/main/llm/api-types/responses.md
+++ b/content/docs/standalone/main/llm/api-types/responses.md
@@ -54,7 +54,7 @@ For detailed information about model routing and configuration modes, see [Model
 
 Using the Responses API works exactly the same as consuming OpenAI directly, with only a change to the base URL. This allows you to continue using existing code and SDKs.
 
-{{< tabs items="Curl,Python,JavaScript" >}}
+{{< tabs items="Curl,Python,JavaScript,Other" >}}
 {{% tab %}}
 
 ```shell
@@ -78,7 +78,7 @@ import openai
 
 client = openai.OpenAI(
     api_key="anything",
-    base_url="http://localhost:4000"
+    base_url="http://localhost:4000/v1"
 )
 
 response = client.responses.create(
@@ -97,7 +97,7 @@ import OpenAI from "openai";
 
 const openai = new OpenAI({
   apiKey: "anything",
-  baseURL: "http://localhost:4000",
+  baseURL: "http://localhost:4000/v1",
 });
 
 const response = await openai.responses.create({
@@ -109,13 +109,9 @@ console.log(response);
 ```
 
 {{% /tab %}}
-{{< /tabs >}}
-
-## Token usage tracking
-
-After sending Responses requests, verify that agentgateway recorded token usage metrics.
+{{% tab %}}
 
-1. Open the agentgateway [metrics endpoint](http://localhost:15020/metrics).
-2. Look for the `agentgateway_gen_ai_client_token_usage` metric. The metric includes labels for the token type (`input` or `output`) and the model used.
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
 
-For more information about LLM metrics and observability, see [Observe traffic]({{< link-hextra path="/llm/observability/" >}}).
+{{% /tab %}}
+{{< /tabs >}}
diff --git a/content/docs/standalone/main/llm/api-types/token-count.md b/content/docs/standalone/main/llm/api-types/token-count.md
new file mode 100644
index 000000000..2602f261f
--- /dev/null
+++ b/content/docs/standalone/main/llm/api-types/token-count.md
@@ -0,0 +1,83 @@
+---
+title: Token count
+weight: 60
+description: Count tokens through agentgateway using the Anthropic Messages token-count API.
+test: skip
+---
+
+The Anthropic token-count API (`/v1/messages/count_tokens`) estimates the number of input tokens in an Anthropic Messages request before sending it to a model.
+
+## About
+
+Agentgateway supports the Anthropic Messages token-count endpoint with the `anthropicTokenCount` route type. Use this endpoint when clients need to estimate request size before calling `/v1/messages`, for example to enforce budgets, stay within context-window limits, or show usage estimates.
+
+## Route type configuration
+
+In the simplified `llm` configuration, agentgateway automatically maps `/v1/messages/count_tokens` requests to the `anthropicTokenCount` route type, so no explicit route configuration is required.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+llm:
+  models:
+  - name: "*"
+    provider: anthropic
+    params:
+      apiKey: "$ANTHROPIC_API_KEY"
+```
+
+To configure the route type explicitly, use the `binds/listeners/routes` format and set the `anthropicTokenCount` route type in the `policies.ai.routes` map. Most configurations also map `/v1/messages` to the `messages` route type for the actual model request.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+binds:
+- port: 4000
+  listeners:
+  - routes:
+    - backends:
+      - ai:
+          name: anthropic
+          provider:
+            anthropic: {}
+      policies:
+        ai:
+          routes:
+            "/v1/messages": "messages"
+            "/v1/messages/count_tokens": "anthropicTokenCount"
+        backendAuth:
+          key: "$ANTHROPIC_API_KEY"
+```
+
+{{< callout type="info" >}}
+For detailed information about model routing and configuration modes, see [Model routing and aliases]({{< link-hextra path="/llm/about/" >}}).
+{{< /callout >}}
+
+## Using the API
+
+Send a request to the `/v1/messages/count_tokens` endpoint with the same message shape that you would send to `/v1/messages`.
+
+{{< tabs items="Curl,Other" >}}
+{{% tab %}}
+
+```shell
+curl 'http://localhost:4000/v1/messages/count_tokens' \
+--header 'Content-Type: application/json' \
+--data '{
+  "model": "claude-opus-4-6",
+  "messages": [
+    {
+      "role": "user",
+      "content": "How many tokens are in this request?"
+    }
+  ]
+}'
+```
+
+{{% /tab %}}
+{{% tab %}}
+
+[View other LLM client integrations](/docs/standalone/main/integrations/llm-clients/).
+
+{{% /tab %}}
+{{< /tabs >}}
+
+For Anthropic-specific features such as Messages, token counting, extended thinking, and structured outputs, see the [Anthropic provider]({{< link-hextra path="/llm/providers/anthropic/" >}}) guide.
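Note on `token-count.md`: the page suggests using the count to enforce budgets or stay within context-window limits. A minimal Python sketch of that check follows, assuming the response body has the Anthropic shape `{"input_tokens": N}`. The function names, the context-window size, and the reserved output budget are hypothetical values chosen for illustration; check the limits of the model you actually use.

```python
def remaining_headroom(count_response, context_window, reserved_output_tokens):
    """Tokens left after the counted input and the reserved output budget."""
    return context_window - count_response["input_tokens"] - reserved_output_tokens


def fits_context(count_response, context_window, reserved_output_tokens):
    """True when the request plus the reserved output fits in the window."""
    return remaining_headroom(
        count_response, context_window, reserved_output_tokens
    ) >= 0


if __name__ == "__main__":
    # Illustrative numbers only, not taken from any provider documentation.
    counted = {"input_tokens": 14}
    print(fits_context(counted, context_window=200, reserved_output_tokens=100))
```

A client would call `/v1/messages/count_tokens` first, run a check like this on the parsed response, and only then send the same messages to `/v1/messages`.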
diff --git a/content/docs/standalone/main/llm/content-routing.md b/content/docs/standalone/main/llm/content-routing.md
deleted file mode 100644
index 2583f1476..000000000
--- a/content/docs/standalone/main/llm/content-routing.md
+++ /dev/null
@@ -1,258 +0,0 @@
----
-title: Content-based routing
-weight: 45
-description: Route requests to different LLM backends based on request body content, such as the requested model name.
----
-
-Route requests to different LLM backends based on the content of the request body, not just headers or path (also known as body-based routing or intelligent routing).
-
-## About content-based routing
-
-Content-based routing allows you to route requests to different backends based on fields in the request body, such as the `model` field in an LLM API request. This is useful when you want to:
-
-- Route different models to different providers (e.g., `gpt-4` to OpenAI, `claude-3` to Anthropic)
-- Direct certain models to specific backends based on cost or performance
-- Route based on custom fields like user tier or priority level
-
-Agentgateway implements content-based routing by using transformations to extract values from the request body into headers, then using header-based routing rules to select the appropriate backend.
-
-### How it works
-
-Content-based routing works in two steps:
-
-1. **Extract body field to header**: Use a transformation policy to extract a field from the JSON request body (like `model`) into a custom header
-2. **Match on header**: Use header matching in the route to route based on that header value
-
-This pattern lets you route based on any field in the request body while using standard routing capabilities.
-
-## Before you begin
-
-{{< reuse "agw-docs/snippets/prereq-agentgateway.md" >}}
-
-## Route by model name
-
-This example shows how to route requests to different backends based on the `model` field in the request body.
-
-1. Create a configuration file with multiple routes that extract the `model` field from the request body and match on it. Each route uses a transformation to extract the model name into the `x-model` header, then matches on that header value.
-
-   ```yaml
-   cat <<EOF > config.yaml
-   # yaml-language-server: $schema=https://agentgateway.dev/schema/config
-   binds:
-   - port: 3000
-     listeners:
-     - routes:
-       # Route GPT models to OpenAI
-       - matches:
-         - path:
-             pathPrefix: "/"
-           headers:
-           - name: "x-model"
-             value:
-               regex: "^gpt-.*"
-         backends:
-         - ai:
-             name: openai
-             provider:
-               openAI:
-                 model: gpt-4o
-         policies:
-           backendAuth:
-             key: "$OPENAI_API_KEY"
-           transformations:
-             request:
-               set:
-                 x-model: 'json(request.body).model'
-           cors:
-             allowOrigins:
-               - "*"
-             allowHeaders:
-               - "*"
-       # Route Claude models to Anthropic
-       - matches:
-         - path:
-             pathPrefix: "/"
-           headers:
-           - name: "x-model"
-             value:
-               regex: "^claude-.*"
-         backends:
-         - ai:
-             name: anthropic
-             provider:
-               anthropic:
-                 model: claude-3-5-sonnet-latest
-         policies:
-           backendAuth:
-             key: "$ANTHROPIC_API_KEY"
-           transformations:
-             request:
-               set:
-                 x-model: 'json(request.body).model'
-           cors:
-             allowOrigins:
-               - "*"
-             allowHeaders:
-               - "*"
-   EOF
-   ```
-
-   {{< reuse "agw-docs/snippets/review-table.md" >}}
-
-   | Setting | Description |
-   | --- | --- |
-   | `matches.headers.name` | The name of the header to match on. In this example, `x-model` is the custom header that contains the extracted model name. |
-   | `matches.headers.value.regex` | A regular expression to match the header value. Routes with `^gpt-.*` match any model starting with "gpt", while `^claude-.*` matches any model starting with "claude". |
-   | `transformations.request.set` | A CEL expression that extracts the `model` field from the JSON request body using `json(request.body).model` and sets it as the `x-model` header. |
-
-2. Run the agentgateway.
-   ```sh
-   agentgateway -f config.yaml
-   ```
-
-3. Send a request with `gpt-4o` in the model field. Verify that the request routes to the OpenAI backend.
-
-   ```sh
-   curl 'http://0.0.0.0:3000/' \
-   --header 'Content-Type: application/json' \
-   --data '{
-     "model": "gpt-4o",
-     "messages": [
-       {
-         "role": "user",
-         "content": "Say hello"
-       }
-     ]
-   }' | jq -r '.model'
-   ```
-
-   Example output:
-   ```
-   gpt-4o-2024-08-06
-   ```
-
-4. Send a request with `claude-3-5-sonnet-latest` in the model field. Verify that the request routes to the Anthropic backend.
-
-   ```sh
-   curl 'http://0.0.0.0:3000/' \
-   --header 'Content-Type: application/json' \
-   --data '{
-     "model": "claude-3-5-sonnet-latest",
-     "messages": [
-       {
-         "role": "user",
-         "content": "Say hello"
-       }
-     ]
-   }' | jq -r '.model'
-   ```
-
-   Example output:
-   ```
-   claude-3-5-sonnet-20241022
-   ```
-
-## Route by custom field
-
-You can extract any field from the request body for routing decisions, not just the `model` field.
-
-This example shows routing based on a custom `priority` field in the request body to route high-priority requests to a more powerful model.
-
-```yaml
-cat <<EOF > config.yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-binds:
-- port: 3000
-  listeners:
-  - routes:
-    # High priority route
-    - matches:
-      - path:
-          pathPrefix: "/"
-        headers:
-        - name: "x-priority"
-          value:
-            exact: "high"
-      backends:
-      - ai:
-          name: openai-premium
-          provider:
-            openAI:
-              model: gpt-4o
-      policies:
-        backendAuth:
-          key: "$OPENAI_API_KEY"
-        transformations:
-          request:
-            set:
-              x-priority: 'coalesce(json(request.body).priority, "standard")'
-        cors:
-          allowOrigins:
-            - "*"
-          allowHeaders:
-            - "*"
-    # Standard priority route (default)
-    - matches:
-      - path:
-          pathPrefix: "/"
-      backends:
-      - ai:
-          name: openai-standard
-          provider:
-            openAI:
-              model: gpt-4o-mini
-      policies:
-        backendAuth:
-          key: "$OPENAI_API_KEY"
-        transformations:
-          request:
-            set:
-              x-priority: 'coalesce(json(request.body).priority, "standard")'
-        cors:
-          allowOrigins:
-            - "*"
-          allowHeaders:
-            - "*"
-EOF
-```
-
-{{< callout type="info" >}}
-The `coalesce()` function returns the first non-null value from its arguments. This provides a default value if the field is missing, preventing errors when the custom field is not included in requests.
-{{< /callout >}}
-
-Test the routing by sending requests with different priority values:
-
-```sh
-# High priority request - routes to gpt-4o
-curl 'http://0.0.0.0:3000/' \
---header 'Content-Type: application/json' \
---data '{
-  "model": "gpt-4o",
-  "priority": "high",
-  "messages": [{"role": "user", "content": "Urgent request"}]
-}' | jq -r '.model'
-```
-
-```sh
-# Standard priority request - routes to gpt-4o-mini
-curl 'http://0.0.0.0:3000/' \
---header 'Content-Type: application/json' \
---data '{
-  "model": "gpt-4o",
-  "messages": [{"role": "user", "content": "Normal request"}]
-}' | jq -r '.model'
-```
-
-## Known limitations
-
-When implementing content-based routing, be aware of these limitations:
-
-- **Route order matters**: Routes are evaluated in the order they appear in the configuration. Place more specific routes (with header matches) before generic routes (without matches) to ensure proper routing.
-- **Performance impact**: Extracting fields from the request body adds processing overhead. For high-throughput scenarios, consider using header-based routing when possible.
-- **JSON parsing**: The `json()` CEL function requires valid JSON. Malformed JSON in the request body will cause routing failures.
-
-## Next steps
-
-- Learn about [transformations](../../configuration/traffic-management/transformations/) for more advanced request manipulation
-- Set up [backend routing](../../configuration/routes/) for multiple backends
-- Configure [rate limiting]({{< link-hextra path="/llm/virtual-keys/" >}}) to control costs per route
diff --git a/content/docs/standalone/main/llm/costs.md b/content/docs/standalone/main/llm/costs.md
index d3edbe797..0f3a80508 100644
--- a/content/docs/standalone/main/llm/costs.md
+++ b/content/docs/standalone/main/llm/costs.md
@@ -6,146 +6,142 @@ test:
   costs:
   - file: content/docs/standalone/main/llm/costs.md
     path: costs
+aliases:
+  - /llm/spending/
 ---
 
-Agentgateway can compute the realized USD cost of each LLM request when you provide a model cost catalog. With a catalog in place, agentgateway attributes cost per request in access logs, traces, and metrics, and exposes the values to CEL expressions as `llm.cost` and `llm.costRates`.
+Agentgateway can track LLM spend by mapping each request's provider, model, and token counts to per-token pricing.
 
-Agentgateway does not ship a built-in catalog. Costs are computed only when you configure one (for example, a catalog that you generate with [`agctl costs import`](#generate-a-catalog-with-agctl)).
+Agentgateway extracts token usage from supported LLM APIs automatically. To convert those token counts into cost, configure a model cost catalog. The catalog maps provider and model names to pricing data so agentgateway can attach realized USD cost to logs, traces, metrics, and CEL expressions.
 
-## Before you begin
+{{< callout type="info" >}}
+Cost analysis is best-effort and may not exactly match your provider bill in scenarios such as price changes, custom pricing, failed requests, or provider-specific billing rules.
+{{< /callout >}}
 
-{{< reuse "agw-docs/snippets/prereq-agentgateway.md" >}}
+## Configure a model catalog
 
+Use `config.modelCatalog` to load one or more model cost catalog files. Catalog entries are merged in order, and later entries take precedence. This lets you start with an imported public catalog and then layer local overrides for contracted pricing, internal models, or provider-specific aliases.
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
 
-## Step 1: Prepare a catalog
+config:
+  modelCatalog:
+  - file: ./costs/catalog.json
 
-Prepare a catalog by creating your own JSON file or using the `agctl costs import` command.
+llm:
+  models:
+  - name: "*"
+    provider: openAI
+    params:
+      apiKey: "$OPENAI_API_KEY"
+```
 
-### Catalog JSON format
+Run agentgateway with the config file.
 
-{{< reuse "agw-docs/snippets/model-catalog-json-format.md" >}}
+```sh
+agentgateway -f config.yaml
+```
 
-### Generate a catalog with agctl
+After the catalog is loaded, priced requests include cost data. The access log includes `agw.ai.usage.cost.total`, and CEL exposes cost data as `llm.cost` and `llm.costRates`.
 
-Use `agctl costs import` to generate a catalog JSON file, then reference that file from `config.modelCatalog` or `MODEL_CATALOG_PATHS`.
+For general LLM telemetry setup, see [Observe traffic]({{< link-hextra path="/llm/observability/" >}}).
 
-1. Generate a catalog from a supported source. By default, `agctl costs import` imports every provider that the proxy supports from [models.dev](https://models.dev).
+## Import costs with agctl
 
-   ```sh
-   agctl costs import --pretty --out ./catalog.json
-   ```
+Use `agctl costs import` to generate a catalog file from a supported pricing source. The default source is `models.dev`.
 
-2. To import only a subset of providers, pass a comma-separated list to `--providers`.
+```sh
+mkdir -p costs
+agctl costs import --out ./costs/catalog.json
+```
 
-   ```sh
-   agctl costs import --pretty --providers openai,anthropic --out ./catalog.json
-   ```
+To keep the catalog smaller, import only the providers that you use.
 
-3. Reference the generated file from your configuration with `config.modelCatalog[].file` or `MODEL_CATALOG_PATHS`, then run agentgateway.
+```sh
+agctl costs import \
+  --source models.dev \
+  --providers anthropic,google,openai \
+  --out ./costs/catalog.json
+```
 
-For all options, see the [`agctl costs import`]({{< link-hextra path="/reference/agctl/agctl-costs-import/" >}}) reference.
+For all flags, see the [`agctl costs import`]({{< link-hextra path="/reference/agctl/agctl-costs-import/" >}}) reference.
 
-## Step 2: Configure catalog sources
+## Import costs in the UI
 
-Configure one or more catalog sources for agentgateway with the `config.modelCatalog` config section. Sources are merged in order, with later sources taking precedence at the model level.
+You can also import model costs from the Admin UI.
 
-### Load a catalog from a file
+1. Open the [Admin UI cost page](http://localhost:15000/ui/llm/costs).
+2. Press **Refresh base costs**.
 
-The `file` field is a path to a catalog JSON file. Agentgateway watches the file and reloads it when it changes.
+The UI fetches the latest base costs and configures `modelCatalog`. You can refresh again later to pull updated pricing and model data.
 
-```yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-config:
-  modelCatalog:
-  - file: ./catalog.json
-```
+When you set up a fresh configuration for the first time, the UI automatically performs this step.
 
-### Embed a catalog inline
+## Override catalog entries
 
-The `inline` field is a string that contains the catalog JSON.
+If your provider pricing differs from the imported public catalog, add another catalog file after the imported one. Later catalog sources override earlier sources.
 
 ```yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
 config:
   modelCatalog:
-  - inline: |
-      {
-        "providers": {
-          "openai": {
-            "models": {
-              "gpt-4o-mini": {
-                "rates": { "input": "0.15", "output": "0.6", "cacheRead": "0.075" }
-              }
-            }
-          }
-        }
-      }
+  - file: ./costs/catalog.json
+  - file: ./costs/overrides.json
 ```
 
-### Load catalog files with an environment variable
+Use overrides for contracted pricing, internally hosted models, or models that do not appear in the imported catalog.
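+
+The override behavior can be pictured with a short sketch. It assumes that a later source replaces a whole model entry rather than individual rate fields; the `merge_catalogs` helper and the override prices are hypothetical and only illustrate the ordering, not agentgateway's actual loader.
+
+```python
+def merge_catalogs(catalogs):
+    """Merge catalog dicts in order; later model entries replace earlier ones."""
+    merged = {"providers": {}}
+    for catalog in catalogs:
+        for provider, entry in catalog.get("providers", {}).items():
+            models = merged["providers"].setdefault(provider, {"models": {}})["models"]
+            # Assumption: the whole model entry is replaced, not single rate fields.
+            models.update(entry.get("models", {}))
+    return merged
+
+
+# Imported public pricing (same shape as the catalog JSON format).
+base = {"providers": {"openai": {"models": {
+    "gpt-4o-mini": {"rates": {"input": "0.15", "output": "0.6"}}}}}}
+# Hypothetical contracted pricing that should win over the imported entry.
+overrides = {"providers": {"openai": {"models": {
+    "gpt-4o-mini": {"rates": {"input": "0.10", "output": "0.4"}}}}}}
+
+effective = merge_catalogs([base, overrides])
+print(effective["providers"]["openai"]["models"]["gpt-4o-mini"]["rates"])
+```
+
+Listing `overrides.json` last in `modelCatalog` plays the same role as passing `overrides` last in this sketch.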
 
-You can also load one or more catalog files with the `MODEL_CATALOG_PATHS` environment variable, set to a comma-separated list of file paths. The environment variable is useful for container deployments where you mount a catalog file and enable it without editing the main configuration file.
+You can also load one or more catalog files with the `MODEL_CATALOG_PATHS` environment variable. Set it to a comma-separated list of file paths.
 
 ```sh
-MODEL_CATALOG_PATHS=./catalog.json,./overrides.json agentgateway -f config.yaml
+MODEL_CATALOG_PATHS=./costs/catalog.json,./costs/overrides.json agentgateway -f config.yaml
 ```
 
 {{< callout type="warning" >}}
-When `MODEL_CATALOG_PATHS` is set, it **replaces** any `config.modelCatalog` sources; the two are not merged. Use one mechanism or the other.
+When `MODEL_CATALOG_PATHS` is set, it replaces any `config.modelCatalog` sources. Use one mechanism or the other.
 {{< /callout >}}
 
-## Step 3: Configure cost policies
+## Use cost data
 
-Use cost data in CEL, logs, traces, and metrics policies.
+When a request matches an entry in the catalog, agentgateway populates these CEL fields:
 
-When a request matches an entry in the catalog, agentgateway populates the following CEL fields:
+- `llm.cost`: The realized USD cost of the request. Includes `total` plus per-token-type components such as `input`, `output`, `cacheRead`, `cacheWrite`, `reasoning`, `inputAudio`, and `outputAudio`. Unset when the model cannot be priced.
+- `llm.costRates`: The effective USD-per-1,000,000-token rates that were applied. Includes the same per-token-type fields when available. Unset when the model cannot be priced.
 
-- `llm.cost`: The realized USD cost of the request. Includes `total` plus per-token-type components: `input`, `output`, `cacheRead`, `cacheWrite`, `reasoning`, `inputAudio`, and `outputAudio`. Unset when the model cannot be priced.
-- `llm.costRates`: The effective USD-per-1,000,000-token rates that were applied, after tier selection. Includes the same per-token-type fields when available. Unset when the model cannot be priced.
+The request access log includes `agw.ai.usage.cost.total` for every LLM request that has a cost available.
+Traces always include the full breakdown:
+* `agw.ai.usage.cost.total`
+* `agw.ai.usage.cost.input`
+* `agw.ai.usage.cost.output`
+* `agw.ai.usage.cost.cache_read`
+* `agw.ai.usage.cost.cache_write`
+* `agw.ai.usage.cost.reasoning`
+* `agw.ai.usage.cost.input_audio`
+* `agw.ai.usage.cost.output_audio`
 
-The request access log always includes `agw.ai.usage.cost.total` for LLM requests (it is `0` when the model cannot be priced). To add the breakdown or rate fields, reference them with CEL in access logs, traces, or metrics:
+Because these values are loaded into the CEL context, you can also emit them explicitly:
 
 ```yaml
 # yaml-language-server: $schema=https://agentgateway.dev/schema/config
 frontendPolicies:
   accessLog:
     add:
-      llm.cost.total: 'llm.cost.total'
-      llm.cost.input: 'llm.cost.input'
-      llm.cost.output: 'llm.cost.output'
-      llm.cost.cacheRead: 'llm.cost.cacheRead'
-  tracing:
-    attributes:
-      llm.cost.total: 'llm.cost.total'
-      llm.costRates.input: 'llm.costRates.input'
-      llm.costRates.output: 'llm.costRates.output'
-
-config:
-  metrics:
-    fields:
-      add:
-        llm.cost.total: 'llm.cost.total'
-        llm.costRates.input: 'llm.costRates.input'
+       # Add the input cost
+       input_cost: llm.cost.input
+       # Add ALL cost variables, as `cost.input`, `cost.output`, etc.
+       cost: flatten(llm.cost)
 ```
 
-A priced request produces an access log line that includes the cost fields:
+A priced request produces an access log entry that includes cost data.
 
-```
+```console
 ... protocol=llm gen_ai.provider.name=openai gen_ai.request.model=gpt-4o-mini
 gen_ai.usage.input_tokens=14 gen_ai.usage.output_tokens=6 agw.ai.usage.cost.total=0.0000057 ...
 ```
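+
+The total in this log line can be checked by hand. The following is a minimal sketch, assuming that cost is the token count times the per-1,000,000-token rate, summed over token types, and that the `gpt-4o-mini` rates match the sample catalog on this page (`input` 0.15, `output` 0.6). The `request_cost` helper is hypothetical and not part of agentgateway, whose real calculation may handle cached or reasoning tokens differently.
+
+```python
+from decimal import Decimal
+
+PER_MILLION = Decimal(1_000_000)
+
+
+def request_cost(tokens, rates):
+    """Sum tokens * rate / 1,000,000 over token types that have a rate."""
+    return sum(
+        (Decimal(count) * Decimal(rates[kind]) / PER_MILLION
+         for kind, count in tokens.items() if kind in rates),
+        Decimal(0),
+    )
+
+
+# Rates are strings, as in the catalog JSON, to avoid float rounding.
+rates = {"input": "0.15", "output": "0.6", "cacheRead": "0.075"}
+# Token counts taken from the sample log line above.
+tokens = {"input": 14, "output": 6}
+
+total = request_cost(tokens, rates)
+print(total)
+```
+
+With 14 input tokens and 6 output tokens, the sum is 0.0000021 + 0.0000036 = 0.0000057 USD, which matches `agw.ai.usage.cost.total` in the log line.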
 
-For more examples, see [Observe traffic]({{< link-hextra path="/llm/observability/" >}}) and the [CEL reference]({{< link-hextra path="/reference/cel/cel-context" >}}).
+## Monitor catalog lookups
 
-## Step 4: Generate traffic
-
-Generate traffic through agentgateway that matches a model entry from the catalog. For example steps, try the [LLM getting started]({{< link-hextra path="/quickstart/llm/" >}}).
-
-## Step 5: Monitor catalog lookups
-
-Every cost lookup increments the `agentgateway_cost_catalog_lookups_total` counter, labeled with the lookup `status` and the request's `gen_ai_system` (provider), `gen_ai_request_model`, and `gen_ai_response_model`. Use the lookup to confirm that your catalog prices your traffic.
-
-The `status` label is one of the following values:
+Every cost lookup increments the `agentgateway_cost_catalog_lookups_total` counter. The metric is labeled with lookup `status`, provider, request model, and response model.
 
 | Status | Meaning |
 |--------|---------|
@@ -154,18 +150,54 @@ The `status` label is one of the following values:
 | `Missing` | The provider or model was not found in the catalog. |
 | `NoCatalog` | No catalog is configured. |
 
-For example, the metrics endpoint at `http://localhost:15020/metrics` shows lines such as the following:
-
-agentgateway_cost_catalog_lookups_total{status="Exact",gen_ai_system="openai",gen_ai_request_model="gpt-4o-mini",...} 1
-agentgateway_cost_catalog_lookups_total{status="Missing",gen_ai_system="openai",gen_ai_request_model="gpt-3.5-turbo",...} 1
-```
-
 A rising `Missing` or `Unpriced` count means requests are flowing through models that your catalog does not price. Add the missing providers or models to your catalog and reload.
 
 {{< callout type="info" >}}
 In traces, the corresponding cost-resolution `status` attribute uses lowercase values: `exact`, `unpriced`, `missing`, and `noCatalog`.
 {{< /callout >}}
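+
+To watch the `Missing` and `Unpriced` share over time, you can total the counter by `status`. The following is a rough sketch that parses Prometheus text exposition lines with regular expressions; the `lookups_by_status` helper and the sample values are illustrative assumptions, and a production setup would more likely query the metric from your monitoring system.
+
+```python
+import re
+from collections import Counter
+
+METRIC = "agentgateway_cost_catalog_lookups_total"
+SAMPLE = re.compile(r"^" + METRIC + r"\{([^}]*)\}\s+(\S+)$")
+LABEL = re.compile(r'(\w+)="([^"]*)"')
+
+
+def lookups_by_status(metrics_text):
+    """Total the lookup counter per `status` label value."""
+    counts = Counter()
+    for line in metrics_text.splitlines():
+        match = SAMPLE.match(line.strip())
+        if not match:
+            continue  # comments, blank lines, and other metrics
+        labels = dict(LABEL.findall(match.group(1)))
+        counts[labels.get("status", "unknown")] += float(match.group(2))
+    return counts
+
+
+# Illustrative sample, not real output.
+sample = """
+# TYPE agentgateway_cost_catalog_lookups_total counter
+agentgateway_cost_catalog_lookups_total{status="Exact",gen_ai_system="openai",gen_ai_request_model="gpt-4o-mini"} 3
+agentgateway_cost_catalog_lookups_total{status="Missing",gen_ai_system="openai",gen_ai_request_model="gpt-3.5-turbo"} 1
+"""
+
+counts = lookups_by_status(sample)
+unpriced_share = (counts["Missing"] + counts["Unpriced"]) / sum(counts.values())
+print(dict(counts), unpriced_share)
+```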
 
+## Enforce budgets
+
+The model catalog provides pricing data for spend visibility. To block or throttle traffic, combine cost visibility with rate limiting or virtual key management.
+
+- Use [Rate limiting]({{< link-hextra path="/configuration/resiliency/rate-limits/" >}}) to cap request or token usage per route, user, or API key.
+- Use [Virtual keys]({{< link-hextra path="/llm/virtual-keys/" >}}) to issue keys with per-key controls and attribution.
+
+## Advanced: Catalog format
+
+Usually, you do not need to write catalog JSON by hand. Use `agctl costs import` or the Admin UI to generate the base catalog, then add overrides only when needed.
+
+{{< reuse "agw-docs/snippets/model-catalog-json-format.md" >}}
+
+The following minimal example prices one OpenAI model and one tiered Gemini model.
+
+```json
+{
+  "providers": {
+    "openai": {
+      "models": {
+        "gpt-4o-mini": {
+          "rates": { "input": "0.15", "output": "0.6", "cacheRead": "0.075" }
+        }
+      }
+    },
+    "gcp.gemini": {
+      "models": {
+        "gemini-2.5-pro": {
+          "rates": { "input": "1.25", "output": "10", "cacheRead": "0.125" },
+          "tiers": [
+            {
+              "contextOver": 200000,
+              "rates": { "input": "2.5", "output": "15", "cacheRead": "0.25" }
+            }
+          ]
+        }
+      }
+    }
+  }
+}
+```
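+
+Tier selection for the Gemini entry can be sketched as follows. This is an illustration only: it assumes that a tier applies when the request context is strictly greater than `contextOver`, that the highest matching threshold wins, and that the tier's `rates` replace the base `rates` as a whole. The `select_rates` helper is hypothetical.
+
+```python
+def select_rates(model_entry, context_tokens):
+    """Return the rates for the highest tier whose threshold is exceeded."""
+    rates = model_entry["rates"]
+    tiers = sorted(model_entry.get("tiers", []), key=lambda t: t["contextOver"])
+    for tier in tiers:
+        # Assumption: strictly greater than the threshold selects the tier.
+        if context_tokens > tier["contextOver"]:
+            rates = tier["rates"]
+    return rates
+
+
+# The `gemini-2.5-pro` entry from the example catalog above.
+gemini = {
+    "rates": {"input": "1.25", "output": "10", "cacheRead": "0.125"},
+    "tiers": [
+        {
+            "contextOver": 200000,
+            "rates": {"input": "2.5", "output": "15", "cacheRead": "0.25"},
+        }
+    ],
+}
+
+print(select_rates(gemini, 150_000)["input"])
+print(select_rates(gemini, 250_000)["input"])
+```
+
+Under these assumptions, a 150,000-token context is priced at the base rates and a 250,000-token context at the tier rates.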
+
 {{< doc-test paths="costs" >}}
 # Verify that agentgateway loads a catalog from a file source.
 cat > /tmp/costs-catalog.json <<'EOF'
diff --git a/content/docs/standalone/main/llm/providers/_index.md b/content/docs/standalone/main/llm/providers/_index.md
index 46f0e4544..c72510262 100644
--- a/content/docs/standalone/main/llm/providers/_index.md
+++ b/content/docs/standalone/main/llm/providers/_index.md
@@ -9,34 +9,15 @@ Learn how to configure agentgateway for a particular LLM {{< gloss "Provider" >}
 
 ## First-class providers
 
-Use the dedicated provider pages when agentgateway already knows the upstream base URL and request format. This list includes Anthropic, OpenAI, and many OpenAI-compatible providers.
+Use the dedicated provider pages when agentgateway already knows the upstream base URL and request format. This list includes Anthropic, OpenAI, and many more!
 
-## OpenAI-compatible fallback
+## Custom providers
 
-Use [OpenAI-compatible]({{< link-hextra path="/llm/providers/openai-compatible/" >}}) only for providers that do not have a first-class shortcut, such as Perplexity, vLLM, LM Studio, or another service that exposes the OpenAI API format.
-
-### Override the upstream base URL
-
-When you need a custom upstream endpoint, set `params.baseUrl` on the model instead of older host or path override fields.
-
-```yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-
-llm:
-  models:
-  - name: "*"
-    provider: openAI
-    auth:
-      key:
-        value: "$PERPLEXITY_API_KEY"
-    params:
-      baseUrl: "https://api.perplexity.ai"
-    tls: {}
-```
+Use [Custom providers]({{< link-hextra path="/llm/providers/custom/" >}}) only for providers that do not have a first-class shortcut, such as Perplexity, vLLM, LM Studio, or another service that exposes a compatible [API format](../api-types).
 
 ## Authentication
 
-For simplified `llm` configuration, upstream provider authentication is configured per model via `llm.models[].auth`. In routing-based configurations, use `policies.backendAuth` on a route instead.
+For simplified `llm` configuration, upstream provider authentication is configured per model via `llm.models[]` (typically `params.apiKey` for API-key providers, and `auth` for cloud-native flows). In routing-based configurations, use `policies.backendAuth` on a route instead.
 
 ### API key
 
@@ -47,9 +28,8 @@ llm:
   models:
   - name: "*"
     provider: openAI
-    auth:
-      key:
-        value: "$OPENAI_API_KEY"
+    params:
+      apiKey: "$OPENAI_API_KEY"
 ```
 
 Use `auth.key.location` only when a provider needs the credential somewhere other than its default location. For example, Azure often uses `api-key`:
@@ -58,10 +38,10 @@ Use `auth.key.location` only when a provider needs the credential somewhere othe
 llm:
   models:
   - name: "*"
-    provider: azure
+    provider: custom
     auth:
       key:
-        value: "$AZURE_API_KEY"
+        value: "$API_KEY"
         location:
           header:
             name: api-key
@@ -109,4 +89,8 @@ llm:
 
 ## Standalone upstream TLS
 
-Use `llm.models[].tls` to configure TLS when connecting to an upstream provider. You might use this configuration to trust a private CA when using a self-hosted HTTPS endpoint. Common fields include `root` for a trusted CA bundle, `hostname` and `subjectAltNames` for upstream identity checks, `cert` and `key` for client certificates, and `keyExchangeGroups` for TLS negotiation. In agentgateway versions prior to 1.3, this model-level setting was called `backendTLS`.
+Use `llm.models[].tls` to configure advanced TLS when connecting to an upstream provider.
+When you use built-in providers, default TLS settings are used.
+When you use a custom `baseUrl`, the `https://` scheme automatically enables TLS.
+
+However, if you need advanced configuration such as client certificates or customized verification steps, you can set fields such as `root` for a trusted CA bundle, `hostname` and `subjectAltNames` for upstream identity checks, and `cert` and `key` for client certificates.
diff --git a/content/docs/standalone/main/llm/providers/anthropic.md b/content/docs/standalone/main/llm/providers/anthropic.md
index 906970632..6b5e38445 100644
--- a/content/docs/standalone/main/llm/providers/anthropic.md
+++ b/content/docs/standalone/main/llm/providers/anthropic.md
@@ -1,6 +1,6 @@
 ---
 title: Anthropic
-weight: 50
+weight: 15
 description: Configuration and setup for Anthropic Claude provider
 ---
 
@@ -36,7 +36,7 @@ llm:
 
 After running agentgateway with the configuration from the previous section, you can send a request to the `v1/messages` endpoint. Agentgateway automatically adds the `x-api-key` authorization and `anthropic-version` headers to the request. The request is forwarded to the Anthropic API and the response is returned to the client.
 
-```json
+```sh
 curl -X POST http://localhost:4000/v1/messages \
   -H "Content-Type: application/json" \
   -d '{
@@ -98,7 +98,7 @@ Example response:
 }
 ```
 
-## Extended thinking and reasoing
+## Extended thinking and reasoning
 
 Extended thinking and reasoning lets Claude reason through complex problems before generating a response. You can opt in to extended thinking and reasoning by adding specific parameters to your request. 
 
@@ -130,7 +130,7 @@ The following values are supported:
 The following example request uses adaptive extended thinking. Note that this setting requires the `output_config.effort` field to be set too. 
 
 ```sh
-curl "localhost:3000/v1/messages" -H content-type:application/json -d '{
+curl "localhost:4000/v1/messages" -H content-type:application/json -d '{
   "model": "",
   "max_tokens": 1024,
   "thinking": {
@@ -181,7 +181,7 @@ Structured outputs constrain the model to respond with a specific JSON schema. Y
 Provide the JSON schema definition in the `output_config.format` field. 
 
 ```sh
-curl "localhost:3000/v1/messages" -H content-type:application/json -d '{
+curl "localhost:4000/v1/messages" -H content-type:application/json -d '{
   "model": "",
   "max_tokens": 256,
   "output_config": {
@@ -234,15 +234,11 @@ Example output:
 
 [Claude Platform on AWS](https://docs.aws.amazon.com/claude-platform/latest/userguide/welcome.html) hosts Anthropic's native Messages API on AWS infrastructure at `aws-external-anthropic.{region}.api.aws`. Because the API is the same Anthropic Messages API, you point the `anthropic` provider at the AWS endpoint and choose either API-key or AWS SigV4 authentication.
 
-<!--TODO 1.3 release -->
-{{< callout type="info" >}}
-Before you begin, [install agentgateway with the nightly build]({{< link-hextra path="/quickstart/">}}).
-{{< /callout >}}
-
 {{< tabs tabTotal="2" items="API key, AWS SigV4" >}}
 {{% tab tabName="API key" %}}
 
-Store your Anthropic-on-AWS API key in a file and reference it from the provider configuration. Override the upstream host to point at the Claude Platform endpoint.
+Store your Claude Platform on AWS API key in an environment variable or file and reference it from the provider configuration.
+Set `params.baseUrl` to point at the Claude Platform endpoint.
 
 ```yaml
 # yaml-language-server: $schema=https://agentgateway.dev/schema/config
@@ -253,25 +249,20 @@ llm:
     provider: anthropic
     requestHeaders:
       set:
+        # Replace with your workspace ID
         anthropic-workspace-id: wrkspc_XXXXX
     params:
-      awsRegion: us-west-2
-      hostOverride: aws-external-anthropic.us-west-2.api.aws:443
-      pathPrefix: /v1
-    auth:
-      key:
-        value:
-          file: $HOME/.secrets/anthropic-aws
-    tls: {}
+      apiKey: $ANTHROPIC_AWS_API_KEY
+      # Replace with your region
+      baseUrl: https://aws-external-anthropic.us-west-2.api.aws/v1
 ```
 
-| Setting | Description |
-|---------|-------------|
+| Setting                                     | Description |
+|---------------------------------------------|-------------|
 | `requestHeaders.set.anthropic-workspace-id` | The Anthropic workspace ID that scopes the request. Replace `wrkspc_XXXXX` with your workspace ID. |
-| `params.hostOverride` | The Claude Platform endpoint host and port. Use the form `aws-external-anthropic.{region}.api.aws:443`. |
-| `params.pathPrefix` | The Anthropic API path prefix on Claude Platform, set to `/v1`. |
-| `auth.key.value.file` | A path to a file that contains the API key. |
-| `tls: {}` | Enables TLS to the upstream host. Required because Claude Platform is served over HTTPS. |
+| `params.baseUrl`                            | The Claude Platform endpoint URL, including the `/v1` path prefix. Use the form `https://aws-external-anthropic.{region}.api.aws/v1`. |
+| `params.apiKey`                             | The Claude Platform on AWS API key. The example reads it from the `ANTHROPIC_AWS_API_KEY` environment variable. |
 
 {{% /tab %}}
 {{% tab tabName="AWS SigV4" %}}
diff --git a/content/docs/standalone/main/llm/providers/azure.md b/content/docs/standalone/main/llm/providers/azure.md
index 5d56e4b45..7961ec69d 100644
--- a/content/docs/standalone/main/llm/providers/azure.md
+++ b/content/docs/standalone/main/llm/providers/azure.md
@@ -1,6 +1,6 @@
 ---
 title: Azure
-weight: 60
+weight: 15
 description: Configuration and setup for Azure AI services provider
 ---
 
@@ -8,7 +8,7 @@ Configure Microsoft Azure AI as an LLM provider in agentgateway.
 
 ## Authentication
 
-Before you can use Azure as an LLM provider, you must authenticate by using one of the standard [Azure authentication methods](https://learn.microsoft.com/en-us/azure/ai-services/authentication). In standalone mode, this authentication is configured via `llm.models[].auth` (for example, `auth.azure.implicit` or `auth.key`). In routing-based configurations, use `policies.backendAuth.azure`.
+Before you can use Azure as an LLM provider, you must authenticate by using one of the standard [Azure authentication methods](https://learn.microsoft.com/en-us/azure/ai-services/authentication). In standalone mode, this authentication is configured with `llm.models[]` fields (for example, `params.apiKey` or `auth.azure`). In routing-based configurations, use `policies.backendAuth.azure`.
 
 ## Configuration
 
@@ -29,9 +29,6 @@ llm:
   models:
   - name: "*"
     provider: azure
-    auth:
-      azure:
-        implicit: {}
     params:
       azureResourceName: "your-resource-name"
       azureResourceType: foundry
@@ -68,9 +65,6 @@ llm:
   models:
   - name: "gpt-4.1"
     provider: azure
-    auth:
-      azure:
-        implicit: {}
     params:
       azureResourceName: "your-resource-name"
       azureResourceType: openAI
@@ -90,7 +84,7 @@ llm:
 | `params.azureProjectName` | The Foundry project name. Required for `foundry` type. If omitted, defaults to `azureResourceName`. |
 | `params.azureApiVersion` | Optional API version override. Defaults to `v1`. For legacy deployments, use a dated version like `2024-04-01-preview`. |
 | `params.model` | The specific Azure model to use. If set, this model is used for all requests. If not set, the request must include the model to use. |
-| `auth` | Authentication for the upstream Azure endpoint. Use `auth.azure` for Entra ID auth, or `auth.key.value` for API key auth. Set `auth.key.location.header.name: api-key` if needed. |
+| `params.apiKey` | The Azure API key for authentication. If unset, implicit Entra ID authentication is used. You can reference environment variables using the `$VAR_NAME` syntax. |
 
 ## Advanced configuration
 
@@ -110,13 +104,6 @@ binds:
     - matches:
       - path:
           pathPrefix: /azure
-      policies:
-        urlRewrite:
-          authority: auto
-        backendAuth:
-          azure:
-            implicit: {}
-        backendTLS: {}
       backends:
       - ai:
           name: azure
@@ -147,8 +134,6 @@ binds:
       - path:
           pathPrefix: /azure
       policies:
-        urlRewrite:
-          authority: auto
         backendAuth:
           azure:
             explicitConfig:
@@ -156,7 +141,6 @@ binds:
                 tenantId: "<your-tenant-id>"
                 clientId: "<your-client-id>"
                 clientSecret: "<your-client-secret>"
-        backendTLS: {}
       backends:
       - ai:
           name: azure
@@ -198,7 +182,6 @@ binds:
                 tenantId: "<your-tenant-id>"
                 clientId: "<your-client-id>"
                 clientSecret: "<your-client-secret>"
-        backendTLS: {}
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
@@ -235,7 +218,6 @@ binds:
           azure:
             explicitConfig:
               managedIdentity: {}
-        backendTLS: {}
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
@@ -278,7 +260,6 @@ binds:
                   # OR use objectId or resourceId instead
                   # objectId: "your-managed-identity-object-id"
                   # resourceId: "/subscriptions/.../resourceGroups/.../providers/Microsoft.ManagedIdentity/userAssignedIdentities/..."
-        backendTLS: {}
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/baseten.md b/content/docs/standalone/main/llm/providers/baseten.md
index a70a5b000..1258efcf6 100644
--- a/content/docs/standalone/main/llm/providers/baseten.md
+++ b/content/docs/standalone/main/llm/providers/baseten.md
@@ -1,6 +1,6 @@
 ---
 title: Baseten
-weight: 61
+weight: 20
 description: Configuration and setup for Baseten LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: baseten
     params:
       apiKey: "$BASETEN_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://inference.baseten.co/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
@@ -35,7 +33,7 @@ llm:
 
 ## Example request
 
-After running agentgateway with the configuration from the previous section, you can send an OpenAI-compatible request to the `v1/chat/completions` endpoint by replacing `<your-baseten-model-id>` with your Baseten model or deployment ID:
+After running agentgateway with the configuration from the previous section, you can send an OpenAI-compatible request to the `v1/chat/completions` endpoint by replacing `<your-baseten-model-id>` with your Baseten model or deployment ID:
 
 ```bash
 curl -X POST http://localhost:4000/v1/chat/completions \
diff --git a/content/docs/standalone/main/llm/providers/bedrock.md b/content/docs/standalone/main/llm/providers/bedrock.md
index e1116e8fc..1e7ff2ec8 100644
--- a/content/docs/standalone/main/llm/providers/bedrock.md
+++ b/content/docs/standalone/main/llm/providers/bedrock.md
@@ -1,20 +1,21 @@
 ---
 title: Amazon Bedrock
-weight: 40
+weight: 15
 description: Configuration and setup for Amazon Bedrock provider
 ---
 
 Configure Amazon Bedrock as an LLM provider in agentgateway.
 
 {{< callout type="info" >}}
-Agentgateway accepts only OpenAI-formatted requests (such as the `/v1/chat/completions` request body shape) and returns OpenAI-formatted responses, regardless of the route path that you configure. Agentgateway translates between OpenAI and Bedrock formats internally. Bedrock's native [Converse API](https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference-call.html) request and response shapes are not supported. Usage fields in responses follow the OpenAI shape (`prompt_tokens`, `completion_tokens`, `total_tokens`), not the Bedrock shape (`inputTokens`, `outputTokens`, `totalTokens`).
+Agentgateway accepts requests in one of the supported [API formats](../api-types) (such as the `/v1/chat/completions` request body shape) and returns responses in that format.
+Agentgateway translates between these formats and Bedrock formats internally using Bedrock's [Converse API](https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference-call.html).
+Native `Converse` or `Invoke` request shapes are not supported directly; see [passthrough](#passthrough) for more information if you need these APIs.
 {{< /callout >}}
 
 ## Authentication
 
 Before you can use Bedrock as an LLM provider, you must authenticate by using the standard [AWS authentication sources](https://docs.aws.amazon.com/sdkref/latest/guide/creds-config-files.html).
-
-The default SigV4 service name for Bedrock is handled automatically, so you do not need to set `auth.aws.serviceName`.
+Agentgateway automatically detects ambient credentials in the local environment. To configure credentials explicitly, use `auth.aws`.
 
 ## Configuration
 
@@ -40,9 +41,107 @@ llm:
 | `params.model` | The specific Bedrock model to use. If set, this model is used for all requests. If not set, the request must include the model to use. |
 | `params.awsRegion` | The AWS region where the Bedrock model is hosted. |
 
+## Passthrough
+
+If your applications directly use the AWS `Converse` or `Invoke` APIs, agentgateway cannot translate these APIs to other providers.
+However, it can pass the requests through to Bedrock itself by following the [passthrough](../api-types/passthrough) approach.
+
+Passthrough still provides telemetry data for these requests.
+
+First, set up passthrough mode:
+
+```yaml
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+llm:
+  models:
+  - name: us.anthropic*
+    provider: bedrock
+    params:
+      awsRegion: us-west-2
+    passthrough: detect
+```
+
+Then, you can send native Converse and Invoke requests:
+
+{{< tabs items="Converse,Invoke" >}}
+{{% tab %}}
+
+```python
+import boto3
+
+client = boto3.client(
+    'bedrock-runtime',
+    region_name='us-west-2',
+    endpoint_url='http://localhost:4000',
+)
+response = client.converse(
+    modelId='us.anthropic.claude-sonnet-4-6',
+    messages=[
+        {
+            'role': 'user',
+            'content': [{'text': 'give 1 word answer'}]
+        }
+    ]
+)
+print('converse response:')
+print(response)
+```
+
+{{% /tab %}}
+{{% tab %}}
+
+```python
+import json
+
+import boto3
+
+client = boto3.client(
+    'bedrock-runtime',
+    region_name='us-west-2',
+    endpoint_url='http://localhost:4000',
+)
+response = client.invoke_model(
+    modelId='us.anthropic.claude-sonnet-4-6',
+    body=json.dumps({
+        'anthropic_version': 'bedrock-2023-05-31',
+        'max_tokens': 10,
+        'messages': [
+            {
+                'role': 'user',
+                'content': [{'type': 'text', 'text': 'give 1 word answer'}],
+            }
+        ],
+    }),
+)
+body = json.loads(response['body'].read())
+
+print('invoke response:')
+print(body)
+```
+
+{{% /tab %}}
+{{< /tabs >}}
+
+
+{{< callout type="info" >}}
+Model translations are not supported with passthrough, so avoid using a model match like `aws/*`, as it cannot be transformed.
+{{< /callout >}}
+
+## Claude Platform on AWS
+
+To connect to [Claude Platform on AWS](https://docs.aws.amazon.com/claude-platform/latest/userguide/welcome.html), see the [Anthropic provider page](../anthropic/#use-claude-platform-on-aws).
+
+## Bedrock Mantle
+
+The [Bedrock Mantle](https://docs.aws.amazon.com/bedrock/latest/userguide/bedrock-mantle.html) endpoint is not currently supported.
+Follow the [GitHub issue](https://github.com/agentgateway/agentgateway/issues/2041) if you are interested!
+
 ## Token counting
 
-Bedrock supports token counting for Anthropic models via the `count_tokens` endpoint. Agentgateway automatically handles the required formatting for Bedrock's count-tokens endpoint, including adding the `max_tokens: 1` parameter and Base64 encoding the request body.
+Bedrock supports token counting for Anthropic models via the `count_tokens` endpoint.
+Agentgateway automatically handles the required formatting for Bedrock's count-tokens endpoint.
 
 ```bash
 curl -X POST http://localhost:4000/v1/messages/count_tokens \
@@ -81,7 +180,7 @@ Use the `reasoning_effort` field to control how much reasoning the model applies
 Note that `max_tokens` must be greater than the thinking budget, and the minimum thinking budget is 1,024 tokens.
 
 ```sh
-curl "localhost:3000/v1/chat/completions" -H content-type:application/json -d '{
+curl "localhost:4000/v1/chat/completions" -H content-type:application/json -d '{
   "model": "",
   "max_tokens": 6000,
   "reasoning_effort": "high",
@@ -99,7 +198,7 @@ curl "localhost:3000/v1/chat/completions" -H content-type:application/json -d '{
 Structured outputs constrain the model to respond with a specific JSON schema. Provide the schema definition in the OpenAI `response_format` field of your request. Agentgateway translates this to Bedrock's native format automatically.
 
 ```sh
-curl "localhost:3000/v1/chat/completions" -H content-type:application/json -d '{
+curl "localhost:4000/v1/chat/completions" -H content-type:application/json -d '{
   "model": "",
   "max_tokens": 256,
   "response_format": {
diff --git a/content/docs/standalone/main/llm/providers/cerebras.md b/content/docs/standalone/main/llm/providers/cerebras.md
index aac497e27..da0566da6 100644
--- a/content/docs/standalone/main/llm/providers/cerebras.md
+++ b/content/docs/standalone/main/llm/providers/cerebras.md
@@ -1,6 +1,6 @@
 ---
 title: Cerebras
-weight: 61
+weight: 20
 description: Configuration and setup for Cerebras LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: cerebras
     params:
       apiKey: "$CEREBRAS_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.cerebras.ai/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/cohere.md b/content/docs/standalone/main/llm/providers/cohere.md
index ea1f641ab..25b91207a 100644
--- a/content/docs/standalone/main/llm/providers/cohere.md
+++ b/content/docs/standalone/main/llm/providers/cohere.md
@@ -1,6 +1,6 @@
 ---
 title: Cohere
-weight: 61
+weight: 20
 description: Configuration and setup for Cohere LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: cohere
     params:
       apiKey: "$COHERE_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.cohere.ai"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/custom.md b/content/docs/standalone/main/llm/providers/custom.md
new file mode 100644
index 000000000..27c44d982
--- /dev/null
+++ b/content/docs/standalone/main/llm/providers/custom.md
@@ -0,0 +1,74 @@
+---
+title: Custom
+weight: 99
+description: Configure agentgateway for providers without built-in support that implement the OpenAI API format.
+aliases: /llm/providers/openai-compatible
+test:
+  openai-compatible-validate:
+  - file: content/docs/standalone/main/llm/providers/openai-compatible.md
+    path: openai-compat-validate
+---
+
+Use this page for providers that implement the OpenAI API format but do not have first-class `provider:` support yet. For built-in providers such as [Baseten]({{< link-hextra path="/llm/providers/baseten/" >}}), [Cerebras]({{< link-hextra path="/llm/providers/cerebras/" >}}), [Cohere]({{< link-hextra path="/llm/providers/cohere/" >}}), [DeepInfra]({{< link-hextra path="/llm/providers/deepinfra/" >}}), [DeepSeek]({{< link-hextra path="/llm/providers/deepseek/" >}}), [Fireworks AI]({{< link-hextra path="/llm/providers/fireworks/" >}}), [Groq]({{< link-hextra path="/llm/providers/groq/" >}}), [Hugging Face]({{< link-hextra path="/llm/providers/huggingface/" >}}), [Mistral]({{< link-hextra path="/llm/providers/mistral/" >}}), [OpenRouter]({{< link-hextra path="/llm/providers/openrouter/" >}}), [Together AI]({{< link-hextra path="/llm/providers/togetherai/" >}}), [xAI]({{< link-hextra path="/llm/providers/xai/" >}}), and [Ollama]({{< link-hextra path="/llm/providers/ollama/" >}}), use the dedicated provider pages instead.
+
+{{< callout type="info" >}}
+Many providers offer "OpenAI compatible" or "Anthropic compatible" endpoints.
+While these _can_ be used with `provider: openai` or `provider: anthropic` and a customized `baseUrl`, prefer `provider: custom`.
+
+Using a specific vendor's provider may introduce semantics specific to that provider.
+{{< /callout >}}
+
+## Before you begin
+
+{{< reuse "agw-docs/snippets/prereq-agentgateway.md" >}}
+
+You also need the following prerequisites.
+
+- An API key for your chosen provider, unless you are pointing to a local endpoint such as vLLM or LM Studio.
+
+{{< doc-test paths="openai-compat-validate" >}}
+# Install agentgateway binary for testing
+mkdir -p "$HOME/.local/bin"
+export PATH="$HOME/.local/bin:$PATH"
+VERSION="v{{< reuse "agw-docs/versions/n-patch.md" >}}"
+BINARY_URL="https://github.com/agentgateway/agentgateway/releases/download/${VERSION}/agentgateway-$(uname -s | tr '[:upper:]' '[:lower:]')-$(uname -m | sed 's/x86_64/amd64/')"
+curl -sL "$BINARY_URL" -o "$HOME/.local/bin/agentgateway"
+chmod +x "$HOME/.local/bin/agentgateway"
+
+# Set placeholder API keys for validation (--validate-only still resolves env vars)
+export PERPLEXITY_API_KEY="${PERPLEXITY_API_KEY:-test}"
+{{< /doc-test >}}
+
+## Configuring a custom provider
+
+With a custom provider, you specify the API endpoint and the list of API formats that it supports.
+Agentgateway automatically translates between the incoming request format and the formats that the provider supports.
+
+The following example connects to [Perplexity](https://www.perplexity.ai/), which exposes an OpenAI-compatible API for search-augmented models and does not currently have a first-class provider.
+
+```yaml {paths="openai-compat-validate"}
+cat > /tmp/test-perplexity.yaml << 'EOF'
+# yaml-language-server: $schema=https://agentgateway.dev/schema/config
+llm:
+  models:
+  - name: "*"
+    provider:
+      custom:
+        formats:
+          # Indicate this provider supports the completions API. With no `path` specified, this defaults to <baseUrl>/chat/completions
+          - type: completions
+          # Indicate this provider supports the messages API, on a custom path /messages-api
+          # - type: messages
+          #   path: /messages-api
+          # All possible APIs:
+          # - type: embeddings
+          # - type: responses
+          # - type: realtime
+          # - type: anthropicTokenCount
+          # - type: rerank
+    params:
+      apiKey: "$PERPLEXITY_API_KEY"
+      model: llama-3.1-sonar-large-128k-online
+      baseUrl: "https://api.perplexity.ai"
+EOF
+```
diff --git a/content/docs/standalone/main/llm/providers/deepinfra.md b/content/docs/standalone/main/llm/providers/deepinfra.md
index d5fc7137d..71106cb9b 100644
--- a/content/docs/standalone/main/llm/providers/deepinfra.md
+++ b/content/docs/standalone/main/llm/providers/deepinfra.md
@@ -1,6 +1,6 @@
 ---
 title: DeepInfra
-weight: 61
+weight: 20
 description: Configuration and setup for DeepInfra LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: deepinfra
     params:
       apiKey: "$DEEPINFRA_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.deepinfra.com/v1/openai"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/deepseek.md b/content/docs/standalone/main/llm/providers/deepseek.md
index e2cec7310..1daa94936 100644
--- a/content/docs/standalone/main/llm/providers/deepseek.md
+++ b/content/docs/standalone/main/llm/providers/deepseek.md
@@ -1,6 +1,6 @@
 ---
 title: DeepSeek
-weight: 61
+weight: 20
 description: Configuration and setup for DeepSeek LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: deepseek
     params:
       apiKey: "$DEEPSEEK_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.deepseek.com/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/fireworks.md b/content/docs/standalone/main/llm/providers/fireworks.md
index 94f47e990..5ed6e1db6 100644
--- a/content/docs/standalone/main/llm/providers/fireworks.md
+++ b/content/docs/standalone/main/llm/providers/fireworks.md
@@ -1,6 +1,6 @@
 ---
 title: Fireworks AI
-weight: 61
+weight: 20
 description: Configuration and setup for Fireworks AI LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: fireworks
     params:
       apiKey: "$FIREWORKS_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.fireworks.ai/inference/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/gemini.md b/content/docs/standalone/main/llm/providers/gemini.md
index 71cd3926b..b85202463 100644
--- a/content/docs/standalone/main/llm/providers/gemini.md
+++ b/content/docs/standalone/main/llm/providers/gemini.md
@@ -1,6 +1,6 @@
 ---
 title: Gemini
-weight: 30
+weight: 15
 description: Configuration and setup for Google Gemini provider
 ---
 
@@ -17,9 +17,8 @@ llm:
   models:
   - name: "*"
     provider: gemini
-    auth:
-      key:
-        value: "$GEMINI_API_KEY"
+    params:
+      apiKey: "$GEMINI_API_KEY"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
@@ -29,4 +28,4 @@ llm:
 | `name` | The model name to match in incoming requests. When a client sends `"model": "<name>"`, the request is routed to this provider. Use `*` to match any model name. |
 | `provider` | The LLM provider, set to `gemini` for Google Gemini models. |
 | `params.model` | The specific Gemini model to use. If set, this model is used for all requests. If not set, the request must include the model to use. |
-| `auth.key.value` | The Gemini API key for authentication. You can reference environment variables using the `$VAR_NAME` syntax. |
+| `params.apiKey` | The Gemini API key for authentication. You can reference environment variables using the `$VAR_NAME` syntax. |
diff --git a/content/docs/standalone/main/llm/providers/groq.md b/content/docs/standalone/main/llm/providers/groq.md
index 06ee1d92d..0e238e22f 100644
--- a/content/docs/standalone/main/llm/providers/groq.md
+++ b/content/docs/standalone/main/llm/providers/groq.md
@@ -1,6 +1,6 @@
 ---
 title: Groq
-weight: 61
+weight: 20
 description: Configuration and setup for Groq LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: groq
     params:
       apiKey: "$GROQ_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.groq.com/openai/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/huggingface.md b/content/docs/standalone/main/llm/providers/huggingface.md
index fffe17733..944b5c575 100644
--- a/content/docs/standalone/main/llm/providers/huggingface.md
+++ b/content/docs/standalone/main/llm/providers/huggingface.md
@@ -1,6 +1,6 @@
 ---
 title: Hugging Face
-weight: 61
+weight: 20
 description: Configuration and setup for Hugging Face LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: huggingface
     params:
       apiKey: "$HUGGINGFACE_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://router.huggingface.co/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/mistral.md b/content/docs/standalone/main/llm/providers/mistral.md
index e4dd0d44a..180eb077f 100644
--- a/content/docs/standalone/main/llm/providers/mistral.md
+++ b/content/docs/standalone/main/llm/providers/mistral.md
@@ -1,6 +1,6 @@
 ---
 title: Mistral
-weight: 61
+weight: 20
 description: Configuration and setup for Mistral LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: mistral
     params:
       apiKey: "$MISTRAL_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.mistral.ai/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/multiple-llms.md b/content/docs/standalone/main/llm/providers/multiple-llms.md
index 940cb12ea..22fc90672 100644
--- a/content/docs/standalone/main/llm/providers/multiple-llms.md
+++ b/content/docs/standalone/main/llm/providers/multiple-llms.md
@@ -1,96 +1,33 @@
 ---
 title: Multiple LLM providers
-weight: 90
+weight: 30
 description: Configure load balancing across multiple LLM providers.
 ---
 
-Create a group of LLM providers for the same route. agentgateway automatically load balances requests across the providers in the group using the **Power of Two Choices (P2C)** algorithm. This algorithm picks two random providers, scores each one based on health, latency, and pending requests, and routes the request to the higher-scoring provider. All providers in a single group are treated as equally preferred — P2C distributes traffic across healthy providers but does not implement failover.
-
-**Load balancing vs. failover:** The single-group configuration on this page is load balancing, not failover. Failover requires multiple priority groups and a health/eviction policy. When all providers in a priority group are evicted (for example, due to repeated errors or rate limiting), the gateway automatically routes to the next priority group. For a failover example, see the [Kubernetes deployment of agentgateway](https://agentgateway.dev/docs/kubernetes/latest/llm/failover/).
-
-The P2C algorithm provides better performance than simple round-robin, random, or least-connections strategies by adapting in real-time to each provider's health and performance characteristics.
-
-## Reusable providers in simplified LLM mode
-
-For simplified `llm` configuration, you can define named provider defaults once in `llm.providers[]` and reference them from multiple `llm.models[]` entries with `provider.reference`. This is different from the previous group example. Here, the reusable provider acts as a preset, not as a load-balancing pool.
+For simplified `llm` configuration, you can define named provider defaults once in `llm.providers[]` and reference them from multiple `llm.models[]` entries with `provider.reference`.
 
 ```yaml
 llm:
   providers:
-  - name: openai-default
-    provider: openAI
+  - name: openai-prod
+    provider: openai
     params:
       apiKey: "$OPENAI_API_KEY"
-  - name: openai-backup
-    provider: openAI
-    params:
-      apiKey: "$OPENAI_BACKUP_API_KEY"
 
   models:
   - name: fast
     provider:
-      reference: openai-default
+      reference: openai-prod
     params:
       model: gpt-4o-mini
   - name: smart
     provider:
-      reference: openai-backup
-    params:
-      model: gpt-4o
-```
-
-When a model references a named provider with `provider.reference`, provider defaults are reused automatically. Keep shared settings on `llm.providers[]`, and only override `params.model` on the model itself.
-
-```yaml
-llm:
-  providers:
-  - name: openai-default
-    provider: openAI
-    params:
-      apiKey: "$OPENAI_API_KEY"
-
-  models:
-  - name: smart
-    provider:
-      reference: openai-default
+      reference: openai-prod
     params:
       model: gpt-4o
 ```
 
 In this example, `smart` inherits the upstream API key from `llm.providers[]` and only changes the model name.
 
-Named providers can hold shared upstream settings you want to reuse, such as authentication, host overrides, path overrides, or other model defaults. Keep the shared values on `llm.providers[]` and only set per-model differences on `llm.models[]`.
-
-## Configuration
-
-{{< callout type="info" >}}
-Provider groups with load balancing require the traditional `binds/listeners/routes` configuration format. For more information, see the [Routing-based configuration guide]({{< link-hextra path="/llm/configuration-modes/" >}}).
-{{< /callout >}}
-
-{{< reuse "agw-docs/snippets/review-configuration.md" >}} The example sets two providers, OpenAI and Gemini. Each provider can have its own individual settings, such as host and path overrides, API keys, backend TLS, and more.
-
-```yaml
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-binds:
-- port: 3000
-  listeners:
-  - routes:
-    - backends:
-      - ai:
-          groups:
-          - providers: 
-            - name: openai
-              provider:
-                openAI:
-                  # Optional; overrides the model in requests
-                  model: gpt-3.5-turbo
-              backendAuth:
-                key: "$OPENAI_API_KEY"
-            - name: gemini
-              provider:
-                gemini:
-                  # Optional; overrides the model in requests
-                  model: gemini-1.5-flash-latest
-              backendAuth:
-                key: "$GEMINI_API_KEY"
-```
+Named providers can hold shared upstream settings you want to reuse, such as authentication, host overrides, path overrides, or other model defaults.
+Keep the shared values on `llm.providers[]` and only set per-model differences on `llm.models[]`.
diff --git a/content/docs/standalone/main/llm/providers/ollama.md b/content/docs/standalone/main/llm/providers/ollama.md
index f848c7abd..0c99b4b5f 100644
--- a/content/docs/standalone/main/llm/providers/ollama.md
+++ b/content/docs/standalone/main/llm/providers/ollama.md
@@ -26,7 +26,6 @@ chmod +x "$HOME/.local/bin/agentgateway"
 # Write and validate the ollama config from the guide
 cat > /tmp/test-ollama-standalone.yaml << 'EOF'
 llm:
-  port: 3000
   models:
   - name: "*"
     provider: ollama
diff --git a/content/docs/standalone/main/llm/providers/openai-compatible.md b/content/docs/standalone/main/llm/providers/openai-compatible.md
deleted file mode 100644
index 8e60fa933..000000000
--- a/content/docs/standalone/main/llm/providers/openai-compatible.md
+++ /dev/null
@@ -1,141 +0,0 @@
----
-title: OpenAI-compatible providers
-weight: 10
-description: Configure agentgateway for providers without built-in support that implement the OpenAI API format.
-test:
-  openai-compatible-validate:
-  - file: content/docs/standalone/main/llm/providers/openai-compatible.md
-    path: openai-compat-validate
----
-
-Use this page for providers that implement the OpenAI API format but do not have a first-class `provider:` shortcut yet. For built-in providers such as [Baseten]({{< link-hextra path="/llm/providers/baseten/" >}}), [Cerebras]({{< link-hextra path="/llm/providers/cerebras/" >}}), [Cohere]({{< link-hextra path="/llm/providers/cohere/" >}}), [DeepInfra]({{< link-hextra path="/llm/providers/deepinfra/" >}}), [DeepSeek]({{< link-hextra path="/llm/providers/deepseek/" >}}), [Fireworks AI]({{< link-hextra path="/llm/providers/fireworks/" >}}), [Groq]({{< link-hextra path="/llm/providers/groq/" >}}), [Hugging Face]({{< link-hextra path="/llm/providers/huggingface/" >}}), [Mistral]({{< link-hextra path="/llm/providers/mistral/" >}}), [OpenRouter]({{< link-hextra path="/llm/providers/openrouter/" >}}), [Together AI]({{< link-hextra path="/llm/providers/togetherai/" >}}), [xAI]({{< link-hextra path="/llm/providers/xai/" >}}), and [Ollama]({{< link-hextra path="/llm/providers/ollama/" >}}), use the dedicated provider pages instead.
-
-If you need a different upstream endpoint for one of those built-in standalone providers, keep the first-class `provider:` value and set `params.baseUrl` on that provider instead of switching to `provider: openAI`.
-
-In standalone mode, configure upstream authentication per model with `llm.models[].auth` and upstream TLS with `llm.models[].tls`. For an overview of the available auth and TLS options, see [Providers]({{< link-hextra path="/llm/providers/" >}}).
-
-## Before you begin
-
-{{< reuse "agw-docs/snippets/prereq-agentgateway.md" >}}
-
-You also need the following prerequisites.
-
-- An API key for your chosen provider, unless you are pointing to a local endpoint such as vLLM or LM Studio.
-
-{{< doc-test paths="openai-compat-validate" >}}
-# Install agentgateway binary for testing
-mkdir -p "$HOME/.local/bin"
-export PATH="$HOME/.local/bin:$PATH"
-VERSION="v{{< reuse "agw-docs/versions/n-patch.md" >}}"
-BINARY_URL="https://github.com/agentgateway/agentgateway/releases/download/${VERSION}/agentgateway-$(uname -s | tr '[:upper:]' '[:lower:]')-$(uname -m | sed 's/x86_64/amd64/')"
-curl -sL "$BINARY_URL" -o "$HOME/.local/bin/agentgateway"
-chmod +x "$HOME/.local/bin/agentgateway"
-
-# Set placeholder API keys for validation (--validate-only still resolves env vars)
-export PERPLEXITY_API_KEY="${PERPLEXITY_API_KEY:-test}"
-{{< /doc-test >}}
-
-## Managed provider fallback
-
-### Perplexity
-
-[Perplexity](https://www.perplexity.ai/) exposes an OpenAI-compatible API for search-augmented models and does not currently have a first-class standalone provider shortcut.
-
-```yaml {paths="openai-compat-validate"}
-cat > /tmp/test-perplexity.yaml << 'EOF'
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-llm:
-  port: 3000
-  models:
-  - name: "*"
-    provider: openAI
-    auth:
-      key:
-        value: "$PERPLEXITY_API_KEY"
-    params:
-      model: llama-3.1-sonar-large-128k-online
-      baseUrl: "https://api.perplexity.ai"
-    tls: {}
-EOF
-```
-
-{{< doc-test paths="openai-compat-validate" >}}
-agentgateway -f /tmp/test-perplexity.yaml --validate-only
-{{< /doc-test >}}
-
-## Self-hosted OpenAI-compatible endpoints
-
-### vLLM
-
-[vLLM](https://github.com/vllm-project/vllm) is a high-performance model server for self-hosted OpenAI-compatible inference.
-
-```yaml {paths="openai-compat-validate"}
-cat > /tmp/test-vllm.yaml << 'EOF'
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-llm:
-  port: 3000
-  models:
-  - name: "*"
-    provider: openAI
-    params:
-      baseUrl: "http://localhost:8000/v1"
-EOF
-```
-
-{{< doc-test paths="openai-compat-validate" >}}
-agentgateway -f /tmp/test-vllm.yaml --validate-only
-{{< /doc-test >}}
-
-If your vLLM server uses HTTPS, set `params.baseUrl` to the HTTPS endpoint and add `tls: {}` to the model configuration. (In agentgateway versions prior to 1.3, this model-level setting was called `backendTLS`.)
-
-### LM Studio
-
-[LM Studio](https://lmstudio.ai/) runs models locally and exposes an OpenAI-compatible API for desktop testing.
-
-```yaml {paths="openai-compat-validate"}
-cat > /tmp/test-lmstudio.yaml << 'EOF'
-# yaml-language-server: $schema=https://agentgateway.dev/schema/config
-llm:
-  port: 3000
-  models:
-  - name: llama-3.2-90b
-    provider: openAI
-    params:
-      baseUrl: "http://localhost:1234/v1"
-EOF
-```
-
-{{< doc-test paths="openai-compat-validate" >}}
-agentgateway -f /tmp/test-lmstudio.yaml --validate-only
-{{< /doc-test >}}
-
-Enable the local server in LM Studio: **Settings** > **Local Server** > **Start Server**.
-
-## Generic configuration
-
-Use the following template for any OpenAI-compatible provider without built-in support:
-
-```yaml
-llm:
-  port: 3000
-  models:
-  - name: "*"
-    provider: openAI
-    auth:
-      key:
-        value: "$PROVIDER_API_KEY"
-    params:
-      model: "<upstream-model-name>"
-      baseUrl: "https://provider.example.com/v1"
-    tls: {}  # only for HTTPS providers
-```
-
-Set `params.baseUrl` to the provider's API root. This can include provider-specific prefixes such as `/v1`, `/openai/v1`, or another base path. If the provider already has a first-class page, use that provider shortcut and its documented default base URL instead.
-
-| Field | Description |
-|-------|-------------|
-| `provider` | Set to `openAI` for OpenAI-compatible providers without a first-class shortcut. |
-| `auth.key.value` | Optional. The API key for the provider. Reference environment variables with the `$VAR_NAME` syntax. Omit for local endpoints that do not require authentication. |
-| `params.model` | Optional. Override the upstream model name. Omit to pass the client-provided model through. |
-| `params.baseUrl` | The provider's API root URL, including scheme and any required base path prefix. |
-| `tls` | Enable TLS for the upstream connection. Required for HTTPS providers, omit for local HTTP providers. (In agentgateway versions prior to 1.3, this model-level setting was called `backendTLS`.) |
diff --git a/content/docs/standalone/main/llm/providers/openai.md b/content/docs/standalone/main/llm/providers/openai.md
index 36c2564c8..25adb8270 100644
--- a/content/docs/standalone/main/llm/providers/openai.md
+++ b/content/docs/standalone/main/llm/providers/openai.md
@@ -17,9 +17,8 @@ llm:
   models:
   - name: "*"
     provider: openAI
-    auth:
-      key:
-        value: "$OPENAI_API_KEY"
+    params:
+      apiKey: "$OPENAI_API_KEY"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
@@ -29,7 +28,7 @@ llm:
 | `name` | The model name to match in incoming requests. When a client sends `"model": "<name>"`, the request is routed to this provider. Use `*` to match any model name. |
 | `provider` | The LLM provider, set to `openAI` for OpenAI models. |
 | `params.model` | The specific OpenAI model to use. If set, this model is used for all requests. If not set, the request must include the model to use. |
-| `auth.key.value` | The OpenAI API key for authentication. You can reference environment variables using the `$VAR_NAME` syntax. |
+| `params.apiKey` | The OpenAI API key for authentication. You can reference environment variables using the `$VAR_NAME` syntax. |
 
 {{< callout type="info" >}}
 For advanced routing scenarios that require path-based routing or custom endpoints, use the traditional `binds/listeners/routes` configuration format. See the [Routing-based configuration guide]({{< link-hextra path="/llm/configuration-modes/" >}}) for more information.
diff --git a/content/docs/standalone/main/llm/providers/openrouter.md b/content/docs/standalone/main/llm/providers/openrouter.md
index 55a3efa5f..692793ee2 100644
--- a/content/docs/standalone/main/llm/providers/openrouter.md
+++ b/content/docs/standalone/main/llm/providers/openrouter.md
@@ -1,6 +1,6 @@
 ---
 title: OpenRouter
-weight: 61
+weight: 20
 description: Configuration and setup for OpenRouter LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: openrouter
     params:
       apiKey: "$OPENROUTER_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://openrouter.ai/api/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/togetherai.md b/content/docs/standalone/main/llm/providers/togetherai.md
index 052a6a43e..3a9a427ae 100644
--- a/content/docs/standalone/main/llm/providers/togetherai.md
+++ b/content/docs/standalone/main/llm/providers/togetherai.md
@@ -1,6 +1,6 @@
 ---
 title: Together AI
-weight: 61
+weight: 20
 description: Configuration and setup for Together AI LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: togetherai
     params:
       apiKey: "$TOGETHER_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.together.xyz/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/providers/vertex.md b/content/docs/standalone/main/llm/providers/vertex.md
index 1938b4968..63cb62e5d 100644
--- a/content/docs/standalone/main/llm/providers/vertex.md
+++ b/content/docs/standalone/main/llm/providers/vertex.md
@@ -1,6 +1,6 @@
 ---
 title: Vertex AI
-weight: 20
+weight: 15
 description: Configuration and setup for Google Cloud Vertex AI provider
 ---
 
@@ -25,9 +25,6 @@ llm:
   models:
   - name: gemini-2.5-flash
     provider: vertex
-    auth:
-      gcp:
-        type: accessToken
     params:
       model: google/gemini-2.5-flash-lite-preview-06-17
       vertexProject: my-project-id
diff --git a/content/docs/standalone/main/llm/providers/xai.md b/content/docs/standalone/main/llm/providers/xai.md
index 41e926f7c..6b4bd2edb 100644
--- a/content/docs/standalone/main/llm/providers/xai.md
+++ b/content/docs/standalone/main/llm/providers/xai.md
@@ -1,6 +1,6 @@
 ---
 title: xAI
-weight: 61
+weight: 20
 description: Configuration and setup for xAI (Grok) LLM provider
 ---
 
@@ -19,8 +19,6 @@ llm:
     provider: xai
     params:
       apiKey: "$XAI_API_KEY"
-      # Optional. If omitted, agentgateway uses the default:
-      # baseUrl: "https://api.x.ai/v1"
 ```
 
 {{< reuse "agw-docs/snippets/review-configuration.md" >}}
diff --git a/content/docs/standalone/main/llm/spending.md b/content/docs/standalone/main/llm/spending.md
index 92c947f52..e69de29bb 100644
--- a/content/docs/standalone/main/llm/spending.md
+++ b/content/docs/standalone/main/llm/spending.md
@@ -1,9 +0,0 @@
----
-title: Control spend
-weight: 50
-description: Control cost with token budgets and spend limits to prevent unexpected bills and LLM misuse.
-aliases:
-  - /llm/spending/
----
-
-{{< redirect path="/llm/costs/" >}}
diff --git a/release.md b/release.md
new file mode 100644
index 000000000..faac71ee7
--- /dev/null
+++ b/release.md
@@ -0,0 +1,182 @@
+🎉 Welcome to the 1.3.0 release of the agentgateway project!
+
+This release is a major step forward for LLM, MCP, and agentic traffic. Agentgateway v1.3.0 adds a purpose-built UI, AI cost analysis, virtual models, reusable providers and guardrails, 13 new LLM providers, richer MCP support, and many improvements across traffic policy, TLS, telemetry, and operations.
+
+## Artifacts
+
+**Docker images** are available:
+* `cr.agentgateway.dev/agentgateway:v1.3.0`
+* `cr.agentgateway.dev/controller:v1.3.0`
+
+**Helm charts** are available:
+* `cr.agentgateway.dev/charts/agentgateway:v1.3.0`
+* `cr.agentgateway.dev/charts/agentgateway-crds:v1.3.0`
+
+**Binaries** are available below.
+
+## Quick Start
+
+Follow the [Kubernetes](https://agentgateway.dev/docs/kubernetes/latest/quickstart/) or [Standalone](https://agentgateway.dev/docs/standalone/latest/quickstart/) quick start guide to get started.
+
+## 🔥 Breaking changes
+
+### `agctl` commands reorganized under `proxy` and `controller`
+
+The experimental `agctl` CLI now groups its inspection, tracing, and management commands under the `proxy` and `controller` parent commands, and adds commands for log-level management and version information. Update any scripts or automation that call the previous top-level commands.
+
+Kubernetes examples:
+
+Before:
+
+```sh
+agctl config all gateway/agentgateway-proxy -n agentgateway-system -o yaml
+agctl config backends gateway/agentgateway-proxy -n agentgateway-system
+agctl trace gateway/agentgateway-proxy -n agentgateway-system --port 80 -- http://www.example.com/
+```
+
+Now:
+
+```sh
+agctl proxy config all gateway/agentgateway-proxy -n agentgateway-system -o yaml
+agctl proxy config backends gateway/agentgateway-proxy -n agentgateway-system
+agctl proxy trace gateway/agentgateway-proxy -n agentgateway-system --port 80 -- http://www.example.com/
+```
+
+Standalone examples:
+
+Before:
+
+```sh
+agctl config all --file /tmp/agw-dump.json -o yaml
+agctl trace --local --port 3000 -- http://example.com/headers
+```
+
+Now:
+
+```sh
+agctl proxy config all --file /tmp/agw-dump.json -o yaml
+agctl proxy trace --local --port 3000 -- http://example.com/headers
+```
+
+The reorganization also introduces the following capabilities:
+
+- `agctl version` prints version information for the `agctl` CLI.
+- `agctl proxy log` gets or sets the proxy log level at runtime.
+- `agctl controller log` gets or sets the agentgateway controller log level per component at runtime.
+
+For more information, see the Kubernetes docs for [installing `agctl`](https://agentgateway.dev/docs/kubernetes/main/operations/agctl/), [inspecting agentgateway configuration](https://agentgateway.dev/docs/kubernetes/main/operations/inspect-config/), [tracing requests with `agctl`](https://agentgateway.dev/docs/kubernetes/main/operations/trace-requests/), [debug logs](https://agentgateway.dev/docs/kubernetes/main/operations/debug/#debug-logs), and the [`agctl` CLI reference](https://agentgateway.dev/docs/kubernetes/main/reference/agctl/). For standalone mode, see [installing `agctl`](https://agentgateway.dev/docs/standalone/main/operations/agctl/), [inspecting agentgateway configuration](https://agentgateway.dev/docs/standalone/main/operations/inspect-config/), [tracing requests with `agctl`](https://agentgateway.dev/docs/standalone/main/operations/trace-requests/), and the [`agctl` CLI reference](https://agentgateway.dev/docs/standalone/main/reference/agctl/).
+
+## 🌟 New features
+
+### New UI for LLM, MCP, and traffic management
+
+Agentgateway now includes a rebuilt UI organized around three native views:
+
+- **LLM**: Models, providers, policies, guardrails, costs, virtual API keys, and analytics.
+- **MCP**: Servers, tools, resources, authentication, and MCP policy configuration.
+- **Traffic**: Gateway API traffic configuration and policy management.
+
+The UI includes onboarding for LLM, MCP, and API capabilities, model and provider setup, per-model policies, request and response guardrails, and unified logs for LLM, MCP, and A2A calls. For more information, see the [Kubernetes UI observability docs](https://agentgateway.dev/docs/kubernetes/main/observability/ui/) and the [LLM](https://agentgateway.dev/docs/kubernetes/main/llm/) and [MCP](https://agentgateway.dev/docs/kubernetes/main/mcp/) docs.
+
+### AI cost and token analysis
+
+Agentgateway can now calculate token usage and dollar cost for LLM requests, attribute usage, and surface the data in logs, traces, metrics, `agctl`, and the UI.
+
+Cost and token data can be grouped by model, provider, user, team, and client tool. This makes it possible to analyze spend, export reports, build chargeback workflows, and apply policy decisions such as budgets, alerts, quotas, or cost-sensitive routing at the gateway.
+
+For more information, see [Kubernetes LLM cost tracking](https://agentgateway.dev/docs/kubernetes/main/llm/cost-tracking/), [Standalone LLM spending](https://agentgateway.dev/docs/standalone/main/llm/spending/), and the [`agctl costs` reference](https://agentgateway.dev/docs/kubernetes/main/reference/agctl/agctl-costs/).
+
+### Virtual models
+
+Virtual models let clients send one model name while agentgateway chooses the real backend model at request time. This moves routing policy out of clients and into the gateway.
+
+Supported strategies include:
+
+- **Weighted routing** to split traffic across models for A/B testing, migrations, and cost optimization.
+- **Failover routing** to automatically retry requests against fallback models when a primary model fails or is rate-limited.
+- **Conditional routing** to select models with CEL expressions based on request attributes such as headers, user tier, or prompt shape.
+
+For more information, see [Standalone virtual models](https://agentgateway.dev/docs/standalone/main/llm/virtual-models/), [Kubernetes LLM load balancing](https://agentgateway.dev/docs/kubernetes/main/llm/load-balancing/), [Kubernetes LLM failover](https://agentgateway.dev/docs/kubernetes/main/llm/failover/), and [Kubernetes LLM content routing](https://agentgateway.dev/docs/kubernetes/main/llm/content-routing/).
+
+### Reusable providers and guardrails
+
+Providers and guardrails can now be defined once and referenced across many models. This simplifies large LLM deployments where many incoming model names share provider configuration, credentials, or policy.
+
+Standalone deployments can also declare shared guardrails as top-level resources instead of repeating guardrail configuration on every route. For more information, see [Standalone guardrails](https://agentgateway.dev/docs/standalone/main/llm/prompt-guards/overview/), [Standalone multi-layer guardrails](https://agentgateway.dev/docs/standalone/main/llm/prompt-guards/multi-layer/), and [Kubernetes guardrails](https://agentgateway.dev/docs/kubernetes/main/llm/guardrails/overview/).
+
+### New and improved LLM providers
+
+Agentgateway adds 13 new first-class LLM providers, including Mistral, Hugging Face, and Cohere, along with expanded custom provider support for providers without built-in integrations. For more information, see the [Standalone LLM provider docs](https://agentgateway.dev/docs/standalone/main/llm/providers/) and [Kubernetes LLM provider docs](https://agentgateway.dev/docs/kubernetes/main/llm/providers/).
+
+Additional LLM gateway improvements include:
+
+- Rerank request and response support across providers.
+- Custom LLM providers for InferencePool backends.
+- More precise per-model matching, with exact matches preferred.
+- Guardrails that apply to streaming requests.
+- Webhook guardrail `failureMode` support.
+- Per-model LLM authorization.
+- Local LLM TLS and CORS support.
+- Latency and throughput telemetry attributes on LLM requests.
+- Bedrock detect-passthrough support, Application Inference Profile prompt cache support, Anthropic beta-header allowlists, host override support, URL-encoded model IDs, and reasoning-signature replay.
+- Support for Anthropic system messages and extra-high thinking.
+
+### MCP improvements
+
+MCP support now includes Okta as a first-class authentication provider, MCP-aware external auth and external processing, resource subscribe and unsubscribe support, improved multiplexing behavior, and broader protocol compliance fixes.
+
+The UI also includes native MCP policy views for access control, traffic shaping, and mutation policies such as authorization, CORS, JWT, rate limiting, transformations, and external processing. For more information, see the [Kubernetes MCP docs](https://agentgateway.dev/docs/kubernetes/main/mcp/), [Standalone MCP docs](https://agentgateway.dev/docs/standalone/main/mcp/), [MCP authentication](https://agentgateway.dev/docs/kubernetes/main/mcp/auth/), and [MCP guardrails](https://agentgateway.dev/docs/kubernetes/main/mcp/guardrails/).
+
+### Request handling and extensibility
+
+Traffic policies can now buffer request bodies before forwarding, giving policies and extensions access to full request bodies before backend selection. For more information, see [Kubernetes body buffering](https://agentgateway.dev/docs/kubernetes/main/traffic-management/buffer/) and [Standalone body buffering](https://agentgateway.dev/docs/standalone/main/configuration/traffic-management/buffer/).
+
+External processing support is also expanded with richer processing-mode configuration, and external processors can return an immediate response from request-body and response-body phases. For more information, see [Kubernetes external processing](https://agentgateway.dev/docs/kubernetes/main/traffic-management/extproc/) and [Standalone external processing](https://agentgateway.dev/docs/standalone/main/configuration/traffic-management/extproc/).
+
+### Authentication and authorization
+
+Authorization can now run in the pre-routing phase. This release also adds external-auth caching with a cache TTL that can be configured as an expression, expanded credential-location expressions, and scheme derivation from `X-Forwarded-Proto`. For more information, see [Kubernetes external auth](https://agentgateway.dev/docs/kubernetes/main/security/extauth/), [Standalone external auth](https://agentgateway.dev/docs/standalone/main/configuration/security/external-authz/), [Standalone HTTP authorization](https://agentgateway.dev/docs/standalone/main/configuration/security/http-authz/), and [Standalone JWT authentication](https://agentgateway.dev/docs/standalone/main/configuration/security/jwt-authn/).
+
+### TLS, networking, and policy
+
+This release adds dynamic SSL certificates for Kubernetes listener TLS, generalized backend TLS and backend references, a new `BackendReferenceGrantMode`, configurable policy inheritance strategy, and composable AI backend policies. For more information, see [Kubernetes TLS encryption](https://agentgateway.dev/docs/kubernetes/main/install/tls/), [Kubernetes backend TLS](https://agentgateway.dev/docs/kubernetes/main/security/backendtls/), and [Standalone backend TLS](https://agentgateway.dev/docs/standalone/main/configuration/security/backend-tls/).
+
+Additional networking and policy improvements include terminating inbound CONNECT, configurable admin interfaces including Unix Domain Sockets, AWS AssumeRole support, custom AWS service names, and mTLS certificate passthrough with CEL.
+
+### CEL and `agctl`
+
+CEL support is expanded with helpers for URL encode/decode, timestamp conversions, bit operations on bytes, raw JWT token access, gRPC response status, expressions in direct responses, and CEL-based retry conditions. For more information, see the [Standalone CEL reference](https://agentgateway.dev/docs/standalone/main/reference/cel/).
+
+The `agctl` CLI now includes proxy and controller log commands, version reporting with mismatch checks, route groups in config output, and evicted-backend visibility.
+
+### Operations and observability
+
+Agentgateway now exposes proxy timing measurements, a config-synchronization metric, request and connection IDs for troubleshooting, and richer distributed traces with JSON mode, body snapshots, effective gateway and route policies, and raw-output file opening. For more information, see [Kubernetes observability](https://agentgateway.dev/docs/kubernetes/main/observability/), [Kubernetes tracing](https://agentgateway.dev/docs/kubernetes/main/observability/tracing/), [Standalone metrics](https://agentgateway.dev/docs/standalone/main/reference/observability/metrics/), and [Standalone traces](https://agentgateway.dev/docs/standalone/main/reference/observability/traces/).
+
+## 🪲 Notable fixes
+
+- Fixed TCP route precedence.
+- Fixed Gateway status handling when no listeners are valid.
+- Fixed route-level OIDC cookie handling.
+- Fixed capacity-weighted load balancing.
+- Fixed backend eviction retries.
+- Fixed streaming-completion capture across Bedrock, Messages, and Responses API paths.
+- Fixed credential-location expression behavior.
+- Fixed scheme handling from `X-Forwarded-Proto`.
+- Improved MCP multiplexing and list behavior.
+- Improved MCP protocol compliance across tools, prompts, and resources.
+
+## Contributors
+
+Thank you to everyone who contributed code, reviews, documentation, bug reports, and CI improvements for this release, including more than twenty first-time contributors.
+
+Special thanks to the contributors who drove many of the changes in this release:
+
+- @howardjohn
+- @stevenctl
+- @keithmattix
+- @danehans
+- @TwilightTechie
+- @filintod
+
+See the full contributor list below.
diff --git a/static/integrations/providers/anthropic.svg b/static/integrations/providers/anthropic.svg
new file mode 100644
index 000000000..5640eaca8
--- /dev/null
+++ b/static/integrations/providers/anthropic.svg
@@ -0,0 +1,3 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24" fill="currentColor" fill-rule="evenodd">
+  <path d="M13.827 3.52h3.603L24 20h-3.603l-6.57-16.48zm-7.258 0h3.767L16.906 20h-3.674l-1.343-3.461H5.017l-1.344 3.46H0L6.57 3.522zm4.132 9.959L8.453 7.687 6.205 13.48H10.7z"/>
+</svg>
diff --git a/static/integrations/providers/azure.svg b/static/integrations/providers/azure.svg
new file mode 100644
index 000000000..a8a297d96
--- /dev/null
+++ b/static/integrations/providers/azure.svg
@@ -0,0 +1,11 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24">
+  <path d="M7.242 1.613A1.11 1.11 0 018.295.857h6.977L8.03 22.316a1.11 1.11 0 01-1.052.755h-5.43a1.11 1.11 0 01-1.053-1.466L7.242 1.613z" fill="url(#azure-a)"/>
+  <path d="M18.397 15.296H7.4a.51.51 0 00-.347.882l7.066 6.595c.206.192.477.298.758.298h6.226l-2.706-7.775z" fill="#0078D4"/>
+  <path d="M15.272.857H7.497L0 23.071h7.775l1.596-4.73 5.068 4.73h6.665l-2.707-7.775h-7.998L15.272.857z" fill="url(#azure-b)"/>
+  <path d="M17.193 1.613a1.11 1.11 0 00-1.052-.756h-7.81.035c.477 0 .9.304 1.052.756l6.748 19.992a1.11 1.11 0 01-1.052 1.466h-.12 7.895a1.11 1.11 0 001.052-1.466L17.193 1.613z" fill="url(#azure-c)"/>
+  <defs>
+    <linearGradient gradientUnits="userSpaceOnUse" id="azure-a" x1="8.247" x2="1.002" y1="1.626" y2="23.03"><stop stop-color="#114A8B"/><stop offset="1" stop-color="#0669BC"/></linearGradient>
+    <linearGradient gradientUnits="userSpaceOnUse" id="azure-b" x1="14.042" x2="12.324" y1="15.302" y2="15.888"><stop stop-opacity=".3"/><stop offset=".071" stop-opacity=".2"/><stop offset=".321" stop-opacity=".1"/><stop offset=".623" stop-opacity=".05"/><stop offset="1" stop-opacity="0"/></linearGradient>
+    <linearGradient gradientUnits="userSpaceOnUse" id="azure-c" x1="12.841" x2="20.793" y1="1.626" y2="22.814"><stop stop-color="#3CCBF4"/><stop offset="1" stop-color="#2892DF"/></linearGradient>
+  </defs>
+</svg>
diff --git a/static/integrations/providers/baseten.svg b/static/integrations/providers/baseten.svg
new file mode 100644
index 000000000..ffd4fbd8b
--- /dev/null
+++ b/static/integrations/providers/baseten.svg
@@ -0,0 +1,3 @@
+<svg width="24" height="24" viewBox="0 0 24 24" fill="none" xmlns="http://www.w3.org/2000/svg">
+<path d="M4.97934 6.78009H15.6237V10.26H8.59995C8.57096 10.2595 8.54214 10.2647 8.51516 10.2753C8.48819 10.286 8.4636 10.3019 8.44283 10.3221C8.42205 10.3423 8.40551 10.3665 8.39415 10.3932C8.38279 10.4199 8.37684 10.4485 8.37665 10.4775V13.5225C8.37665 13.6465 8.47815 13.74 8.59995 13.74H15.6237V17.22H12.2264C12.1974 17.2194 12.1685 17.2246 12.1416 17.2352C12.1146 17.2459 12.09 17.2618 12.0692 17.282C12.0485 17.3023 12.0319 17.3264 12.0206 17.3531C12.0092 17.3798 12.0032 17.4085 12.0031 17.4375V20.4824C12.0031 20.6064 12.1053 20.6999 12.2264 20.6999H15.4004C15.4295 20.701 15.4585 20.6962 15.4857 20.6857C15.5129 20.6752 15.5377 20.6593 15.5586 20.639C15.5795 20.6187 15.596 20.5943 15.6072 20.5674C15.6184 20.5405 15.624 20.5116 15.6237 20.4824V17.22H19.0268C19.0558 17.2205 19.0846 17.2154 19.1116 17.2047C19.1385 17.194 19.1631 17.1781 19.1839 17.1579C19.2047 17.1377 19.2212 17.1135 19.2326 17.0868C19.2439 17.0601 19.2499 17.0315 19.2501 17.0025V13.9575C19.2501 13.8335 19.1486 13.74 19.0268 13.74H15.6237V10.26H19.0268C19.0558 10.2606 19.0846 10.2554 19.1116 10.2448C19.1385 10.2341 19.1631 10.2182 19.1839 10.198C19.2047 10.1777 19.2212 10.1536 19.2326 10.1269C19.2439 10.1002 19.2499 10.0715 19.2501 10.0425V6.99758C19.2501 6.87361 19.1486 6.78009 19.0268 6.78009H15.6237V3.51762C15.6237 3.39365 15.5222 3.30012 15.4004 3.30012H4.97934C4.92022 3.29895 4.86302 3.32112 4.82014 3.36183C4.77725 3.40255 4.75214 3.45852 4.75024 3.51762V6.56259C4.75024 6.68656 4.85174 6.78009 4.97934 6.78009Z" fill="currentColor"/>
+</svg>
diff --git a/static/integrations/providers/bedrock.svg b/static/integrations/providers/bedrock.svg
new file mode 100644
index 000000000..eef5460cc
--- /dev/null
+++ b/static/integrations/providers/bedrock.svg
@@ -0,0 +1,10 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24" width="20" height="20">
+  <defs>
+    <linearGradient id="bedrock-g" x1="80%" x2="20%" y1="20%" y2="80%">
+      <stop offset="0%" stop-color="#6350FB"/>
+      <stop offset="50%" stop-color="#3D8FFF"/>
+      <stop offset="100%" stop-color="#9AD8F8"/>
+    </linearGradient>
+  </defs>
+  <path d="M13.05 15.513h3.08c.214 0 .389.177.389.394v1.82a1.704 1.704 0 011.296 1.661c0 .943-.755 1.708-1.685 1.708-.931 0-1.686-.765-1.686-1.708 0-.807.554-1.484 1.297-1.662v-1.425h-2.69v4.663a.395.395 0 01-.188.338l-2.69 1.641a.385.385 0 01-.405-.002l-4.926-3.086a.395.395 0 01-.185-.336V16.3L2.196 14.87A.395.395 0 012 14.555L2 14.528V9.406c0-.14.073-.27.192-.34l2.465-1.462V4.448c0-.129.062-.249.165-.322l.021-.014L9.77 1.058a.385.385 0 01.407 0l2.69 1.675a.395.395 0 01.185.336V7.6h3.856V5.683a1.704 1.704 0 01-1.296-1.662c0-.943.755-1.708 1.685-1.708.931 0 1.685.765 1.685 1.708 0 .807-.553 1.484-1.296 1.662v2.311a.391.391 0 01-.389.394h-4.245v1.806h6.624a1.69 1.69 0 011.64-1.313c.93 0 1.685.764 1.685 1.707 0 .943-.754 1.708-1.685 1.708a1.69 1.69 0 01-1.64-1.314H13.05v1.937h4.953l.915 1.18a1.66 1.66 0 01.84-.227c.931 0 1.685.764 1.685 1.707 0 .943-.754 1.708-1.685 1.708-.93 0-1.685-.765-1.685-1.708 0-.346.102-.668.276-.937l-.724-.935H13.05v1.806zM9.973 1.856L7.93 3.122V6.09h-.778V3.604L5.435 4.669v2.945l2.11 1.36L9.712 7.61V5.334h.778V7.83c0 .136-.07.263-.184.335L7.963 9.638v2.081l1.422 1.009-.446.646-1.406-.998-1.53 1.005-.423-.66 1.605-1.055v-1.99L5.038 8.29l-2.26 1.34v1.676l1.972-1.189.398.677-2.37 1.429V14.3l2.166 1.258 2.27-1.368.397.677-2.176 1.311V19.3l1.876 1.175 2.365-1.426.398.678-2.017 1.216 1.918 1.201 2.298-1.403v-5.78l-4.758 2.893-.4-.675 5.158-3.136V3.289L9.972 1.856zM16.13 18.47a.913.913 0 00-.908.92c0 .507.406.918.908.918a.913.913 0 00.907-.919.913.913 0 00-.907-.92zm3.63-3.81a.913.913 0 00-.908.92c0 .508.406.92.907.92a.913.913 0 00.908-.92.913.913 0 00-.908-.92zm1.555-4.99a.913.913 0 00-.908.92c0 .507.407.918.908.918a.913.913 0 00.907-.919.913.913 0 00-.907-.92zM17.296 3.1a.913.913 0 00-.907.92c0 .508.406.92.907.92a.913.913 0 00.908-.92.913.913 0 00-.908-.92z" fill="url(#bedrock-g)" fill-rule="nonzero"/>
+</svg>
diff --git a/static/integrations/providers/cerebras.svg b/static/integrations/providers/cerebras.svg
new file mode 100644
index 000000000..ac57eb2df
--- /dev/null
+++ b/static/integrations/providers/cerebras.svg
@@ -0,0 +1,4 @@
+<svg width="24" height="24" viewBox="0 0 24 24" fill="none" xmlns="http://www.w3.org/2000/svg">
+  <path clip-rule="evenodd" d="M14.121 2.701a9.299 9.299 0 000 18.598V22.7c-5.91 0-10.7-4.791-10.7-10.701S8.21 1.299 14.12 1.299V2.7zm4.752 3.677A7.353 7.353 0 109.42 17.643l-.901 1.074a8.754 8.754 0 01-1.08-12.334 8.755 8.755 0 0112.335-1.08l-.901 1.075zm-2.255.844a5.407 5.407 0 00-5.048 9.563l-.656 1.24a6.81 6.81 0 016.358-12.043l-.654 1.24zM14.12 8.539a3.46 3.46 0 100 6.922v1.402a4.863 4.863 0 010-9.726v1.402z" fill="#F15A29" fill-rule="evenodd"/>
+  <path d="M15.407 10.836a2.24 2.24 0 00-.51-.409 1.084 1.084 0 00-.544-.152c-.255 0-.483.047-.684.14a1.58 1.58 0 00-.84.912c-.074.203-.11.416-.11.631 0 .218.036.43.11.631a1.594 1.594 0 00.84.913c.2.093.43.14.684.14.216 0 .417-.046.602-.135.188-.09.35-.225.475-.392l.928 1.006c-.14.14-.3.261-.482.363a3.367 3.367 0 01-1.083.38c-.17.026-.317.04-.44.04a3.315 3.315 0 01-1.182-.21 2.825 2.825 0 01-.961-.597 2.816 2.816 0 01-.644-.929 2.987 2.987 0 01-.238-1.21c0-.444.08-.847.238-1.21.15-.35.368-.666.643-.929.278-.261.605-.464.962-.596a3.315 3.315 0 011.182-.21c.355 0 .712.068 1.072.204.361.138.685.36.944.649l-.962.97z" fill="#111827"/>
+</svg>
diff --git a/static/integrations/providers/cohere.svg b/static/integrations/providers/cohere.svg
new file mode 100644
index 000000000..20351993c
--- /dev/null
+++ b/static/integrations/providers/cohere.svg
@@ -0,0 +1,5 @@
+<svg width="28" height="28" viewBox="0 0 28 28" fill="none" xmlns="http://www.w3.org/2000/svg">
+  <path fill-rule="evenodd" clip-rule="evenodd" d="M9.48006 16.4482C10.1707 16.4482 11.5451 16.4097 13.4444 15.628C15.6576 14.7168 20.0617 13.0613 23.2386 11.3627C25.4611 10.175 26.4352 8.60235 26.4352 6.48602C26.4352 5.78728 26.2976 5.0954 26.0302 4.44987C25.7627 3.80434 25.3708 3.21782 24.8766 2.7238C24.3825 2.22977 23.7959 1.83793 23.1503 1.57064C22.5047 1.30336 21.8128 1.16586 21.1141 1.16602H8.80456C6.77807 1.16633 4.83468 1.97156 3.40184 3.40462C1.969 4.83768 1.16406 6.78119 1.16406 8.80768C1.16406 13.0275 4.36656 16.4482 9.48006 16.4482Z" fill="#39594D"/>
+  <path fill-rule="evenodd" clip-rule="evenodd" d="M11.5625 21.7119C11.5624 20.7002 11.8622 19.7113 12.4239 18.8699C12.9856 18.0285 13.784 17.3724 14.7183 16.9846L18.5952 15.3746C22.5163 13.7482 26.8318 16.6299 26.8318 20.8754C26.8318 21.6575 26.6778 22.4319 26.3784 23.1544C26.0791 23.8769 25.6404 24.5334 25.0873 25.0864C24.5343 25.6393 23.8777 26.0779 23.1551 26.3771C22.4325 26.6763 21.6581 26.8302 20.876 26.8301L16.6795 26.8289C16.0074 26.8289 15.3419 26.6965 14.721 26.4393C14.1001 26.182 13.536 25.805 13.0608 25.3297C12.5856 24.8545 12.2088 24.2902 11.9517 23.6693C11.6946 23.0483 11.5623 22.3828 11.5625 21.7107V21.7119Z" fill="#D18EE2"/>
+  <path d="M5.5694 17.4551C4.99084 17.4549 4.41792 17.5688 3.88337 17.7901C3.34882 18.0114 2.86312 18.3359 2.45401 18.745C2.04491 19.1541 1.72042 19.6398 1.49909 20.1744C1.27775 20.7089 1.16391 21.2819 1.16406 21.8604V22.4309C1.18287 23.5867 1.65522 24.6888 2.47922 25.4995C3.30323 26.3102 4.41286 26.7646 5.56881 26.7646C6.72476 26.7646 7.8344 26.3102 8.6584 25.4995C9.48241 24.6888 9.95475 23.5867 9.97356 22.4309V21.8592C9.97356 21.2809 9.85965 20.7082 9.63832 20.1738C9.41699 19.6395 9.09258 19.154 8.68361 18.745C8.27465 18.3361 7.78914 18.0117 7.2548 17.7903C6.72046 17.569 6.14776 17.4551 5.5694 17.4551Z" fill="#FF7759"/>
+</svg>
diff --git a/static/integrations/providers/copilot.svg b/static/integrations/providers/copilot.svg
new file mode 100644
index 000000000..d29181a9a
--- /dev/null
+++ b/static/integrations/providers/copilot.svg
@@ -0,0 +1,3 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24" fill="currentColor" fill-rule="evenodd">
+  <path d="M12 0c6.63 0 12 5.276 12 11.79-.001 5.067-3.29 9.567-8.175 11.187-.6.118-.825-.25-.825-.56 0-.398.015-1.665.015-3.242 0-1.105-.375-1.813-.81-2.181 2.67-.295 5.475-1.297 5.475-5.822 0-1.297-.465-2.344-1.23-3.169.12-.295.54-1.503-.12-3.125 0 0-1.005-.324-3.3 1.209a11.32 11.32 0 00-3-.398c-1.02 0-2.04.133-3 .398-2.295-1.518-3.3-1.209-3.3-1.209-.66 1.622-.24 2.83-.12 3.125-.765.825-1.23 1.887-1.23 3.169 0 4.51 2.79 5.527 5.46 5.822-.345.294-.66.81-.765 1.577-.69.31-2.415.81-3.495-.973-.225-.354-.9-1.223-1.845-1.209-1.005.015-.405.56.015.781.51.28 1.095 1.327 1.23 1.666.24.663 1.02 1.93 4.035 1.385 0 .988.015 1.916.015 2.196 0 .31-.225.664-.825.56C3.303 21.374-.003 16.867 0 11.791 0 5.276 5.37 0 12 0z"/>
+</svg>
diff --git a/static/integrations/providers/deepinfra.svg b/static/integrations/providers/deepinfra.svg
new file mode 100644
index 000000000..096ea0aea
--- /dev/null
+++ b/static/integrations/providers/deepinfra.svg
@@ -0,0 +1,4 @@
+<svg width="24" height="24" viewBox="0 0 24 24" fill="none" xmlns="http://www.w3.org/2000/svg">
+<path d="M5.47441 8.86575C5.01828 8.86515 4.58101 8.6837 4.25849 8.36117C3.93596 8.03864 3.7545 7.60137 3.75391 7.14525C3.7543 6.689 3.93567 6.25154 4.25822 5.92885C4.58077 5.60616 5.01815 5.4246 5.47441 5.424C5.93092 5.4242 6.36869 5.60558 6.69156 5.92832C7.01443 6.25105 7.19601 6.68874 7.19641 7.14525C7.19561 7.6015 7.01386 8.03881 6.69103 8.36121C6.3682 8.68362 5.93066 8.86555 5.47441 8.86575ZM5.47441 6.09975C5.20378 6.11 4.94765 6.22471 4.75981 6.4198C4.57197 6.61489 4.46703 6.87518 4.46703 7.146C4.46703 7.41682 4.57197 7.67711 4.75981 7.8722C4.94765 8.06729 5.20378 8.182 5.47441 8.19225C5.74503 8.182 6.00116 8.06729 6.189 7.8722C6.37684 7.67711 6.48178 7.41682 6.48178 7.146C6.48178 6.87518 6.37684 6.61489 6.189 6.4198C6.00116 6.22471 5.74503 6.11 5.47441 6.09975ZM5.47441 13.7198C5.01815 13.7192 4.58077 13.5376 4.25822 13.2149C3.93567 12.8922 3.7543 12.4548 3.75391 11.9985C3.7545 11.5424 3.93596 11.1051 4.25849 10.7826C4.58101 10.4601 5.01828 10.2786 5.47441 10.278C5.93066 10.2784 6.36812 10.4598 6.69081 10.7823C7.0135 11.1049 7.19506 11.5422 7.19566 11.9985C7.19526 12.4549 7.01379 12.8925 6.69107 13.2152C6.36836 13.5379 5.93079 13.7194 5.47441 13.7198ZM5.47441 10.9537C5.20398 10.964 4.94803 11.0786 4.76033 11.2736C4.57263 11.4685 4.46776 11.7286 4.46776 11.9992C4.46776 12.2699 4.57263 12.53 4.76033 12.7249C4.94803 12.9199 5.20398 13.0345 5.47441 13.0447C5.61504 13.0501 5.75529 13.027 5.88679 12.9768C6.01828 12.9267 6.13831 12.8505 6.23969 12.7529C6.34106 12.6553 6.42171 12.5383 6.47679 12.4088C6.53187 12.2793 6.56026 12.14 6.56026 11.9992C6.56026 11.8585 6.53187 11.7192 6.47679 11.5897C6.42171 11.4602 6.34106 11.3432 6.23969 11.2456C6.13831 11.148 6.01828 11.0718 5.88679 11.0217C5.75529 10.9715 5.61504 10.9484 5.47441 10.9537ZM5.47441 18.5707C5.01828 18.5702 4.58101 18.3887 4.25849 18.0662C3.93596 17.7436 3.7545 17.3064 3.75391 16.8502C3.7543 16.394 3.93567 15.9565 4.25822 15.6338C4.58077 15.3112 5.01815 15.1296 5.47441 15.129C5.93079 15.1294 6.36836 15.3109 
6.69107 15.6336C7.01379 15.9563 7.19526 16.3939 7.19566 16.8502C7.19506 17.3065 7.0135 17.7439 6.69081 18.0664C6.36812 18.389 5.93066 18.5704 5.47441 18.5707ZM5.47441 15.8047C5.20378 15.815 4.94765 15.9297 4.75981 16.1248C4.57197 16.3199 4.46703 16.5802 4.46703 16.851C4.46703 17.1218 4.57197 17.3821 4.75981 17.5772C4.94765 17.7723 5.20378 17.887 5.47441 17.8973C5.74503 17.887 6.00116 17.7723 6.189 17.5772C6.37684 17.3821 6.48178 17.1218 6.48178 16.851C6.48178 16.5802 6.37684 16.3199 6.189 16.1248C6.00116 15.9297 5.74503 15.815 5.47441 15.8047ZM18.5574 8.86575C18.1012 8.86535 17.6637 8.68398 17.341 8.36143C17.0183 8.03889 16.8368 7.6015 16.8362 7.14525C16.8366 6.689 17.0179 6.25154 17.3405 5.92885C17.663 5.60616 18.1004 5.4246 18.5567 5.424C19.013 5.4244 19.4506 5.60587 19.7733 5.92858C20.096 6.25129 20.2775 6.68887 20.2779 7.14525C20.2773 7.60137 20.0959 8.03864 19.7733 8.36117C19.4508 8.6837 19.0135 8.86515 18.5574 8.86575ZM18.5574 6.09975C18.3503 6.0996 18.1477 6.1609 17.9754 6.27589C17.8031 6.39088 17.6688 6.55439 17.5895 6.74573C17.5102 6.93708 17.4894 7.14766 17.5297 7.35083C17.5701 7.55399 17.6698 7.74062 17.8163 7.88709C17.9628 8.03356 18.1494 8.13329 18.3526 8.17367C18.5557 8.21404 18.7663 8.19325 18.9577 8.11391C19.149 8.03457 19.3125 7.90026 19.4275 7.72796C19.5425 7.55567 19.6038 7.35314 19.6037 7.146C19.6037 6.5685 19.1342 6.09975 18.5574 6.09975ZM18.5574 13.7198C18.101 13.7194 17.6634 13.5379 17.3407 13.2152C17.018 12.8925 16.8366 12.4549 16.8362 11.9985C16.8368 11.5424 17.0182 11.1051 17.3407 10.7826C17.6633 10.4601 18.1005 10.2786 18.5567 10.278C19.0129 10.2784 19.4504 10.4598 19.7731 10.7823C20.0957 11.1049 20.2773 11.5422 20.2779 11.9985C20.2775 12.4548 20.0961 12.8922 19.7736 13.2149C19.451 13.5376 19.0137 13.7192 18.5574 13.7198ZM18.5574 10.9537C17.9807 10.9537 17.5112 11.4225 17.5112 11.9985C17.5214 12.2691 17.6361 12.5253 17.8312 12.7131C18.0263 12.9009 18.2866 13.0059 18.5574 13.0059C18.8282 13.0059 19.0885 12.9009 19.2836 12.7131C19.4787 
12.5253 19.5934 12.2691 19.6037 11.9985C19.6037 11.421 19.1342 10.9537 18.5574 10.9537ZM18.5574 18.5707C18.1012 18.5704 17.6637 18.389 17.341 18.0664C17.0183 17.7439 16.8368 17.3065 16.8362 16.8502C16.8366 16.394 17.0179 15.9565 17.3405 15.6338C17.663 15.3112 18.1004 15.1296 18.5567 15.129C19.013 15.1294 19.4506 15.3109 19.7733 15.6336C20.096 15.9563 20.2775 16.3939 20.2779 16.8502C20.2773 17.3064 20.0959 17.7436 19.7733 18.0662C19.4508 18.3887 19.0135 18.5702 18.5574 18.5707ZM18.5574 15.8047C18.3503 15.8046 18.1477 15.8659 17.9754 15.9809C17.8031 16.0959 17.6688 16.2594 17.5895 16.4507C17.5102 16.6421 17.4894 16.8527 17.5297 17.0558C17.5701 17.259 17.6698 17.4456 17.8163 17.5921C17.9628 17.7386 18.1494 17.8383 18.3526 17.8787C18.5557 17.919 18.7663 17.8982 18.9577 17.8189C19.149 17.7396 19.3125 17.6053 19.4275 17.433C19.5425 17.2607 19.6038 17.0581 19.6037 16.851C19.6037 16.2735 19.1342 15.8047 18.5574 15.8047ZM12.0159 11.2928C11.5598 11.2922 11.1225 11.1107 10.8 10.7882C10.4775 10.4656 10.296 10.0284 10.2954 9.57225C10.2958 9.116 10.4772 8.67854 10.7997 8.35585C11.1223 8.03316 11.5597 7.8516 12.0159 7.851C12.4723 7.8514 12.9099 8.03287 13.2326 8.35558C13.5553 8.67829 13.7368 9.11587 13.7372 9.57225C13.7366 10.0285 13.555 10.4659 13.2323 10.7884C12.9096 11.111 12.4722 11.2924 12.0159 11.2928ZM12.0159 8.52675C11.8088 8.5266 11.6062 8.5879 11.4339 8.70289C11.2616 8.81788 11.1273 8.98139 11.048 9.17273C10.9687 9.36408 10.9479 9.57466 10.9882 9.77783C11.0286 9.98099 11.1283 10.1676 11.2748 10.3141C11.4213 10.4606 11.6079 10.5603 11.8111 10.6007C12.0142 10.641 12.2248 10.6202 12.4162 10.5409C12.6075 10.4616 12.771 10.3273 12.886 10.155C13.001 9.98267 13.0623 9.78014 13.0622 9.573C13.0622 8.9955 12.5934 8.52675 12.0159 8.52675ZM12.0167 6.44175C11.5603 6.44135 11.1227 6.25988 10.8 5.93717C10.4773 5.61446 10.2958 5.17688 10.2954 4.7205C10.296 4.26425 10.4776 3.82686 10.8003 3.50432C11.1229 3.18177 11.5604 3.0004 12.0167 3C12.4728 3.0006 12.91 3.18205 13.2326 
3.50458C13.5551 3.82711 13.7366 4.26438 13.7372 4.7205C13.7368 5.17675 13.5554 5.61421 13.2328 5.9369C12.9103 6.25959 12.4729 6.44115 12.0167 6.44175ZM12.0167 3.67575C11.8095 3.6756 11.607 3.7369 11.4347 3.85189C11.2624 3.96688 11.1281 4.13039 11.0487 4.32173C10.9694 4.51308 10.9486 4.72366 10.989 4.92683C11.0294 5.12999 11.1291 5.31662 11.2756 5.46309C11.422 5.60956 11.6087 5.70929 11.8118 5.74967C12.015 5.79004 12.2256 5.76925 12.4169 5.68991C12.6083 5.61057 12.7718 5.47626 12.8868 5.30396C13.0018 5.13167 13.0631 4.92914 13.0629 4.722C13.0629 4.1445 12.5934 3.67575 12.0167 3.67575ZM12.0167 16.1467C11.5603 16.1464 11.1227 15.9649 10.8 15.6422C10.4773 15.3195 10.2958 14.8819 10.2954 14.4255C10.296 13.9692 10.4776 13.5319 10.8003 13.2093C11.1229 12.8868 11.5604 12.7054 12.0167 12.705C12.4728 12.7056 12.91 12.8871 13.2326 13.2096C13.5551 13.5321 13.7366 13.9694 13.7372 14.4255C13.7368 14.8818 13.5554 15.3192 13.2328 15.6419C12.9103 15.9646 12.4729 16.1462 12.0167 16.1467ZM12.0167 13.3807C11.8095 13.3806 11.607 13.4419 11.4347 13.5569C11.2624 13.6719 11.1281 13.8354 11.0487 14.0267C10.9694 14.2181 10.9486 14.4287 10.989 14.6318C11.0294 14.835 11.1291 15.0216 11.2756 15.1681C11.422 15.3146 11.6087 15.4143 11.8118 15.4547C12.015 15.495 12.2256 15.4742 12.4169 15.3949C12.6083 15.3156 12.7718 15.1813 12.8868 15.009C13.0018 14.8367 13.0631 14.6341 13.0629 14.427C13.0629 13.8495 12.5934 13.3807 12.0167 13.3807ZM12.0159 21C11.5597 20.9994 11.1223 20.8178 10.7997 20.4952C10.4772 20.1725 10.2958 19.735 10.2954 19.2787C10.296 18.8226 10.4775 18.3854 10.8 18.0628C11.1225 17.7403 11.5598 17.5588 12.0159 17.5582C12.4722 17.5586 12.9096 17.74 13.2323 18.0626C13.555 18.3851 13.7366 18.8225 13.7372 19.2787C13.7368 19.7351 13.5553 20.1727 13.2326 20.4954C12.9099 20.8181 12.4723 20.9996 12.0159 21ZM12.0159 18.234C11.8088 18.2339 11.6062 18.2952 11.4339 18.4101C11.2616 18.5251 11.1273 18.6886 11.048 18.88C10.9687 19.0713 10.9479 19.2819 10.9882 19.4851C11.0286 19.6882 11.1283 19.8749 
11.2748 20.0213C11.4213 20.1678 11.6079 20.2675 11.8111 20.3079C12.0142 20.3483 12.2248 20.3275 12.4162 20.2482C12.6075 20.1688 12.771 20.0345 12.886 19.8622C13.001 19.6899 13.0623 19.4874 13.0622 19.2803C13.0622 18.7028 12.5934 18.234 12.0159 18.234Z" fill="#2A3275"/>
+<path d="M9.27627 9.16647C9.20512 9.16662 9.13459 9.15313 9.06852 9.12672L7.94802 8.67972C7.87728 8.65388 7.81245 8.6141 7.75737 8.56274C7.70229 8.51138 7.65809 8.44948 7.62738 8.38072C7.59667 8.31195 7.58008 8.23772 7.5786 8.16242C7.57712 8.08713 7.59077 8.0123 7.61875 7.94238C7.64673 7.87246 7.68846 7.80887 7.74148 7.75538C7.79449 7.70189 7.85771 7.65959 7.92738 7.63099C7.99704 7.60239 8.07175 7.58807 8.14705 7.58888C8.22236 7.5897 8.29674 7.60562 8.36577 7.63572L9.48552 8.08197C9.60669 8.13061 9.70717 8.2199 9.76971 8.33452C9.83226 8.44913 9.85298 8.58194 9.82833 8.71017C9.80368 8.83839 9.73519 8.95404 9.6346 9.0373C9.53401 9.12055 9.40759 9.16622 9.27702 9.16647H9.27627ZM9.27627 13.9245C9.20512 13.9246 9.1346 13.9111 9.06852 13.8847L7.94802 13.4385C7.80948 13.3831 7.69862 13.2749 7.63982 13.1378C7.58103 13.0006 7.57912 12.8458 7.63452 12.7072C7.68992 12.5687 7.79808 12.4578 7.93522 12.399C8.07235 12.3402 8.22723 12.3383 8.36577 12.3937L9.48552 12.8407C9.60622 12.8897 9.70618 12.979 9.76836 13.0934C9.83054 13.2079 9.85108 13.3403 9.82648 13.4682C9.80187 13.5961 9.73364 13.7115 9.63344 13.7947C9.53323 13.8779 9.40726 13.9238 9.27702 13.9245H9.27627ZM15.847 11.5477C15.7758 11.5476 15.7053 11.5339 15.6393 11.5072L14.5195 11.061C14.4478 11.036 14.3819 10.9967 14.3258 10.9455C14.2697 10.8944 14.2245 10.8324 14.193 10.7633C14.1615 10.6942 14.1443 10.6194 14.1424 10.5435C14.1406 10.4676 14.1542 10.3921 14.1823 10.3215C14.2104 10.251 14.2525 10.1869 14.3061 10.133C14.3597 10.0792 14.4236 10.0368 14.494 10.0083C14.5644 9.97988 14.6398 9.96597 14.7158 9.96745C14.7917 9.96894 14.8665 9.98578 14.9358 10.017L16.0555 10.4632C16.1767 10.5119 16.2772 10.6012 16.3397 10.7158C16.4023 10.8304 16.423 10.9632 16.3983 11.0914C16.3737 11.2196 16.3052 11.3353 16.2046 11.4185C16.104 11.5018 15.9776 11.5475 15.847 11.5477ZM15.847 6.78747C15.7758 6.78734 15.7053 6.7736 15.6393 6.74697L14.5195 6.29997C14.3874 6.24045 14.2833 6.13224 14.229 5.99787C14.1747 5.86351 14.1743 5.71338 14.228 
5.57876C14.2817 5.44414 14.3852 5.33544 14.5171 5.2753C14.649 5.21516 14.7989 5.20824 14.9358 5.25597L16.0555 5.70297C16.1762 5.75192 16.2762 5.84124 16.3384 5.95568C16.4005 6.07012 16.4211 6.20259 16.3965 6.33049C16.3719 6.45839 16.3036 6.57379 16.2034 6.65699C16.1032 6.74019 15.9773 6.78604 15.847 6.78672V6.78747ZM15.847 16.3042C15.7759 16.3044 15.7053 16.2909 15.6393 16.2645L14.5188 15.8175C14.4475 15.7921 14.3821 15.7526 14.3265 15.7013C14.2708 15.65 14.2261 15.5881 14.1949 15.5191C14.1638 15.4502 14.1469 15.3757 14.1452 15.3C14.1435 15.2244 14.1571 15.1492 14.1852 15.0789C14.2133 15.0087 14.2552 14.9448 14.3085 14.8911C14.3619 14.8375 14.4255 14.7951 14.4955 14.7666C14.5656 14.7381 14.6407 14.724 14.7163 14.7252C14.792 14.7263 14.8666 14.7428 14.9358 14.7735L16.0555 15.2205C16.1762 15.2694 16.2762 15.3587 16.3384 15.4732C16.4005 15.5876 16.4211 15.7201 16.3965 15.848C16.3719 15.9759 16.3036 16.0913 16.2034 16.1745C16.1032 16.2577 15.9773 16.3035 15.847 16.3042ZM8.18652 16.4745C8.05622 16.474 7.93013 16.4282 7.82979 16.3451C7.72946 16.2619 7.6611 16.1466 7.6364 16.0186C7.61169 15.8907 7.63218 15.7581 7.69435 15.6436C7.75652 15.5291 7.85652 15.4397 7.97727 15.3907L9.09777 14.9445C9.23621 14.8892 9.39095 14.8911 9.52795 14.9499C9.66494 15.0087 9.77297 15.1195 9.82827 15.258C9.88357 15.3964 9.88161 15.5511 9.82281 15.6881C9.76402 15.8251 9.65321 15.9332 9.51477 15.9885L8.39502 16.4347C8.32894 16.4611 8.25767 16.4746 8.18652 16.4745ZM8.18652 11.745C8.05589 11.7449 7.92935 11.6993 7.82863 11.6162C7.72791 11.533 7.65928 11.4173 7.63453 11.289C7.60979 11.1608 7.63045 11.0279 7.69299 10.9132C7.75553 10.7985 7.85604 10.7091 7.97727 10.6605L9.09777 10.2142C9.23484 10.1654 9.38551 10.1716 9.51812 10.2315C9.65073 10.2913 9.75497 10.4003 9.80894 10.5354C9.86292 10.6706 9.86243 10.8214 9.80757 10.9561C9.75272 11.0909 9.64777 11.1992 9.51477 11.2582L8.39502 11.7045C8.32894 11.7309 8.25842 11.7444 8.18727 11.7442L8.18652 11.745ZM8.18652 6.77472C8.05589 6.77464 7.92935 6.7291 
7.82863 6.6459C7.72791 6.56271 7.65928 6.44705 7.63453 6.31878C7.60979 6.19051 7.63045 6.05763 7.69299 5.94293C7.75553 5.82824 7.85604 5.73889 7.97727 5.69022L9.09777 5.24397C9.23484 5.19513 9.38551 5.2013 9.51812 5.2612C9.65073 5.3211 9.75497 5.43006 9.80894 5.56519C9.86292 5.70032 9.86243 5.85111 9.80757 5.98588C9.75272 6.12066 9.64777 6.22893 9.51477 6.28797L8.39502 6.73497C8.32894 6.76134 8.25767 6.77483 8.18652 6.77472ZM9.27627 19.023C9.20512 19.0231 9.1346 19.0096 9.06852 18.9832L7.94802 18.5362C7.81588 18.4767 7.71181 18.3685 7.65751 18.2341C7.6032 18.0998 7.60284 17.9496 7.65652 17.815C7.71019 17.6804 7.81375 17.5717 7.9456 17.5115C8.07746 17.4514 8.22743 17.4445 8.36427 17.4922L9.48477 17.9385C9.60594 17.9871 9.70642 18.0764 9.76896 18.191C9.83151 18.3056 9.85223 18.4384 9.82758 18.5667C9.80293 18.6949 9.73444 18.8105 9.63385 18.8938C9.53326 18.977 9.40684 19.0227 9.27627 19.023ZM14.7265 9.16722C14.5956 9.1674 14.4687 9.12193 14.3678 9.03862C14.2668 8.95532 14.198 8.8394 14.1733 8.71084C14.1486 8.58229 14.1695 8.44914 14.2325 8.33435C14.2954 8.21956 14.3964 8.1303 14.518 8.08197L15.6385 7.63572C15.7766 7.58277 15.9299 7.58632 16.0654 7.64559C16.2009 7.70486 16.3075 7.81508 16.3624 7.9524C16.4172 8.08973 16.4157 8.24312 16.3583 8.37937C16.3009 8.51563 16.1921 8.62379 16.0555 8.68047L14.935 9.12672C14.8692 9.15325 14.799 9.167 14.728 9.16722H14.7265ZM14.7265 13.9252C14.596 13.925 14.4695 13.8793 14.3689 13.796C14.2684 13.7128 14.1999 13.5971 14.1752 13.4689C14.1506 13.3407 14.1713 13.2079 14.2338 13.0933C14.2964 12.9787 14.3969 12.8894 14.518 12.8407L15.6385 12.3937C15.7772 12.3384 15.9321 12.3405 16.0692 12.3994C16.2064 12.4583 16.3145 12.5693 16.3698 12.708C16.4251 12.8466 16.423 13.0015 16.3641 13.1387C16.3052 13.2758 16.1942 13.3839 16.0555 13.4392L14.935 13.8855C14.8692 13.9118 14.7989 13.9253 14.728 13.9252H14.7265ZM14.7265 19.0222C14.5958 19.0224 14.4691 18.9771 14.3682 18.8939C14.2673 18.8107 14.1986 18.695 14.1739 18.5666C14.1492 18.4382 14.1701 
18.3052 14.2329 18.1906C14.2957 18.0759 14.3965 17.9868 14.518 17.9385L15.6385 17.4922C15.7078 17.461 15.7826 17.4442 15.8585 17.4427C15.9345 17.4412 16.0099 17.4551 16.0803 17.4836C16.1507 17.512 16.2146 17.5545 16.2682 17.6083C16.3218 17.6621 16.3639 17.7262 16.392 17.7968C16.4201 17.8673 16.4337 17.9428 16.4319 18.0187C16.43 18.0947 16.4128 18.1694 16.3813 18.2385C16.3498 18.3076 16.3046 18.3696 16.2485 18.4208C16.1924 18.4719 16.1265 18.5112 16.0548 18.5362L14.9343 18.9825C14.8682 19.0089 14.7977 19.0224 14.7265 19.0222Z" fill="#2A3275"/>
+</svg>
diff --git a/static/integrations/providers/deepseek.svg b/static/integrations/providers/deepseek.svg
new file mode 100644
index 000000000..5f7cdcfa8
--- /dev/null
+++ b/static/integrations/providers/deepseek.svg
@@ -0,0 +1,3 @@
+<svg width="24" height="24" viewBox="0 0 40 40" xmlns="http://www.w3.org/2000/svg">
+<path d="M35.6638 9.91965C35.3251 9.75432 35.1785 10.0703 34.9811 10.2316C34.9131 10.2836 34.8558 10.3516 34.7985 10.413C34.3025 10.9423 33.7238 11.289 32.9678 11.2476C31.8625 11.1863 30.9186 11.533 30.0839 12.3783C29.9066 11.3356 29.3173 10.7143 28.4213 10.3143C27.9519 10.1063 27.4773 9.89965 27.148 9.44766C26.9186 9.12633 26.856 8.76767 26.7413 8.41568C26.668 8.20235 26.5946 7.98502 26.3506 7.94902C26.084 7.90769 25.98 8.13035 25.876 8.31702C25.4587 9.07967 25.2973 9.91965 25.3133 10.7703C25.3493 12.6849 26.1573 14.2102 27.764 15.2942C27.9466 15.4182 27.9933 15.5435 27.9359 15.7249C27.8266 16.0982 27.696 16.4609 27.5813 16.8355C27.508 17.0742 27.3986 17.1248 27.1426 17.0222C26.2777 16.6504 25.4919 16.1164 24.828 15.4489C23.6854 14.3449 22.6534 13.1263 21.3654 12.1716C21.067 11.9511 20.7606 11.7416 20.4468 11.5436C19.1335 10.2676 20.6201 9.21967 20.9641 9.09567C21.3241 8.965 21.0881 8.51968 19.9254 8.52501C18.7628 8.53035 17.6988 8.91834 16.3428 9.43699C16.1413 9.51421 15.934 9.57529 15.7229 9.61966C14.4557 9.38091 13.1598 9.33506 11.8789 9.48366C9.36565 9.76365 7.35902 10.953 5.88305 12.9809C4.10975 15.4182 3.69243 18.1888 4.20308 21.0768C4.74041 24.122 6.29504 26.6433 8.683 28.6139C11.1603 30.6579 14.0122 31.6592 17.2668 31.4672C19.2428 31.3539 21.4441 31.0886 23.9254 28.9873C24.552 29.2993 25.208 29.4233 26.2986 29.5166C27.1386 29.5953 27.9466 29.4766 28.5719 29.3459C29.5519 29.1379 29.4839 28.23 29.1306 28.0646C26.2573 26.726 26.888 27.2713 26.3133 26.83C27.7746 25.102 29.9746 23.3074 30.8359 17.4928C30.9026 17.0302 30.8452 16.7395 30.8359 16.3662C30.8306 16.1395 30.8826 16.0502 31.1426 16.0249C31.8639 15.95 32.5637 15.7349 33.2025 15.3915C35.0638 14.3742 35.8158 12.7049 35.9931 10.7023C36.0198 10.3956 35.9878 10.081 35.6638 9.91965ZM19.4414 27.9433C16.6562 25.754 15.3055 25.0327 14.7482 25.0634C14.2256 25.0954 14.3202 25.6913 14.4349 26.0807C14.5549 26.4647 14.7109 26.7286 14.9295 27.066C15.0815 27.2886 15.1855 27.6206 14.7789 27.87C13.8816 28.4246 12.3229 
27.6833 12.2496 27.6473C10.435 26.578 8.91632 25.1673 7.84834 23.2381C6.81637 21.3808 6.21638 19.3888 6.11771 17.2622C6.09105 16.7475 6.24171 16.5662 6.7537 16.4729C7.42583 16.3442 8.11451 16.3267 8.79233 16.4209C11.6349 16.8368 14.0536 18.1075 16.0828 20.1194C17.2402 21.2661 18.1161 22.6354 19.0188 23.974C19.9788 25.3953 21.0108 26.75 22.3254 27.8593C22.7894 28.2486 23.1587 28.5446 23.5134 28.7619C22.4441 28.8819 20.6601 28.9086 19.4414 27.9433ZM20.7748 19.3568C20.7745 19.2906 20.7904 19.2253 20.8211 19.1666C20.8517 19.1078 20.8962 19.0575 20.9507 19.0198C21.0052 18.9821 21.068 18.9583 21.1337 18.9503C21.1995 18.9424 21.2662 18.9505 21.3281 18.9741C21.407 19.0024 21.475 19.0546 21.5228 19.1235C21.5706 19.1923 21.5958 19.2743 21.5947 19.3581C21.5949 19.4123 21.5843 19.4659 21.5636 19.5159C21.5428 19.5659 21.5123 19.6113 21.4738 19.6494C21.4354 19.6875 21.3897 19.7176 21.3395 19.7378C21.2893 19.7581 21.2356 19.7682 21.1814 19.7675C21.1277 19.7676 21.0745 19.7571 21.0248 19.7365C20.9752 19.7158 20.9302 19.6855 20.8925 19.6473C20.8548 19.609 20.825 19.5636 20.805 19.5138C20.785 19.4639 20.7739 19.4105 20.7748 19.3568ZM24.9213 21.4848C24.6547 21.5928 24.3893 21.6861 24.1347 21.6981C23.7516 21.7114 23.3756 21.5918 23.0707 21.3594C22.7054 21.0528 22.4441 20.8821 22.3347 20.3488C22.297 20.0881 22.3042 19.823 22.3561 19.5648C22.4494 19.1288 22.3454 18.8488 22.0374 18.5955C21.7881 18.3875 21.4694 18.3302 21.1201 18.3302C21.0005 18.3232 20.8843 18.2875 20.7814 18.2262C20.6348 18.1542 20.5148 17.9728 20.6294 17.7488C20.6668 17.6768 20.8428 17.5008 20.8854 17.4688C21.3601 17.1995 21.9081 17.2875 22.4134 17.4902C22.8827 17.6822 23.2374 18.0342 23.748 18.5328C24.2694 19.1341 24.364 19.3008 24.6613 19.7515C24.896 20.1048 25.1093 20.4674 25.2547 20.8821C25.344 21.1421 25.2293 21.3541 24.9213 21.4848Z" fill="#4D6BFE"/>
+</svg>
diff --git a/static/integrations/providers/fireworks.svg b/static/integrations/providers/fireworks.svg
new file mode 100644
index 000000000..11ed92373
--- /dev/null
+++ b/static/integrations/providers/fireworks.svg
@@ -0,0 +1,3 @@
+<svg width="128" height="128" viewBox="0 0 128 128" fill="none" xmlns="http://www.w3.org/2000/svg">
+  <path d="M102.16 59.6128L80.7231 81.2856L111.279 81.1147L114.203 88.0132L80.7339 88.0952L80.7231 88.0845H80.729C77.9532 88.0845 75.4627 86.4411 74.3853 83.9019C73.3026 81.3406 73.8633 78.4164 75.8198 76.4321L99.2358 52.7144L102.16 59.6128ZM52.1851 76.4155C54.1417 78.3943 54.708 81.3293 53.6196 83.8853C52.5424 86.4301 50.0415 88.0678 47.2769 88.0679L13.8081 87.9917L13.7974 88.0024L16.7212 81.104L47.2769 81.2739L25.8452 59.5962L28.77 52.6978L52.1851 76.4155ZM63.9976 66.5825L75.7163 38.4995H83.2407L70.3071 69.2095C69.2353 71.7597 66.7402 73.4144 63.9536 73.4146C61.1669 73.4146 58.6656 71.76 57.5991 69.1987L44.7427 38.4995H52.2671L63.9976 66.5825Z" fill="#4A1DBD"/>
+</svg>
diff --git a/static/integrations/providers/gemini.svg b/static/integrations/providers/gemini.svg
new file mode 100644
index 000000000..b1235b4a5
--- /dev/null
+++ b/static/integrations/providers/gemini.svg
@@ -0,0 +1,11 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24">
+  <path d="M20.616 10.835a14.147 14.147 0 01-4.45-3.001 14.111 14.111 0 01-3.678-6.452.503.503 0 00-.975 0 14.134 14.134 0 01-3.679 6.452 14.155 14.155 0 01-4.45 3.001c-.65.28-1.318.505-2.002.678a.502.502 0 000 .975c.684.172 1.35.397 2.002.677a14.147 14.147 0 014.45 3.001 14.112 14.112 0 013.679 6.453.502.502 0 00.975 0c.172-.685.397-1.351.677-2.003a14.145 14.145 0 013.001-4.45 14.113 14.113 0 016.453-3.678.503.503 0 000-.975 13.245 13.245 0 01-2.003-.678z" fill="#3186FF"/>
+  <path d="M20.616 10.835a14.147 14.147 0 01-4.45-3.001 14.111 14.111 0 01-3.678-6.452.503.503 0 00-.975 0 14.134 14.134 0 01-3.679 6.452 14.155 14.155 0 01-4.45 3.001c-.65.28-1.318.505-2.002.678a.502.502 0 000 .975c.684.172 1.35.397 2.002.677a14.147 14.147 0 014.45 3.001 14.112 14.112 0 013.679 6.453.502.502 0 00.975 0c.172-.685.397-1.351.677-2.003a14.145 14.145 0 013.001-4.45 14.113 14.113 0 016.453-3.678.503.503 0 000-.975 13.245 13.245 0 01-2.003-.678z" fill="url(#gemini-a)"/>
+  <path d="M20.616 10.835a14.147 14.147 0 01-4.45-3.001 14.111 14.111 0 01-3.678-6.452.503.503 0 00-.975 0 14.134 14.134 0 01-3.679 6.452 14.155 14.155 0 01-4.45 3.001c-.65.28-1.318.505-2.002.678a.502.502 0 000 .975c.684.172 1.35.397 2.002.677a14.147 14.147 0 014.45 3.001 14.112 14.112 0 013.679 6.453.502.502 0 00.975 0c.172-.685.397-1.351.677-2.003a14.145 14.145 0 013.001-4.45 14.113 14.113 0 016.453-3.678.503.503 0 000-.975 13.245 13.245 0 01-2.003-.678z" fill="url(#gemini-b)"/>
+  <path d="M20.616 10.835a14.147 14.147 0 01-4.45-3.001 14.111 14.111 0 01-3.678-6.452.503.503 0 00-.975 0 14.134 14.134 0 01-3.679 6.452 14.155 14.155 0 01-4.45 3.001c-.65.28-1.318.505-2.002.678a.502.502 0 000 .975c.684.172 1.35.397 2.002.677a14.147 14.147 0 014.45 3.001 14.112 14.112 0 013.679 6.453.502.502 0 00.975 0c.172-.685.397-1.351.677-2.003a14.145 14.145 0 013.001-4.45 14.113 14.113 0 016.453-3.678.503.503 0 000-.975 13.245 13.245 0 01-2.003-.678z" fill="url(#gemini-c)"/>
+  <defs>
+    <linearGradient gradientUnits="userSpaceOnUse" id="gemini-a" x1="7" x2="11" y1="15.5" y2="12"><stop stop-color="#08B962"/><stop offset="1" stop-color="#08B962" stop-opacity="0"/></linearGradient>
+    <linearGradient gradientUnits="userSpaceOnUse" id="gemini-b" x1="8" x2="11.5" y1="5.5" y2="11"><stop stop-color="#F94543"/><stop offset="1" stop-color="#F94543" stop-opacity="0"/></linearGradient>
+    <linearGradient gradientUnits="userSpaceOnUse" id="gemini-c" x1="3.5" x2="17.5" y1="13.5" y2="12"><stop stop-color="#FABC12"/><stop offset=".46" stop-color="#FABC12" stop-opacity="0"/></linearGradient>
+  </defs>
+</svg>
diff --git a/static/integrations/providers/googlecloud.svg b/static/integrations/providers/googlecloud.svg
new file mode 100644
index 000000000..80def4a19
--- /dev/null
+++ b/static/integrations/providers/googlecloud.svg
@@ -0,0 +1 @@
+<svg height="1em" style="flex:none;line-height:1" viewBox="0 0 24 24" width="1em" xmlns="http://www.w3.org/2000/svg"><title>GoogleCloud</title><path d="M15.961 7.327l2.086-2.086.14-.879C14.384.905 8.34 1.297 4.913 5.18A9.643 9.643 0 002.88 8.991l.747-.105 4.172-.688.322-.33c1.856-2.038 4.994-2.312 7.137-.578l.703.037z" fill="#EA4335"></path><path d="M21.02 8.93a9.399 9.399 0 00-2.834-4.568L15.258 7.29a5.204 5.204 0 011.91 4.129v.52a2.606 2.606 0 012.607 2.605c0 1.44-1.167 2.577-2.606 2.577h-5.22l-.512.556v3.126l.513.49h5.219c3.743.03 6.802-2.952 6.83-6.695a6.778 6.778 0 00-2.98-5.668z" fill="#4285F4"></path><path d="M6.738 21.293h5.212v-4.172H6.738c-.371 0-.731-.08-1.069-.234l-.74.227-2.1 2.086-.183.71a6.763 6.763 0 004.092 1.383z" fill="#34A853"></path><path d="M6.738 7.759A6.778 6.778 0 002.646 19.91l3.023-3.023a2.606 2.606 0 113.448-3.448l3.023-3.023a6.771 6.771 0 00-5.402-2.657z" fill="#FBBC05"></path></svg>
\ No newline at end of file
diff --git a/static/integrations/providers/groq.svg b/static/integrations/providers/groq.svg
new file mode 100644
index 000000000..ff1105567
--- /dev/null
+++ b/static/integrations/providers/groq.svg
@@ -0,0 +1,3 @@
+<svg width="24" height="24" viewBox="0 0 40 40" xmlns="http://www.w3.org/2000/svg">
+<path d="M20.056 4.50022C14.0839 4.44597 9.20616 9.15015 9.15036 15.0106C9.09611 20.8726 13.8855 25.6621 19.8576 25.7163H23.6085V21.7391H20.056C16.3252 21.7825 13.2671 18.8468 13.2237 15.1827C13.1787 11.5216 16.1702 8.52086 19.901 8.47746H20.056C23.7868 8.47746 26.8108 11.4457 26.8216 15.1083V24.8809C26.8216 28.5109 23.8085 31.4683 20.1211 31.5132C18.3617 31.5007 16.6759 30.8049 15.42 29.5726L12.551 32.3905C14.5529 34.3571 17.239 35.4715 20.0451 35.4998H20.1877C26.0823 35.413 30.8175 30.7212 30.85 24.9351V14.8603C30.7059 9.0928 25.9165 4.50022 20.056 4.50022Z" fill="#F05237"/>
+</svg>
diff --git a/static/integrations/providers/huggingface.svg b/static/integrations/providers/huggingface.svg
new file mode 100644
index 000000000..ae94384ff
--- /dev/null
+++ b/static/integrations/providers/huggingface.svg
@@ -0,0 +1,8 @@
+<svg viewBox="0 0 24 24" width="24" height="24" xmlns="http://www.w3.org/2000/svg">
+  <path d="M2.25 11.535c0-3.407 1.847-6.554 4.844-8.258a9.822 9.822 0 019.687 0c2.997 1.704 4.844 4.851 4.844 8.258 0 5.266-4.337 9.535-9.687 9.535S2.25 16.8 2.25 11.535z" fill="#FF9D0B"/>
+  <path d="M11.938 20.086c4.797 0 8.687-3.829 8.687-8.551 0-4.722-3.89-8.55-8.687-8.55-4.798 0-8.688 3.828-8.688 8.55 0 4.722 3.89 8.55 8.688 8.55z" fill="#FFD21E"/>
+  <path d="M11.875 15.113c2.457 0 3.25-2.156 3.25-3.263 0-.576-.393-.394-1.023-.089-.582.283-1.365.675-2.224.675-1.798 0-3.25-1.693-3.25-.586 0 1.107.79 3.263 3.25 3.263h-.003z" fill="#FF323D"/>
+  <path d="M14.76 9.21c.32.108.445.753.767.585.447-.233.707-.708.659-1.204a1.235 1.235 0 00-.879-1.059 1.262 1.262 0 00-1.33.394c-.322.384-.377.92-.14 1.36.153.283.638-.177.925-.079l-.002.003zm-5.887 0c-.32.108-.448.753-.768.585a1.226 1.226 0 01-.658-1.204c.048-.495.395-.913.878-1.059a1.262 1.262 0 011.33.394c.322.384.377.92.14 1.36-.152.283-.64-.177-.925-.079l.003.003zm1.12 5.34a2.166 2.166 0 011.325-1.106c.07-.02.144.06.219.171l.192.306c.069.1.139.175.209.175.074 0 .15-.074.223-.172l.205-.302c.08-.11.157-.188.234-.165.537.168.986.536 1.25 1.026.932-.724 1.275-1.905 1.275-2.633 0-.508-.306-.426-.81-.19l-.616.296c-.52.24-1.148.48-1.824.48-.676 0-1.302-.24-1.823-.48l-.589-.283c-.52-.248-.838-.342-.838.177 0 .703.32 1.831 1.187 2.56l.18.14z" fill="#3A3B45"/>
+  <path d="M17.812 10.366a.806.806 0 00.813-.8c0-.441-.364-.8-.813-.8a.806.806 0 00-.812.8c0 .442.364.8.812.8zm-11.624 0a.806.806 0 00.812-.8c0-.441-.364-.8-.812-.8a.806.806 0 00-.813.8c0 .442.364.8.813.8zM4.515 13.073c-.405 0-.765.162-1.017.46a1.455 1.455 0 00-.333.925 1.801 1.801 0 00-.485-.074c-.387 0-.737.146-.985.409a1.41 1.41 0 00-.2 1.722 1.302 1.302 0 00-.447.694c-.06.222-.12.69.2 1.166a1.267 1.267 0 00-.093 1.236c.238.533.81.958 1.89 1.405l.24.096c.768.3 1.473.492 1.478.494.89.243 1.808.375 2.732.394 1.465 0 2.513-.443 3.115-1.314.93-1.342.842-2.575-.274-3.763l-.151-.154c-.692-.684-1.155-1.69-1.25-1.912-.195-.655-.71-1.383-1.562-1.383-.46.007-.889.233-1.15.605-.25-.31-.495-.553-.715-.694a1.87 1.87 0 00-.993-.312zm14.97 0c.405 0 .767.162 1.017.46.216.262.333.588.333.925.158-.047.322-.071.487-.074.388 0 .738.146.985.409a1.41 1.41 0 01.2 1.722c.22.178.377.422.445.694.06.222.12.69-.2 1.166.244.37.279.836.093 1.236-.238.533-.81.958-1.889 1.405l-.239.096c-.77.3-1.475.492-1.48.494-.89.243-1.808.375-2.732.394-1.465 0-2.513-.443-3.115-1.314-.93-1.342-.842-2.575.274-3.763l.151-.154c.695-.684 1.157-1.69 1.252-1.912.195-.655.708-1.383 1.56-1.383.46.007.889.233 1.15.605.25-.31.495-.553.718-.694.244-.162.523-.265.814-.3l.176-.012z" fill="#FF9D0B"/>
+  <path d="M9.785 20.132c.688-.994.638-1.74-.305-2.667-.945-.928-1.495-2.288-1.495-2.288s-.205-.788-.672-.714c-.468.074-.81 1.25.17 1.971.977.721-.195 1.21-.573.534-.375-.677-1.405-2.416-1.94-2.751-.532-.332-.907-.148-.782.541.125.687 2.357 2.35 2.14 2.707-.218.362-.983-.42-.983-.42S2.953 14.9 2.43 15.46c-.52.558.398 1.026 1.7 1.803 1.308.778 1.41.985 1.225 1.28-.187.295-3.07-2.1-3.34-1.083-.27 1.011 2.943 1.304 2.745 2.006-.2.7-2.265-1.324-2.685-.537-.425.79 2.913 1.718 2.94 1.725 1.075.276 3.813.859 4.77-.522zm4.432 0c-.687-.994-.64-1.74.305-2.667.943-.928 1.493-2.288 1.493-2.288s.205-.788.675-.714c.465.074.807 1.25-.17 1.971-.98.721.195 1.21.57.534.377-.677 1.407-2.416 1.94-2.751.532-.332.91-.148.782.541-.125.687-2.355 2.35-2.137 2.707.215.362.98-.42.98-.42S21.05 14.9 21.57 15.46c.52.558-.395 1.026-1.7 1.803-1.308.778-1.408.985-1.225 1.28.187.295 3.07-2.1 3.34-1.083.27 1.011-2.94 1.304-2.743 2.006.2.7 2.263-1.324 2.685-.537.423.79-2.912 1.718-2.94 1.725-1.077.276-3.815.859-4.77-.522z" fill="#FFD21E"/>
+</svg>
diff --git a/static/integrations/providers/mistral.svg b/static/integrations/providers/mistral.svg
new file mode 100644
index 000000000..0183909ba
--- /dev/null
+++ b/static/integrations/providers/mistral.svg
@@ -0,0 +1,7 @@
+<svg width="28" height="28" viewBox="0 0 28 28" fill="none" xmlns="http://www.w3.org/2000/svg">
+  <path d="M4 3.9668H8.0005V7.96613H4V3.9668ZM19.9997 3.9668H24.0013V7.96613H19.9997V3.9668Z" fill="#FFD700"/>
+  <path d="M4 7.9668H11.9998V11.9673H4.00117L4 7.9668ZM16.0003 7.9668H24.0002V11.9673H16.0003V7.9668Z" fill="#FFAF00"/>
+  <path d="M4 11.9668H24.0013V15.9661H4V11.9668Z" fill="#FF8205"/>
+  <path d="M4 15.9668H8.0005V19.9661H4V15.9668ZM12.001 15.9668H16.0015V19.9661H12.001V15.9668ZM19.9997 15.9668H24.0013V19.9661H19.9997V15.9668Z" fill="#FA500F"/>
+  <path d="M0 19.9668H12.0003V23.9673H0V19.9668ZM15.9997 19.9668H28V23.9673H15.9997V19.9668Z" fill="#E10500"/>
+</svg>
diff --git a/static/integrations/providers/ollama.svg b/static/integrations/providers/ollama.svg
new file mode 100644
index 000000000..2635b08db
--- /dev/null
+++ b/static/integrations/providers/ollama.svg
@@ -0,0 +1,8 @@
+<svg viewBox="0 0 28 28" fill="none" xmlns="http://www.w3.org/2000/svg">
+  <path
+    fill-rule="evenodd"
+    clip-rule="evenodd"
+    d="M9.22529 1.27126C9.47729 1.37043 9.70479 1.53376 9.91129 1.7496C10.2555 2.1066 10.546 2.6176 10.7676 3.2231C10.9905 3.8321 11.1351 4.50643 11.19 5.1831C11.9245 4.76754 12.7397 4.5145 13.5805 4.4411L13.64 4.43643C14.655 4.35476 15.6583 4.53793 16.5333 4.98943C16.6511 5.05126 16.7666 5.11776 16.8798 5.18776C16.9381 4.52393 17.0805 3.86476 17.2998 3.26976C17.5215 2.6631 17.812 2.15326 18.155 1.7951C18.3466 1.58774 18.5811 1.42453 18.8421 1.31676C19.142 1.2001 19.4605 1.1791 19.7708 1.26776C20.2386 1.40076 20.64 1.6971 20.9561 2.1276C21.2455 2.52076 21.4625 3.02476 21.6106 3.6291C21.879 4.71876 21.9256 6.1526 21.7448 7.8816L21.8066 7.92826L21.837 7.95043C22.7201 8.62243 23.335 9.58026 23.6605 10.6921C24.168 12.4269 23.9125 14.3729 23.0375 15.4614L23.0165 15.4859L23.0188 15.4894C23.5053 16.3784 23.8005 17.3176 23.8635 18.2894L23.8658 18.3244C23.9405 19.5669 23.6325 20.8176 22.9161 22.0461L22.908 22.0578L22.9196 22.0858C23.4703 23.4356 23.643 24.7948 23.4306 26.1528L23.4236 26.1983C23.3907 26.3966 23.2805 26.5739 23.1171 26.6911C22.9538 26.8083 22.7506 26.856 22.5521 26.8236C22.4539 26.8083 22.3596 26.7737 22.2747 26.7218C22.1898 26.67 22.116 26.6019 22.0575 26.5215C21.999 26.4411 21.9569 26.3499 21.9336 26.2532C21.9104 26.1565 21.9065 26.0562 21.9221 25.9579C22.117 24.7528 21.9338 23.5441 21.3621 22.3144C21.3088 22.2002 21.2851 22.0744 21.2933 21.9485C21.3014 21.8227 21.3411 21.701 21.4088 21.5946L21.4135 21.5876C22.1181 20.5096 22.4098 19.4526 22.3468 18.4143C22.2931 17.5054 21.9676 16.6129 21.4135 15.7624C21.3057 15.5971 21.2673 15.396 21.3066 15.2026C21.3459 15.0091 21.4597 14.8389 21.6235 14.7288L21.634 14.7218C21.9175 14.5363 22.1788 14.0626 22.3106 13.4151C22.4561 12.6495 22.4181 11.8602 22.1998 11.1121C21.9606 10.2954 21.5231 9.6141 20.9106 9.1486C20.2165 8.61893 19.2971 8.36343 18.134 8.43693C17.9819 8.44682 17.8303 8.41086 17.6988 8.3337C17.5674 8.25654 17.4621 8.14172 17.3966 8.0041C17.0303 7.22826 16.496 6.67293 15.8298 6.32876C15.1902 6.00956 14.4742 
5.87541 13.7625 5.94143C12.31 6.05693 11.029 6.87593 10.6475 7.90843C10.5935 8.05375 10.4964 8.17911 10.3692 8.26772C10.242 8.35634 10.0908 8.40398 9.93579 8.40426C8.69095 8.4066 7.72729 8.69826 7.02262 9.22443C6.41362 9.67943 5.99829 10.3153 5.77895 11.0771C5.58048 11.7942 5.5533 12.5479 5.69962 13.2774C5.83029 13.9284 6.08579 14.4674 6.37862 14.7579L6.38795 14.7661C6.63529 15.0076 6.68779 15.3844 6.51512 15.6819C6.09512 16.4076 5.78129 17.4891 5.72995 18.5286C5.67162 19.7163 5.94695 20.7476 6.56879 21.4873L6.58745 21.5094C6.68129 21.6188 6.74165 21.7529 6.76131 21.8956C6.78096 22.0384 6.75908 22.1838 6.69829 22.3144C6.02629 23.7564 5.81979 24.9418 6.04262 25.8751C6.08267 26.0692 6.04541 26.2712 5.93875 26.4382C5.8321 26.6053 5.66447 26.7241 5.47155 26.7694C5.27863 26.8147 5.07565 26.7829 4.9058 26.6808C4.73595 26.5787 4.61264 26.4144 4.56212 26.2228C4.27862 25.0351 4.47112 23.6748 5.11395 22.1418L5.13029 22.1009L5.12095 22.0869C4.80501 21.6203 4.56921 21.1041 4.42329 20.5598L4.41745 20.5376C4.24037 19.8585 4.17069 19.1558 4.21095 18.4551C4.26229 17.3934 4.53529 16.3061 4.93662 15.4334L4.95062 15.4031L4.94829 15.4008C4.60645 14.9131 4.35329 14.2889 4.21329 13.5983L4.20745 13.5703C4.01456 12.6069 4.05174 11.6116 4.31595 10.6653C4.62162 9.59776 5.22245 8.68076 6.10795 8.0181C6.17795 7.9656 6.25145 7.9131 6.32495 7.8641C6.13945 6.12226 6.18612 4.6791 6.45562 3.58243C6.60379 2.9781 6.82195 2.4741 7.11129 2.08093C7.42629 1.6516 7.82762 1.35526 8.29545 1.2211C8.60579 1.13243 8.92545 1.15226 9.22529 1.2701V1.27126ZM14.0273 11.8763C15.1193 11.8763 16.1273 12.2414 16.881 12.8738C17.616 13.4886 18.0535 14.3146 18.0535 15.1371C18.0535 16.1731 17.5798 16.9804 16.7316 17.4961C16.0083 17.9336 15.0388 18.1459 13.9281 18.1459C12.751 18.1459 11.7453 17.8438 11.0196 17.2896C10.2998 16.7413 9.89612 15.9713 9.89612 15.1371C9.89612 14.3123 10.3605 13.4839 11.1281 12.8668C11.9075 12.2403 12.9365 11.8763 14.0273 11.8763ZM14.0273 12.9216C13.2179 12.9145 12.43 13.1818 11.792 
13.6799C11.2541 14.1116 10.9496 14.6541 10.9496 15.1383C10.9496 15.6376 11.1946 16.1054 11.6613 16.4613C12.1921 16.8661 12.9726 17.1006 13.9281 17.1006C14.8603 17.1006 15.6466 16.9291 16.1821 16.6036C16.7223 16.2769 16.9988 15.8033 16.9988 15.1371C16.9988 14.6436 16.7118 14.0988 16.202 13.6718C15.6373 13.1993 14.872 12.9216 14.0273 12.9216ZM14.7996 14.3333L14.8043 14.3379C14.9443 14.5141 14.9151 14.7696 14.739 14.9096L14.3983 15.1779V15.6983C14.3977 15.8141 14.3511 15.925 14.2689 16.0065C14.1867 16.0881 14.0755 16.1337 13.9596 16.1334C13.8438 16.1337 13.7326 16.0881 13.6503 16.0065C13.5681 15.925 13.5216 15.8141 13.521 15.6983V15.1616L13.2048 14.9073C13.1631 14.8738 13.1284 14.8325 13.1028 14.7856C13.0771 14.7387 13.061 14.6872 13.0554 14.6341C13.0497 14.5809 13.0547 14.5272 13.0699 14.476C13.0851 14.4247 13.1104 14.377 13.1441 14.3356C13.213 14.2518 13.3121 14.1985 13.4201 14.1874C13.528 14.1762 13.6359 14.2081 13.7205 14.2761L13.9713 14.4768L14.228 14.2738C14.3122 14.2072 14.4191 14.1762 14.5259 14.1873C14.6327 14.1984 14.7309 14.2508 14.7996 14.3333ZM8.91962 12.0944C9.47729 12.0944 9.93112 12.5494 9.93112 13.1106C9.93143 13.3796 9.82495 13.6377 9.63507 13.8282C9.44519 14.0188 9.18745 14.1261 8.91845 14.1268C8.64987 14.1258 8.39259 14.0185 8.203 13.8282C8.01341 13.638 7.90695 13.3804 7.90695 13.1118C7.90633 12.8428 8.01252 12.5845 8.20218 12.3938C8.39184 12.203 8.65063 12.0954 8.91962 12.0944ZM19.0766 12.0944C19.6366 12.0944 20.0893 12.5494 20.0893 13.1106C20.0896 13.3796 19.9831 13.6377 19.7932 13.8282C19.6034 14.0188 19.3456 14.1261 19.0766 14.1268C18.808 14.1258 18.5508 14.0185 18.3612 13.8282C18.1716 13.638 18.0651 13.3804 18.0651 13.1118C18.0645 12.8428 18.1707 12.5845 18.3603 12.3938C18.55 12.203 18.8076 12.0954 19.0766 12.0944ZM8.68279 2.68293L8.67929 2.68526C8.54413 2.74404 8.42872 2.84042 8.34679 2.96293L8.34095 2.96993C8.17995 3.19043 8.03995 3.51476 7.93495 3.9406C7.73662 4.74793 7.68295 5.84343 7.79029 7.18626C8.29195 7.03693 8.83912 6.9436 9.42829 
6.90976L9.43995 6.9086L9.46212 6.86893C9.51579 6.77326 9.57295 6.6811 9.63479 6.5901C9.77829 5.6906 9.66045 4.6161 9.33962 3.73876C9.18329 3.3141 8.99312 2.98043 8.81112 2.79026C8.77355 2.75073 8.73168 2.71551 8.68629 2.68526L8.68279 2.68293ZM19.3858 2.7296L19.3835 2.73076C19.3381 2.76101 19.2962 2.79623 19.2586 2.83576C19.0766 3.02593 18.8853 3.36076 18.7301 3.78543C18.3918 4.71176 18.2786 5.85743 18.4618 6.7861L18.5295 6.89926L18.5388 6.9156H18.5738C19.1528 6.91575 19.7288 6.99904 20.2841 7.16293C20.3845 5.8516 20.3285 4.77943 20.1348 3.98726C20.0298 3.56143 19.8898 3.2371 19.7276 3.0166L19.723 3.0096C19.6412 2.88665 19.5258 2.78985 19.3905 2.73076H19.3858V2.7296Z"
+    fill="black"
+  />
+</svg>
diff --git a/static/integrations/providers/openai.svg b/static/integrations/providers/openai.svg
new file mode 100644
index 000000000..308ed8ab7
--- /dev/null
+++ b/static/integrations/providers/openai.svg
@@ -0,0 +1 @@
+<svg fill="currentColor" fill-rule="evenodd" height="1em" viewBox="0 0 24 24" width="1em" xmlns="http://www.w3.org/2000/svg"><path d="M9.205 8.658v-2.26c0-.19.072-.333.238-.428l4.543-2.616c.619-.357 1.356-.523 2.117-.523 2.854 0 4.662 2.212 4.662 4.566 0 .167 0 .357-.024.547l-4.71-2.759a.797.797 0 00-.856 0l-5.97 3.473zm10.609 8.8V12.06c0-.333-.143-.57-.429-.737l-5.97-3.473 1.95-1.118a.433.433 0 01.476 0l4.543 2.617c1.309.76 2.189 2.378 2.189 3.948 0 1.808-1.07 3.473-2.76 4.163zM7.802 12.703l-1.95-1.142c-.167-.095-.239-.238-.239-.428V5.899c0-2.545 1.95-4.472 4.591-4.472 1 0 1.927.333 2.712.928L8.23 5.067c-.285.166-.428.404-.428.737v6.898zM12 15.128l-2.795-1.57v-3.33L12 8.658l2.795 1.57v3.33L12 15.128zm1.796 7.23c-1 0-1.927-.332-2.712-.927l4.686-2.712c.285-.166.428-.404.428-.737v-6.898l1.974 1.142c.167.095.238.238.238.428v5.233c0 2.545-1.974 4.472-4.614 4.472zm-5.637-5.303l-4.544-2.617c-1.308-.761-2.188-2.378-2.188-3.948A4.482 4.482 0 014.21 6.327v5.423c0 .333.143.571.428.738l5.947 3.449-1.95 1.118a.432.432 0 01-.476 0zm-.262 3.9c-2.688 0-4.662-2.021-4.662-4.519 0-.19.024-.38.047-.57l4.686 2.71c.286.167.571.167.856 0l5.97-3.448v2.26c0 .19-.07.333-.237.428l-4.543 2.616c-.619.357-1.356.523-2.117.523zm5.899 2.83a5.947 5.947 0 005.827-4.756C22.287 18.339 24 15.84 24 13.296c0-1.665-.713-3.282-1.998-4.448.119-.5.19-.999.19-1.498 0-3.401-2.759-5.947-5.946-5.947-.642 0-1.26.095-1.88.31A5.962 5.962 0 0010.205 0a5.947 5.947 0 00-5.827 4.757C1.713 5.447 0 7.945 0 10.49c0 1.666.713 3.283 1.998 4.448-.119.5-.19 1-.19 1.499 0 3.401 2.759 5.946 5.946 5.946.642 0 1.26-.095 1.88-.309a5.96 5.96 0 004.162 1.713z"/></svg>
diff --git a/static/integrations/providers/openrouter.svg b/static/integrations/providers/openrouter.svg
new file mode 100644
index 000000000..7e8abc81d
--- /dev/null
+++ b/static/integrations/providers/openrouter.svg
@@ -0,0 +1,8 @@
+<svg width="24" height="24" viewBox="0 0 24 24" fill="none" xmlns="http://www.w3.org/2000/svg">
+<path d="M3.10913 12.07C3.65512 12.07 5.76627 11.5988 6.85825 10.98C7.95023 10.3612 7.95023 10.3612 10.207 8.75965C13.0642 6.73196 15.0845 7.41088 18.3968 7.41088" fill="currentColor"/>
+<path d="M3.10913 12.07C3.65512 12.07 5.76627 11.5988 6.85825 10.98C7.95023 10.3612 7.95023 10.3612 10.207 8.75965C13.0642 6.73196 15.0845 7.41088 18.3968 7.41088" stroke="currentColor" stroke-width="3.27593"/>
+<path d="M21.6 7.43108L16.0037 10.6622V4.20001L21.6 7.43108Z" fill="currentColor" stroke="currentColor" stroke-width="0.0363992"/>
+<path d="M3 12.072C3.54599 12.072 5.65714 12.5432 6.74912 13.162C7.8411 13.7808 7.8411 13.7808 10.0978 15.3823C12.9551 17.41 14.9753 16.7311 18.2877 16.7311" fill="currentColor"/>
+<path d="M3 12.072C3.54599 12.072 5.65714 12.5432 6.74912 13.162C7.8411 13.7808 7.8411 13.7808 10.0978 15.3823C12.9551 17.41 14.9753 16.7311 18.2877 16.7311" stroke="currentColor" stroke-width="3.27593"/>
+<path d="M21.4909 16.7109L15.8945 13.4798V19.942L21.4909 16.7109Z" fill="currentColor" stroke="currentColor" stroke-width="0.0363992"/>
+</svg>
diff --git a/static/integrations/providers/togetherai.svg b/static/integrations/providers/togetherai.svg
new file mode 100644
index 000000000..68413386c
--- /dev/null
+++ b/static/integrations/providers/togetherai.svg
@@ -0,0 +1,4 @@
+<svg width="24" height="24" viewBox="0 0 40 40" xmlns="http://www.w3.org/2000/svg">
+<path opacity="0.25" d="M27.539 18.922C29.2526 18.922 30.8959 18.2413 32.1076 17.0296C33.3193 15.8179 34 14.1746 34 12.461C34 10.7474 33.3193 9.10406 32.1076 7.89238C30.8959 6.68071 29.2526 6 27.539 6C25.8254 6 24.1821 6.68071 22.9704 7.89238C21.7587 9.10406 21.078 10.7474 21.078 12.461C21.078 14.1746 21.7587 15.8179 22.9704 17.0296C24.1821 18.2413 25.8254 18.922 27.539 18.922ZM27.539 34C29.2526 34 30.8959 33.3193 32.1076 32.1076C33.3193 30.8959 34 29.2526 34 27.539C34 25.8254 33.3193 24.1821 32.1076 22.9704C30.8959 21.7587 29.2526 21.078 27.539 21.078C25.8254 21.078 24.1821 21.7587 22.9704 22.9704C21.7587 24.1821 21.078 25.8254 21.078 27.539C21.078 29.2526 21.7587 30.8959 22.9704 32.1076C24.1821 33.3193 25.8254 34 27.539 34ZM12.461 34C14.1746 34 15.8179 33.3193 17.0296 32.1076C18.2413 30.8959 18.922 29.2526 18.922 27.539C18.922 25.8254 18.2413 24.1821 17.0296 22.9704C15.8179 21.7587 14.1746 21.078 12.461 21.078C10.7474 21.078 9.10406 21.7587 7.89238 22.9704C6.68071 24.1821 6 25.8254 6 27.539C6 29.2526 6.68071 30.8959 7.89238 32.1076C9.10406 33.3193 10.7474 34 12.461 34Z" fill="currentColor"/>
+<path d="M12.461 18.922C16.0293 18.922 18.922 16.0293 18.922 12.461C18.922 8.89269 16.0293 6 12.461 6C8.89269 6 6 8.89269 6 12.461C6 16.0293 8.89269 18.922 12.461 18.922Z" fill="currentColor"/>
+</svg>
diff --git a/static/integrations/providers/vertex.svg b/static/integrations/providers/vertex.svg
new file mode 100644
index 000000000..4fc47470b
--- /dev/null
+++ b/static/integrations/providers/vertex.svg
@@ -0,0 +1,10 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24">
+  <path d="M11.995 20.216a1.892 1.892 0 100 3.785 1.892 1.892 0 000-3.785zm0 2.806a.927.927 0 11.927-.914.914.914 0 01-.927.914z" fill="#4285F4"/>
+  <path clip-rule="evenodd" d="M21.687 14.144c.237.038.452.16.605.344a.978.978 0 01-.18 1.3l-8.24 6.082a1.892 1.892 0 00-1.147-1.508l8.28-6.08a.991.991 0 01.682-.138z" fill="#669DF6" fill-rule="evenodd"/>
+  <path clip-rule="evenodd" d="M10.122 21.842l-8.217-6.066a.952.952 0 01-.206-1.287.978.978 0 011.287-.206l8.28 6.08a1.893 1.893 0 00-1.144 1.479z" fill="#AECBFA" fill-rule="evenodd"/>
+  <path d="M4.273 4.475a.978.978 0 01-.965-.965V1.09a.978.978 0 111.943 0v2.42a.978.978 0 01-.978.965zM4.247 13.034a.978.978 0 100-1.956.978.978 0 000 1.956zM4.247 10.19a.978.978 0 100-1.956.978.978 0 000 1.956zM4.247 7.332a.978.978 0 100-1.956.978.978 0 000 1.956z" fill="#AECBFA"/>
+  <path d="M19.718 7.307a.978.978 0 01-.965-.979v-2.42a.965.965 0 011.93 0v2.42a.964.964 0 01-.965.979zM19.743 13.047a.978.978 0 100-1.956.978.978 0 000 1.956zM19.743 10.151a.978.978 0 100-1.956.978.978 0 000 1.956zM19.743 2.068a.978.978 0 100-1.956.978.978 0 000 1.956z" fill="#4285F4"/>
+  <path d="M11.995 15.917a.978.978 0 01-.965-.965v-2.459a.978.978 0 011.943 0v2.433a.976.976 0 01-.978.991zM11.995 18.762a.978.978 0 100-1.956.978.978 0 000 1.956zM11.995 10.64a.978.978 0 100-1.956.978.978 0 000 1.956zM11.995 7.783a.978.978 0 100-1.956.978.978 0 000 1.956z" fill="#669DF6"/>
+  <path d="M15.856 10.177a.978.978 0 01-.965-.965v-2.42a.977.977 0 011.702-.763.979.979 0 01.241.763v2.42a.978.978 0 01-.978.965zM15.869 4.913a.978.978 0 100-1.956.978.978 0 000 1.956zM15.869 15.853a.978.978 0 100-1.956.978.978 0 000 1.956zM15.869 12.996a.978.978 0 100-1.956.978.978 0 000 1.956z" fill="#4285F4"/>
+  <path d="M8.121 15.853a.978.978 0 100-1.956.978.978 0 000 1.956zM8.121 7.783a.978.978 0 100-1.956.978.978 0 000 1.956zM8.121 4.913a.978.978 0 100-1.957.978.978 0 000 1.957zM8.134 12.996a.978.978 0 01-.978-.94V9.611a.965.965 0 011.93 0v2.445a.966.966 0 01-.952.94z" fill="#AECBFA"/>
+</svg>
diff --git a/static/integrations/providers/xai.svg b/static/integrations/providers/xai.svg
new file mode 100644
index 000000000..ccd22443c
--- /dev/null
+++ b/static/integrations/providers/xai.svg
@@ -0,0 +1,3 @@
+<svg width="24" height="24" viewBox="0 0 40 40" xmlns="http://www.w3.org/2000/svg">
+<path d="M12.4579 15.6036L26.1529 35H20.0656L6.37059 15.6036H12.4579ZM12.4524 26.3764L15.4974 30.6909L12.4551 35H6.36377L12.4524 26.3764ZM33.6365 7.15727V35H28.647V14.2236L33.6365 7.15727ZM33.6365 5L20.0656 24.2205L17.0206 19.9073L27.5451 5H33.6365Z" fill="currentColor"/>
+</svg>