Set up a self-hosted provider

Last validated: Apr 15, 2026

Aperture by Tailscale is currently in beta.

Configure a self-hosted LLM server as a provider in Aperture so your team can access private models through your tailnet. Any server that exposes an OpenAI-compatible chat completions endpoint works with the default configuration. Aperture supports both /v1/chat/completions and /chat/completions paths. Common servers include llama.cpp, vLLM, and Ollama.

Aperture routes requests based on the model name, not the LLM client. Any LLM client configured to use Aperture can access any provider your admin has set up. Refer to the provider compatibility reference for the full list of supported providers and API formats.

Prerequisites

Before you begin, you need:

An Aperture instance accessible from your device. Refer to get started with Aperture if you have not set this up.
A self-hosted LLM server accessible from the Aperture host (in the tailnet or on localhost).

Configure the provider

Add your self-hosted server as a provider in your Aperture configuration:

{
  "providers": {
    "private": {
      "baseurl": "http://100.64.0.1:8080/v1",
      "models": ["qwen3-coder-30b", "llama-3.1-70b"]
    }
  }
}

Set baseurl to the full URL of your server's API endpoint, including the path prefix if your server uses one (for example, http://100.64.0.1:8080/v1 for a vLLM server that serves at /v1). If your server serves at the root path without a version prefix, set baseurl to just the host and port (for example, http://100.64.0.1:8080). To find the correct model names, query your server's model list endpoint (typically GET /v1/models) and use the id field from the response.

Whether to include /v1 in baseurl depends on how your LLM clients connect to Aperture. Most clients (Codex, OpenAI-compatible tools) are configured with http://<aperture-hostname>/v1 as the base URL, which sends /v1/chat/completions in the request path. Because Aperture appends the full request path to baseurl, including /v1 in the self-hosted baseurl when the client also sends /v1 produces a doubled path (/v1/v1/chat/completions). If your clients use the standard /v1 base URL, set the self-hosted baseurl to just the host and port (for example, http://100.64.0.1:8080). The configuration example above includes /v1 in baseurl, which works when clients connect to Aperture without /v1 in their base URL (sending /chat/completions instead of /v1/chat/completions). Refer to how Aperture builds upstream URLs for details.

Self-hosted providers use openai_chat compatibility and bearer authorization by default, so no additional flags are needed for servers that expose an OpenAI-compatible API. If your server uses a different API format, set the appropriate compatibility flags.

If your server does not require authentication, omit the apikey field. If your server requires a key, add "apikey": "<your-key>" to the provider block.

After configuring the provider:

Grant model access to the users or groups that need these models.
Set up LLM clients to connect coding tools through Aperture.

Verify the provider

The best way to verify a connection to a specific model is to send a test request through the Models tab of the Aperture dashboard.

Open the Aperture dashboard and select the Models tab.
Find the model you want to test in the list of configured models. If the model is not listed, check your provider configuration and ensure the model name is correct.
Select the Play icon to the left of the model name to send a test request. If the request succeeds, the icon changes to a green check mark. If it fails, the icon changes to a red "X".

This sends a request from your web browser to the tailnet to verify that Aperture can successfully route requests to the model through the configured provider and that your user account has the necessary permissions to access the model.

Next steps

Grant model access to users or groups that need these models.
Set up LLM clients to connect coding tools through Aperture.
Refer to the provider compatibility reference for the full list of compatibility flags and configuration options.