Skip to content

Use Open WebUI with amazee.ai

Open WebUI is a self-hosted AI interface that supports any OpenAI-compatible backend. Connect it to amazee.ai to run Claude and other models through your regional endpoint, with optional pgvector support for retrieval-augmented generation (RAG).

Prerequisites


Connect amazee.ai LLM

  1. Go to Admin Settings
  2. Navigate to Connections → OpenAI
  3. Click Add Connection
  4. Fill in:
Field Value
API Base URL https://llm.[your-region].amazee.ai/v1
API Key your amazee.ai API key
  1. Click Save

You should now see your available models (e.g. claude-sonnet-4-5, claude-haiku-4-5) in the model selector.

See Available Models for the full list.


Connect pgvector (optional)

amazee.ai provides a managed pgvector database for embeddings and RAG workflows. To get pgvector credentials for your workspace, contact ai.support@amazee.io.

Open WebUI configures vector databases via environment variables. Add these at startup:

VECTOR_DB=pgvector
PGVECTOR_DB_URL=postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE

With Docker Compose:

environment:
  - OPENAI_API_BASE_URL=https://llm.[your-region].amazee.ai/v1
  - OPENAI_API_KEY=your-api-key-here
  - VECTOR_DB=pgvector
  - PGVECTOR_DB_URL=postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE

Restart Open WebUI after making changes.


Verify your setup

LLM : Send a test message. You should get a response from the model you selected.

Embeddings : Try adding a document to a collection in Open WebUI — it uses the embedding model automatically.

pgvector : Enable RAG on a collection and ask a question about a document you've uploaded. A working pgvector connection returns grounded answers.


Troubleshooting

No models appear after saving the connection : Double-check the base URL includes /v1. Try https://llm.[your-region].amazee.ai/v1/models in your browser — you should get a JSON model list (you'll need your API key as a Bearer token to authenticate).

Embeddings fail : Confirm the embedding model ID in Open WebUI's RAG settings matches an available model. See Available Models.

pgvector connection refused : Verify the host, port (5432), username, password, and database name from your portal. The pgvector instance must be in the same region as your LLM endpoint.