Use Open WebUI with amazee.ai¶

Open WebUI is a self-hosted AI interface that supports any OpenAI-compatible backend. Connect it to amazee.ai to run Claude and other models through your regional endpoint, with optional pgvector support for retrieval-augmented generation (RAG).

Prerequisites¶

Open WebUI running (installation guide)
An amazee.ai API key and endpoint from my.amazee.io

Connect amazee.ai LLM¶

Go to Admin Settings
Navigate to Connections → OpenAI
Click Add Connection
Fill in:

Field	Value
API Base URL	`https://llm.[your-region].amazee.ai/v1`
API Key	your amazee.ai API key

Click Save

You should now see your available models (e.g. claude-sonnet-4-5, claude-haiku-4-5) in the model selector.

See Available Models for the full list.

Connect pgvector (optional)¶

amazee.ai provides a managed pgvector database for embeddings and RAG workflows. To get pgvector credentials for your workspace, contact ai.support@amazee.io.

Open WebUI configures vector databases via environment variables. Add these at startup:

VECTOR_DB=pgvector
PGVECTOR_DB_URL=postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE

With Docker Compose:

environment:
  - OPENAI_API_BASE_URL=https://llm.[your-region].amazee.ai/v1
  - OPENAI_API_KEY=your-api-key-here
  - VECTOR_DB=pgvector
  - PGVECTOR_DB_URL=postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE

Restart Open WebUI after making changes.

Verify your setup¶

LLM: Send a test message. You should get a response from the model you selected.
Embeddings: Try adding a document to a collection in Open WebUI — it uses the embedding model automatically.
pgvector: Enable RAG on a collection and ask a question about a document you've uploaded. A working pgvector connection returns grounded answers.

Troubleshooting¶

No models appear after saving the connection: Double-check the base URL includes /v1. Try https://llm.[your-region].amazee.ai/v1/models in your browser — you should get a JSON model list (you'll need your API key as a Bearer token to authenticate).
Embeddings fail: Confirm the embedding model ID in Open WebUI's RAG settings matches an available model. See Available Models.
pgvector connection refused: Verify the host, port (5432), username, password, and database name from your portal. The pgvector instance must be in the same region as your LLM endpoint.