Use Open WebUI with amazee.ai
Open WebUI is a self-hosted AI interface that supports any OpenAI-compatible backend. Connect it to amazee.ai to run Claude and other models through your regional endpoint, with optional pgvector support for retrieval-augmented generation (RAG).
Prerequisites
- Open WebUI running (installation guide)
- An amazee.ai API key and endpoint from my.amazee.io
Connect amazee.ai LLM
- Go to Admin Settings
- Navigate to Connections → OpenAI
- Click Add Connection
- Fill in:
| Field | Value |
|---|---|
| API Base URL | https://llm.[your-region].amazee.ai/v1 |
| API Key | your amazee.ai API key |
- Click Save
You should now see your available models (e.g. claude-sonnet-4-5, claude-haiku-4-5) in the model selector.
See Available Models for the full list.
Connect pgvector (optional)
amazee.ai provides a managed pgvector database for embeddings and RAG workflows. To get pgvector credentials for your workspace, contact ai.support@amazee.io.
Open WebUI configures vector databases via environment variables. Add these at startup:
VECTOR_DB=pgvector
PGVECTOR_DB_URL=postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE
With Docker Compose:
environment:
- OPENAI_API_BASE_URL=https://llm.[your-region].amazee.ai/v1
- OPENAI_API_KEY=your-api-key-here
- VECTOR_DB=pgvector
- PGVECTOR_DB_URL=postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE
Restart Open WebUI after making changes.
Verify your setup
LLM : Send a test message. You should get a response from the model you selected.
Embeddings : Try adding a document to a collection in Open WebUI — it uses the embedding model automatically.
pgvector : Enable RAG on a collection and ask a question about a document you've uploaded. A working pgvector connection returns grounded answers.
Troubleshooting
No models appear after saving the connection
: Double-check the base URL includes /v1. Try https://llm.[your-region].amazee.ai/v1/models in your browser — you should get a JSON model list (you'll need your API key as a Bearer token to authenticate).
Embeddings fail : Confirm the embedding model ID in Open WebUI's RAG settings matches an available model. See Available Models.
pgvector connection refused : Verify the host, port (5432), username, password, and database name from your portal. The pgvector instance must be in the same region as your LLM endpoint.