Skip to content

Use AnythingLLM with amazee.ai

AnythingLLM is a full-stack, self-hosted AI application for document chat, RAG, and AI agents. Connect it to amazee.ai to use Claude and other models with your documents, all processed in your chosen region.

Prerequisites


Connect amazee.ai as the LLM provider

  1. Open AnythingLLM and go to Settings → LLM Preference
  2. Select Generic OpenAI (or Custom OpenAI API compatible depending on your version)
  3. Fill in:
Field Value
Base URL https://llm.[your-region].amazee.ai/v1
API Key your amazee.ai API key
Model claude-sonnet-4-5 (or your preferred model)
Token context window 200000 for Claude Sonnet/Opus; 200000 for Haiku
  1. Click Save changes

See Available Models for the full list.


Connect pgvector for embeddings (optional)

AnythingLLM can use pgvector as its vector database. To get pgvector credentials for your workspace, contact ai.support@amazee.io.

  1. Go to Settings → Vector Database
  2. Select PGVector
  3. Fill in:
Field Value
Connection string postgresql://USERNAME:PASSWORD@vectordb.[your-region].amazee.ai:5432/DATABASE
  1. Click Save changes

Embedding model

Go to Settings → Embedding Preference, select Generic OpenAI, and use:

Field Value
Base URL https://llm.[your-region].amazee.ai/v1
API Key your amazee.ai API key
Model amazon.titan-embed-text-v2:0

Verify

Upload a document to a workspace and ask a question about it. A working setup returns an answer grounded in the document's content.


Troubleshooting

"LLM did not respond" errors : Check that the base URL includes /v1 and the model ID exactly matches what's in Available Models.

Embeddings not working : Make sure the embedding model field is set — AnythingLLM won't use the chat model for embeddings automatically.

Slow responses on large documents : This is normal for large context windows. Claude models support up to 200k tokens. For very large documents, consider chunking them into multiple uploads.