Introduction
bigRAG is an open-source, self-hostable RAG platform with Turbopuffer at the retrieval layer. It provides a complete pipeline for document ingestion, chunking, embedding, and search behind a REST API, TypeScript/Python SDKs, an admin UI, and an MCP server.
Key Features
- End-to-end RAG pipeline — upload documents, auto-chunk, embed, search, and rerank in one platform
- Any document format — PDF, DOCX, PPTX, XLSX, HTML, Markdown, CSV/TSV/XML/JSON, and scanned images (OCR) via Docling
- Any embedding model — OpenAI, Cohere, Voyage, or `openai_compatible` gateways (Ollama, vLLM, TEI, LiteLLM, Azure, Bedrock) configured per collection
- Turbopuffer search backend — vectors, chunk text, payload metadata, BM25 keyword search, and hybrid retrieval in one managed backend
- Namespace isolation — every collection maps to a Turbopuffer namespace for scoped writes, exports, deletion, and retrieval
- Three search modes — semantic, keyword, and hybrid search with optional Cohere reranking
- First-class auth — admin accounts, session cookies, scoped `bigrag_sk_…` API keys, and full audit/access logs
- Operational features — batch ingestion, S3/R2 prefix sync, webhooks with HMAC signatures, metadata-schema validation, an evaluation runner, and per-collection query analytics
- Admin UI — first-run setup, collection browser, stateless chat, connector management, access logs, MCP credentials, and API-key minting
- Self-hostable control plane — Docker Compose for the API, admin UI, Postgres, and Redis
- MIT licensed — run it anywhere, forever free
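The HMAC-signed webhooks listed among the operational features let a receiver confirm that a payload really came from its bigRAG instance. The following is a minimal sketch of the receiving side, not bigRAG's documented scheme: it assumes the signature is a hex-encoded HMAC-SHA256 of the raw request body computed with a shared secret, and the names `sign_payload` and `verify_signature` are hypothetical. Check the webhook documentation for the actual header name, encoding, and hash algorithm.

```python
import hashlib
import hmac


def sign_payload(secret: bytes, body: bytes) -> str:
    """Compute a hex-encoded HMAC-SHA256 digest of the raw body (assumed scheme)."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify_signature(secret: bytes, body: bytes, received: str) -> bool:
    """Compare the received signature with a freshly computed one."""
    expected = sign_payload(secret, body)
    # compare_digest runs in constant time, which avoids leaking
    # how many leading characters of the signature matched.
    return hmac.compare_digest(expected.encode(), received.encode())


if __name__ == "__main__":
    secret = b"example-shared-secret"  # placeholder, not a real credential
    body = b'{"event": "example"}'     # hypothetical payload shape
    signature = sign_payload(secret, body)
    print(verify_signature(secret, body, signature))         # True
    print(verify_signature(secret, body + b" ", signature))  # False
```

Whatever the real scheme turns out to be, verify against the raw bytes of the request body before any JSON parsing, since re-serializing the payload can change whitespace and invalidate the signature.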
Architecture
| Component | Purpose | Default Address |
|---|---|---|
| bigRAG API | REST API server (FastAPI) | http://localhost:4000 |
| Admin UI | Admin web app | http://localhost:3000 |
| PostgreSQL | Users, sessions, API keys, metadata, audit | localhost:5432 |
| Turbopuffer | Vector storage, chunk text, metadata filters, BM25 keyword search, and hybrid retrieval | Managed service |
| Redis | Ingestion + event bus | localhost:6379 |
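As a quick sanity check of a local deployment, the default addresses in the table can be probed with a short script. This is a minimal sketch: the `DEFAULTS` mapping simply restates the table, while `parse_address` and `is_reachable` are hypothetical helper names. A successful TCP connection only shows that something is listening on the port, not that the service is healthy. Turbopuffer is omitted because it is a managed service with no local address.

```python
import socket
from urllib.parse import urlsplit

# Default local addresses, copied from the architecture table above.
DEFAULTS = {
    "bigRAG API": "http://localhost:4000",
    "Admin UI": "http://localhost:3000",
    "PostgreSQL": "localhost:5432",
    "Redis": "localhost:6379",
}


def parse_address(address: str) -> tuple[str, int]:
    """Split 'host:port' or 'http://host:port' into (host, port)."""
    # urlsplit only parses a network location when it is introduced by '//'.
    parts = urlsplit(address if "://" in address else f"//{address}")
    if parts.hostname is None or parts.port is None:
        raise ValueError(f"expected host and port in {address!r}")
    return parts.hostname, parts.port


def is_reachable(address: str, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to the address succeeds."""
    host, port = parse_address(address)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    for name, address in DEFAULTS.items():
        status = "up" if is_reachable(address) else "unreachable"
        print(f"{name:<12} {address:<24} {status}")
```

If the ports have been remapped in a local setup, edit `DEFAULTS` to match before running the check.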
Explore
- Installation: Set up bigRAG on your machine
- Quickstart: Configure Turbopuffer, then upload and search your first document
- Turbopuffer Search: Semantic, keyword, and hybrid retrieval on Turbopuffer
- Admin UI: A tour of the admin UI covering setup, chat, keys, and webhooks
- Security & Auth: Admin accounts, sessions, API keys, audit
- API Reference: Complete REST API documentation
- TypeScript SDK: Zero-dependency client with full type safety
- MCP Server: Plug bigRAG into Claude Desktop, Cursor, or any MCP runtime
- Deployment: Deploy with Docker Compose