Open-source · Self-hosted control plane · Turbopuffer search

Turbopuffer-backed RAG for your documents

Upload documents, parse with Docling, embed with your provider, and use Turbopuffer for semantic, keyword, and hybrid retrieval through one API.

Simple integration

Upload, embed, and search in minutes

bigRAG handles the entire RAG pipeline. Upload any document format, and it automatically parses, chunks, embeds, and indexes in Turbopuffer for vector and full-text search. Use the TypeScript SDK or REST API.

TypeScript SDK with full type safety
Turbopuffer stores vectors, chunk text, metadata, and keyword indexes
Full API reference with Swagger docs at /docs
app.ts
import { BigRAG } from "@bigrag/client";

const client = new BigRAG({
  apiKey: "bigrag_sk_...",
  baseUrl: "http://localhost:4000",
});

const { results } = await client.queries.query("knowledge_base", {
  query: "What is the PTO policy?",
  top_k: 5,
});

15+

Document formats

PDF, DOCX, images, and more

1

Search backend

Turbopuffer

12+

Embedding models

OpenAI, Cohere, Voyage

5 min

To deploy

Docker Compose

Deploy bigRAG with Turbopuffer search

Run the API, admin UI, Postgres, and Redis yourself, then connect Turbopuffer for managed vector and full-text retrieval.

$ git clone https://github.com/bigint/rag.computer
$ cd rag.computer
$ docker compose up -d