You know those API specs and internal docs you keep referencing while coding? This runs a local RAG engine over MCP so Claude can search them without sending anything to external servers. It exposes ingest_file and query tools, handles PDF and DOCX files, and uses hybrid search (BM25 plus embeddings) with optional cross-encoder reranking. Setup is one npx command and a BASE_DIR path in your MCP config. It also ships with a web UI on port 3000 for drag-and-drop uploads, plus a REST API if you want to hit it directly. It supports both stdio for local editors and a remote HTTP mode for Claude Desktop or Claude.ai connections.
claude mcp add --transport stdio robthepcguy-rag-vault uvx rag-vault
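
For a feel of what "hybrid search (BM25 plus embeddings)" can mean in practice, here is a minimal sketch of reciprocal rank fusion, one common way to merge a keyword ranking with an embedding ranking. This is an assumption for illustration only: the blurb does not say how rag-vault combines the two signals, and the function name, the constant k=60, and the file names are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document earns 1 / (k + rank) from every list it appears in,
    so documents ranked well by both retrievers rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results for a single query (not real rag-vault output).
bm25_ranking = ["api-spec.pdf", "changelog.docx", "design-notes.pdf"]
embedding_ranking = ["design-notes.pdf", "api-spec.pdf", "onboarding.docx"]

fused = reciprocal_rank_fusion([bm25_ranking, embedding_ranking])
print(fused)
```

Rank-based fusion sidesteps the fact that BM25 scores and cosine similarities live on different scales. A weighted sum of normalized scores is another plausible choice, and a cross-encoder reranker would typically run afterwards on the top few fused results.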