For the complete documentation index, see llms.txt. This page is also available as Markdown.

Local RAG

A fully local RAG (Retrieval-Augmented Generation) system. Give it any text file, ask it questions. Everything stays local.

  • Encoderfile handles embedding locally.

  • Llamafile runs the LLM locally.

  • NumPy handles similarity search in memory.

This is a good fit for offline environments, sensitive documents, or anywhere you need a simple, self-contained question-answering system without cloud dependencies.

Check out the full code and instructions in GitHub.

Last updated