Local RAG Setup

A minimal retrieval-augmented generation (RAG) implementation built on LangChain and FAISS, with support for either Ollama (local) or OpenAI (API key required) as the LLM backend.

Dependencies

  • langchain - Core framework
  • langchain-community - Document loaders and vector stores
  • langchain-ollama - Ollama integration
  • langchain-openai - OpenAI integration
  • langchain-text-splitters - Text splitting
  • langchain-huggingface - HuggingFace embeddings
  • faiss-cpu - Vector search
  • sentence-transformers - Embeddings
  • pypdf - PDF loading
  • fastapi - Web server
  • uvicorn - ASGI server

Installation

conda create -n local_rag python=3.10 -y
conda activate local_rag
pip install -r requirements.txt
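
Based on the dependency list above, requirements.txt would contain roughly the following (shown unpinned; add version pins as needed for reproducibility):

```text
langchain
langchain-community
langchain-ollama
langchain-openai
langchain-text-splitters
langchain-huggingface
faiss-cpu
sentence-transformers
pypdf
fastapi
uvicorn
```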

Setup

Ollama (optional)

ollama serve
ollama pull mistral

OpenAI (optional)

Set the API key when using OpenAI:

export OPENAI_API_KEY="your-key"
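
The server can detect OpenAI availability from this variable; a minimal sketch of such a check (the function name is illustrative, not the repo's actual code):

```python
import os

def openai_available() -> bool:
    # OpenAI is usable only when a non-empty key is present in the environment
    return bool(os.environ.get("OPENAI_API_KEY"))
```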

Add Documents

Option 1: Add PDFs from a folder via script. Edit DATA_ROOT in add_pdfs.py to point at your folder, then run:

python add_pdfs.py

The script clears the existing vector store and recursively indexes all supported documents under the folder. Supported extensions: .pdf, .txt, .md.
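
The recursive discovery step can be sketched like this (extensions taken from the list above; the helper name is hypothetical, not the script's actual code):

```python
from pathlib import Path

SUPPORTED_EXTENSIONS = {".pdf", ".txt", ".md"}

def find_documents(root: str) -> list[Path]:
    """Recursively collect every supported file under root, in sorted order."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    )
```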

Option 2: Use local_rag.py programmatically:

from local_rag import LocalRAG

rag = LocalRAG()
# Loads, chunks, embeds, and indexes the given files into the vector store
rag.add_documents(["path/to/doc1.pdf", "path/to/doc2.txt"])

Chat GUI

Start the server:

uvicorn server:app --reload

Open http://localhost:8000. The chat UI provides:

  • Provider switch - toggle between Ollama and OpenAI without restarting the server (OpenAI requires OPENAI_API_KEY)
  • Conversation history - multi-turn chat with context carried across turns
  • Markdown rendering - assistant replies are rendered as markdown (headings, code, lists, links)

Ensure the vector store is populated and at least one provider (Ollama or OpenAI) is configured.

API

  • POST /api/chat - body: { "message": "...", "history": [...], "llm_provider": "ollama"|"openai" }
  • GET /api/providers - returns { "ollama": true, "openai": true|false }
  • GET /api/health - health and vector store status
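
A chat request can be built and sent with the standard library alone; the payload fields below come from the endpoint description above (the helper function and its defaults are illustrative):

```python
import json
import urllib.request

def chat_request(message: str, history: list, provider: str = "ollama") -> urllib.request.Request:
    """Build a POST /api/chat request for the local server."""
    if provider not in ("ollama", "openai"):
        raise ValueError("llm_provider must be 'ollama' or 'openai'")
    payload = {"message": message, "history": history, "llm_provider": provider}
    return urllib.request.Request(
        "http://localhost:8000/api/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# With the server running, send it with:
#   urllib.request.urlopen(chat_request("What does chapter 2 cover?", []))
```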

How it works

  1. Load documents - PDFs or text files via PyPDFLoader / TextLoader
  2. Chunk - RecursiveCharacterTextSplitter (2000-character chunks, 400-character overlap)
  3. Embed - sentence-transformers/all-MiniLM-L6-v2
  4. Store - FAISS vector store (similarity search with scores)
  5. Query - retrieve relevant chunks, optionally rephrase the question with conversation history, and generate an answer with the selected LLM
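
The chunking step uses LangChain's RecursiveCharacterTextSplitter, which prefers splitting on paragraph and sentence boundaries before falling back to raw characters. The size/overlap arithmetic alone can be illustrated with a plain sliding window (a simplification of what the real splitter does, not the project's actual code):

```python
def sliding_chunks(text: str, size: int = 2000, overlap: int = 400) -> list[str]:
    """Character-window chunking: each chunk shares `overlap` chars with the next."""
    step = size - overlap  # 1600-character stride between chunk starts
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]
```

With the defaults, a 5000-character document yields three chunks starting at offsets 0, 1600, and 3200, each overlapping its neighbor by 400 characters so that context is not cut off mid-sentence at chunk boundaries.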