Skip to content
All workCase study — 2026

PolicyRAG — auditable QA over SEC filings

Question-answering over SEC filings that shows its sources

ROLE
Solo build
TIMEFRAME
2026
STACK
Python, FastAPI, ChromaDB, React, Supabase
LINKS
github

cited

EVERY ANSWER, SOURCED

The problem

LLMs answer questions about SEC filings fluently — and sometimes wrongly. For analysts and compliance teams a confident wrong answer is worse than no answer, so the system has to show receipts.

Approach

PolicyRAG is a full RAG pipeline behind a chat interface. Documents are chunked into ChromaDB, retrieval goes through a reranker, and generation is constrained to cite the numbered context passages it used. Every answer is then evaluated on three axes — faithfulness (NLI-based scoring of claims against the retrieved context), citation validity, and context relevance — and the scores are surfaced in the UI next to the answer instead of hidden in logs. The backend is FastAPI with SSE streaming and per-request JWT verification through Supabase auth; the LLM provider is swappable behind one interface.

Results

[Add your evaluation numbers: faithfulness score distribution, share of answers with fully valid citations, retrieval hit rate on a held-out question set.]

What broke

[The honest section — chunking strategy? NLI false positives on numeric claims? citation drift across providers?]

PolicyRAG — auditable QA over SEC filings — Aditya Ravi