01 0 GitHub stars
02 Retrieval-augmented generation (RAG) for grounded answers
03 PDF ingestion pipeline with text extraction, chunking, and embedding
04 Local persistent vector store (ChromaDB) for embeddings
05 Natural-language querying of PDF documents
06 Source attribution (document name, page number) for answers
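
The ingestion, retrieval, and attribution steps listed above can be illustrated with a minimal, standard-library-only sketch. It is not the project's implementation. It assumes page text has already been extracted from the PDF, it swaps in a toy bag-of-words vector for a real embedding model, and it uses an in-memory list where the project uses ChromaDB. All function names and the `policy.pdf` sample are hypothetical.

```python
import math
import re
from collections import Counter


def chunk_pages(doc_name, pages, chunk_words=50, overlap=10):
    """Split each page into overlapping word windows, keeping source metadata."""
    if overlap >= chunk_words:
        raise ValueError("overlap must be smaller than chunk_words")
    step = chunk_words - overlap
    chunks = []
    for page_number, text in enumerate(pages, start=1):
        words = text.split()
        for start in range(0, len(words), step):
            window = words[start:start + chunk_words]
            chunks.append(
                {"text": " ".join(window), "document": doc_name, "page": page_number}
            )
            if start + chunk_words >= len(words):
                break  # this window already reached the end of the page
    return chunks


def embed(text):
    """Toy stand-in for an embedding model: lowercase bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def build_index(chunks):
    """In-memory stand-in for a vector store: (vector, chunk) pairs."""
    return [(embed(c["text"]), c) for c in chunks]


def query(index, question, top_k=2):
    """Return the top_k chunks most similar to the question, with metadata."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine(q, item[0]), reverse=True)
    return [chunk for _, chunk in ranked[:top_k]]


# Hypothetical two-page document, one string per page.
pages = [
    "Invoices are due within thirty days of receipt.",
    "Refunds are processed within five business days after approval.",
]
index = build_index(chunk_pages("policy.pdf", pages))
top = query(index, "How long do refunds take?", top_k=1)[0]
print(f'{top["document"]}, page {top["page"]}: {top["text"]}')
```

Because every chunk carries its document name and page number from the moment it is created, attribution needs no extra lookup: whatever chunk is retrieved already knows where it came from. In a full RAG setup the retrieved chunks would then be passed to a language model as context for the answer.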