This project demonstrates an Applied AI implementation of Retrieval-Augmented Generation (RAG), focusing on retrieval quality, system design tradeoffs, and debuggability rather than model novelty.
The system ingests PDF documents, converts them into semantically meaningful representations, retrieves relevant context using vector search, and generates grounded answers using a local Large Language Model (LLM).
On their own, Large Language Models perform poorly on:
- private or domain-specific documents
- long-form content exceeding context limits
- tasks requiring source attribution
This project addresses these issues by augmenting generation with retrieved, semantically relevant context, enabling grounded and explainable answers.
The ingestion pipeline (`ingest.py`):
- Load PDF documents from disk
- Chunk text into semantically coherent nodes
- Generate dense vector embeddings for each chunk
- Store vectors in FAISS and persist text + metadata (a sketch of this pipeline follows)
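A minimal sketch of what this pipeline can look like with LlamaIndex's FAISS integration; the chunk sizes, embedding model, and paths below are illustrative assumptions, not necessarily what `ingest.py` uses.

```python
# ingest.py: illustrative sketch only; model name, chunk sizes, and paths are assumptions.
import faiss
from llama_index.core import Settings, SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.core.node_parser import SentenceSplitter
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.vector_stores.faiss import FaissVectorStore

# Embedding model (384-dim vectors for all-MiniLM-L6-v2) and chunking strategy
Settings.embed_model = HuggingFaceEmbedding(model_name="sentence-transformers/all-MiniLM-L6-v2")
Settings.node_parser = SentenceSplitter(chunk_size=512, chunk_overlap=64)

# Load PDFs from ./data and build a FAISS-backed vector store
documents = SimpleDirectoryReader("data").load_data()
faiss_index = faiss.IndexFlatL2(384)  # dimension must match the embedding model
storage_context = StorageContext.from_defaults(
    vector_store=FaissVectorStore(faiss_index=faiss_index)
)

# Chunk, embed, index, and persist vectors + text + metadata to ./storage
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)
index.storage_context.persist(persist_dir="storage")
```

A flat L2 index keeps retrieval exact and easy to reason about; approximate FAISS indexes trade some recall for speed and only become worthwhile at much larger corpus sizes.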
The query pipeline (`query.py`):
- Embed the user query
- Retrieve top-k relevant chunks via FAISS
- Augment prompt with retrieved context
- Generate an answer using a local LLM
- Expose source chunks and similarity scores (see the sketch below)
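A corresponding sketch of the query path, again with illustrative names: the Ollama model, the question, and `similarity_top_k` are assumptions rather than the project's actual settings.

```python
# query.py: illustrative sketch; the Ollama model name and top-k are assumptions.
from llama_index.core import Settings, StorageContext, load_index_from_storage
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.llms.ollama import Ollama
from llama_index.vector_stores.faiss import FaissVectorStore

# Same embedding model as ingestion, plus a local LLM served by Ollama
Settings.embed_model = HuggingFaceEmbedding(model_name="sentence-transformers/all-MiniLM-L6-v2")
Settings.llm = Ollama(model="llama3", request_timeout=120.0)

# Reload the persisted FAISS index and docstore from ./storage
vector_store = FaissVectorStore.from_persist_dir("storage")
storage_context = StorageContext.from_defaults(vector_store=vector_store, persist_dir="storage")
index = load_index_from_storage(storage_context)

# Retrieve top-k chunks, augment the prompt with them, and generate an answer
query_engine = index.as_query_engine(similarity_top_k=5)
response = query_engine.query("What does the document say about termination?")

print(response)  # grounded answer
for src in response.source_nodes:  # source attribution: chunk + similarity score
    print(f"{src.score:.3f}  {src.node.metadata.get('file_name')}")
```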
Focus areas:
- Retrieval quality over model size
- Chunking strategy and overlap tuning
- Recall vs. precision tradeoffs (both illustrated in the sketch after this list)
- Source attribution and traceability
- Failure mode inspection and debugging
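Most of these tradeoffs surface as a handful of parameters. The snippet below is a hedged illustration of that tuning surface; the specific values are examples, not recommendations.

```python
# Illustrative tuning knobs; none of these values are tuned recommendations.
from llama_index.core.node_parser import SentenceSplitter
from llama_index.core.postprocessor import SimilarityPostprocessor

# Chunking: larger chunks carry more context per node, while more overlap
# reduces the chance that a relevant fact is split across a boundary,
# at the cost of a larger index and some duplicated text.
splitter = SentenceSplitter(chunk_size=512, chunk_overlap=64)

# Retrieval: a higher top-k raises recall; a similarity cutoff claws back
# precision by dropping weak matches before they reach the prompt.
top_k = 8
cutoff = SimilarityPostprocessor(similarity_cutoff=0.7)
# used as: index.as_query_engine(similarity_top_k=top_k, node_postprocessors=[cutoff])
```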
Key features:
- PDF ingestion and chunk-level indexing
- Semantic retrieval using FAISS
- Local LLM inference (no external APIs)
- Explicit source attribution for answers
- Retrieval debugging via node inspection (see the inspection sketch below)
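As an example of the last point, retrieval problems are easiest to diagnose before any generation happens. The sketch below inspects the raw retrieved nodes and their similarity scores; the query text and model name are placeholders.

```python
# Retrieval-only debugging: inspect what FAISS returns before any generation.
from llama_index.core import Settings, StorageContext, load_index_from_storage
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.vector_stores.faiss import FaissVectorStore

Settings.embed_model = HuggingFaceEmbedding(model_name="sentence-transformers/all-MiniLM-L6-v2")
vector_store = FaissVectorStore.from_persist_dir("storage")
storage_context = StorageContext.from_defaults(vector_store=vector_store, persist_dir="storage")
index = load_index_from_storage(storage_context)

# Bypass the LLM and look at each retrieved chunk, its score, and its source file
retriever = index.as_retriever(similarity_top_k=5)
for node_with_score in retriever.retrieve("Which section covers data retention?"):
    print(f"score={node_with_score.score:.3f}  "
          f"source={node_with_score.node.metadata.get('file_name')}")
    print(node_with_score.node.get_content()[:200], "...\n")
```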
Tech stack:
- Python
- LlamaIndex
- FAISS
- Sentence-Transformers
- Ollama (local LLM runtime)
Project structure:

```
rag-pdf-chat/
├── ingest.py          (Document ingestion and indexing)
├── query.py           (Retrieval and generation pipeline)
├── requirements.txt
├── README.md
├── data/              (Input PDFs)
└── storage/           (FAISS index + docstore)
```