🚀 End-to-End PDF RAG System

🎬 Streamlit Demo

📊 Grafana Dashboard & Prometheus Alerts

📊 Project Overview

This project is a modern Retrieval-Augmented Generation (RAG) system built to simplify document management and information access. By uploading PDF files, it provides intelligent Q&A capabilities over those documents. As an example use case, it showcases information retrieval from HR documents.

⚡ Core Components

Component	Path	Contents
🌐 API Layer	`src/api.py`	REST API endpoints, session management, monitoring & metrics collection
🧠 Core Logic	`src/helper_func.py`	PDF processing & text extraction, RAG workflow orchestration, model management/optimization, caching
🖥️ Web UI	`src/app.py`	Streamlit-based UI, document upload & management, Q&A interaction interface, result visualization
📝 Logging	`src/loki_logger.py`	Loki integration, Trace ID tracking, structured logging, performance analysis
📊 Monitoring	—	Grafana dashboards, Prometheus metrics, automatic alert rules, real-time monitoring

🎯 Goals & Features

Intelligent RAG Workflow: retrieval, rerank, reflection, multi-hop support
Performance: caching, GPU support, asynchronous processing, model warmup/preloading
Monitoring & Logging: Prometheus, Grafana, Loki integration
Scalability: containerization

✨ Tech Stack

🏗️ Architecture & Infrastructure

Python 3.12+ — modern language features and type hints
Docker & Docker Compose — containerized, reproducible services
Grafana & Prometheus — metrics collection and visualization
Loki — structured log aggregation and querying

🌐 Application Layer

FastAPI + Uvicorn — high-performance API layer
Streamlit — interactive web UI

💻 Development Environment

uv — fast package/env management and command runner (uv sync, uv run)

🚀 Setup & Run

Requirements

Python 3.12+
uv (recommended package/env manager)
Docker & Docker Compose

pip install uv # if not installed

Steps

1) Clone the Repository

git clone https://github.com/mertafacan/end-to-end-pdf-rag-system.git
cd end-to-end-pdf-rag-system

2) Configure Environment Variables

cp .env.example .env

3) Install Dependencies (uv)

# Create the environment 
uv venv

# Activate the environment

# Linux/Mac source
.venv/bin/activate

# Windows:
.venv\Scripts\activate

# Install dependencies
uv sync

4) Start Docker Services

cd config
docker-compose up -d

5) Start the Application

with uv:

cd src && uv run uvicorn api:app --port 8000 --reload
cd src && uv run streamlit run app.py

Available Services

🏗️ Project Architecture

🔧 Architecture

flowchart TB
  U[User] --> C[Streamlit Client]

  C -- Upload PDF --> INDEX[POST /index]
  INDEX --> CH[PDF / pages / chunks]
  CH --> EMB[Embedding]
  EMB --> VDB[Qdrant Vector DB]

  C -- Question --> ASK[POST /ask]
  ASK --> RET[Retriever - Qdrant]
  RET --> RER[Optional Reranker - CrossEncoder]
  RET --> LG[LangGraph - retrieve / decide / generate / reflect]
  RER --> LG
  LG --> LLM[LLM - ChatLiteLLM]
  LLM --> C

  PROM[Prometheus /metrics] --- GRAF[Grafana Dashboard]
  LOKI[Loki & Console Logs - trace_id] --- GRAF

📁 Directory Structure

src/
├── api.py              # FastAPI endpoints
├── app.py              # Streamlit UI
├── helper_func.py      # Business logic
├── loki_logger.py      # Logging system
└── uploaded_docs/      # Uploaded documents

config/
├── alert_rules.yml     # Prometheus alert rules
├── docker-compose.yml  # Docker services (Qdrant, Prometheus, Grafana, Loki)
├── loki.yml            # Loki log server configuration
└── prometheus.yml      # Prometheus metrics collection configuration

grafana/
└── provisioning/
    ├── dashboards/
    │   ├── dashboards.yml              # Dashboard provisioning
    │   ├── PDF rag-loki-logs.json      # Loki log dashboard
    │   └── rag-system-dashboard.json   # System dashboard
    └── datasources/
        └── prometheus.yml              # Prometheus & Loki data sources

🧩 Core Components & Responsibilities

`src/api.py` — API Layer

Exposes REST API endpoints, handles session management, authentication/authorization, and collects metrics.

`src/helper_func.py` — Business Logic

PDF processing and text extraction, coordination of the RAG workflow, model management/optimization, and caching.

`src/app.py` — Web UI

Document upload and management screens, Q&A interaction, and visualization of results (Streamlit).

`src/loki_logger.py` — Logging

Structured logging integrated with Loki, Trace ID tracking, and a rich log format for performance analysis.

Configuration

config/alert_rules.yml — Prometheus alert rules (FastAPI latency, error rate, Qdrant, disk/RAM).
config/prometheus.yml — Metrics collection (FastAPI, Qdrant, system).
config/loki.yml — Loki logging configuration.
config/docker-compose.yml — Services: Qdrant, Prometheus, Grafana, Loki.

Grafana

grafana/provisioning/dashboards/*.json — Automatic dashboard provisioning (logs, system, RAG).
grafana/provisioning/datasources/prometheus.yml — Prometheus & Loki data sources.

Highlights

Alerts: latency (>1s), error rate (>10%), Qdrant health, disk/RAM.
Dashboards: real-time metrics & log visualization.
Logging: structured logs, Trace ID tracking.
Metrics: HTTP requests, Qdrant queries, LLM calls, resource usage.

📬 Contact

Mert Afacan – https://www.linkedin.com/in/mert-afacan/ – mert0afacan@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
config		config
grafana/provisioning		grafana/provisioning
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🚀 End-to-End PDF RAG System

📊 Project Overview

⚡ Core Components

🎯 Goals & Features

✨ Tech Stack

🏗️ Architecture & Infrastructure

🌐 Application Layer

💻 Development Environment

🚀 Setup & Run

Requirements

Steps

1) Clone the Repository

2) Configure Environment Variables

3) Install Dependencies (uv)

4) Start Docker Services

5) Start the Application

Available Services

🏗️ Project Architecture

🔧 Architecture

📁 Directory Structure

🧩 Core Components & Responsibilities

`src/api.py` — API Layer

`src/helper_func.py` — Business Logic

`src/app.py` — Web UI

`src/loki_logger.py` — Logging

Configuration

Grafana

Highlights

📬 Contact

About

Uh oh!

Releases

Packages

Languages

mertafacan/end-to-end-pdf-rag-system

Folders and files

Latest commit

History

Repository files navigation

🚀 End-to-End PDF RAG System

📊 Project Overview

⚡ Core Components

🎯 Goals & Features

✨ Tech Stack

🏗️ Architecture & Infrastructure

🌐 Application Layer

💻 Development Environment

🚀 Setup & Run

Requirements

Steps

1) Clone the Repository

2) Configure Environment Variables

3) Install Dependencies (uv)

4) Start Docker Services

5) Start the Application

Available Services

🏗️ Project Architecture

🔧 Architecture

📁 Directory Structure

🧩 Core Components & Responsibilities

src/api.py — API Layer

src/helper_func.py — Business Logic

src/app.py — Web UI

src/loki_logger.py — Logging

Configuration

Grafana

Highlights

📬 Contact

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`src/api.py` — API Layer

`src/helper_func.py` — Business Logic

`src/app.py` — Web UI

`src/loki_logger.py` — Logging

Packages