A modern RAG (Retrieval-Augmented Generation) application with separated frontend and backend architecture using LangGraph, FastAPI, and Streamlit.
- FastAPI: RESTful API server
- LangGraph: Workflow orchestration for RAG pipeline
- FAISS: Vector storage for document embeddings
- Ollama: LLM integration for response generation
- Ollama (nomic-embed-text): Text embeddings
- Streamlit: Web interface for document upload and chat
- API Client: HTTP client for backend communication
- 📄 PDF document upload and processing
- 🔍 Semantic search with vector embeddings
- 💬 Chat interface with context-aware responses
- 🤖 Integration with Ollama models (Llama2)
- 🔄 LangGraph workflow for RAG pipeline
- 🌐 Separated frontend/backend architecture
- 📊 Real-time document chunking and indexing
- 🎯 Session-based conversation continuity
- Ollama Installation: Install and start Ollama

  ```bash
  # Install Ollama
  curl -fsSL https://ollama.ai/install.sh | sh

  # Start the Ollama server
  ollama serve

  # Pull a model
  ollama pull llama2
  ```
- Python Environment: Python 3.9+

- Backend Setup:

  ```bash
  cd backend
  pip install -r requirements.txt
  ```

- Frontend Setup:

  ```bash
  cd frontend
  pip install -r requirements.txt
  ```
```bash
cd backend
python main.py
```

The backend will start on http://localhost:8000.
```bash
cd frontend
streamlit run app.py
```

The frontend will start on http://localhost:8501.
- `GET /`: Health check
- `POST /upload`: Upload and process PDF documents
- `POST /chat`: Chat with the RAG system
- `GET /documents`: Get document information
- `DELETE /documents`: Clear all documents
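The endpoints can also be exercised from Python. The sketch below is a minimal hypothetical client, assuming the `requests` library and the default backend URL; the actual frontend ships its own `api_client.py`, which may differ.

```python
"""Minimal sketch of a client for the backend API (illustrative names only)."""
import requests


class RAGClient:
    def __init__(self, base_url: str = "http://localhost:8000"):
        self.base_url = base_url.rstrip("/")

    def _url(self, path: str) -> str:
        # Join the base URL with an endpoint path.
        return f"{self.base_url}/{path.lstrip('/')}"

    def upload(self, pdf_path: str) -> dict:
        # POST /upload expects multipart/form-data with a `file` field.
        with open(pdf_path, "rb") as f:
            resp = requests.post(self._url("/upload"), files={"file": f})
        resp.raise_for_status()
        return resp.json()

    def chat(self, message: str, model: str = "llama2") -> dict:
        # POST /chat takes a JSON body with `message` and `model`.
        resp = requests.post(
            self._url("/chat"),
            json={"message": message, "model": model},
        )
        resp.raise_for_status()
        return resp.json()
```

With the backend running, `RAGClient().chat("What is this document about?")` mirrors the second curl example above.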
```bash
curl -X POST "http://localhost:8000/upload" \
  -H "accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -F "file=@document.pdf"
```

```bash
curl -X POST "http://localhost:8000/chat" \
  -H "accept: application/json" \
  -H "Content-Type: application/json" \
  -d '{
    "message": "What is this document about?",
    "model": "llama2"
  }'
```

The RAG pipeline is implemented using LangGraph with the following nodes:
- Retrieve: Search for relevant documents using vector similarity
- Generate: Create response using Ollama with retrieved context
- Format: Format the final response with sources
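The three nodes can be sketched as plain Python functions passing a shared state dict. In the real app they are LangGraph nodes wired into a graph in `rag_workflow.py`; here the retrieval and the Ollama call are stubbed out and all function and key names are illustrative, not the app's actual code.

```python
"""Plain-Python sketch of the three-node RAG pipeline (stubs, not the real app)."""

def retrieve(state: dict) -> dict:
    # Vector-similarity search over the FAISS index (stubbed here).
    state["docs"] = [
        {"text": "Example chunk about the topic.", "source": "document.pdf"}
    ]
    return state

def generate(state: dict) -> dict:
    # Build a prompt from the retrieved context and call Ollama (stubbed here).
    context = "\n".join(d["text"] for d in state["docs"])
    state["answer"] = f"(model answer grounded in: {context[:40]}...)"
    return state

def format_response(state: dict) -> dict:
    # Attach the deduplicated source list to the final answer.
    sources = sorted({d["source"] for d in state["docs"]})
    state["response"] = {"answer": state["answer"], "sources": sources}
    return state

def run_pipeline(message: str) -> dict:
    # Sequential stand-in for compiling and invoking the LangGraph graph.
    state = {"message": message}
    for node in (retrieve, generate, format_response):
        state = node(state)
    return state["response"]
```

In LangGraph itself these functions would be registered with `add_node` and connected with edges, then compiled into a runnable graph.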
Backend:

- `OLLAMA_HOST`: Ollama server host (default: `http://localhost:11434`)

Frontend:

- `API_BASE_URL`: Backend API URL (default: `http://localhost:8000`)
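Read at startup, these settings might look like the following sketch; only the variable names and defaults come from this README, and how the services actually consume them is an assumption.

```python
"""Sketch of reading the two documented settings with their defaults."""
import os

def ollama_host() -> str:
    # Used by the backend when connecting to the Ollama server.
    return os.getenv("OLLAMA_HOST", "http://localhost:11434")

def api_base_url() -> str:
    # Used by the frontend's API client to reach the backend.
    return os.getenv("API_BASE_URL", "http://localhost:8000")
```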
- Embedding model: `nomic-embed-text`
- Chunk size: 1000 characters with 200-character overlap
- Search results: 4 most relevant chunks
- Default LLM: `llama2`
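The chunking parameters translate to a sliding window: each 1000-character chunk starts 800 characters after the previous one, so consecutive chunks share 200 characters. The app likely delegates this to a library splitter; the stand-alone sketch below just illustrates the arithmetic.

```python
"""Sketch of 1000-character chunks with 200 characters of overlap."""

def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> list[str]:
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    step = size - overlap  # each chunk starts `step` characters after the last
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]
```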
```
├── backend/
│   ├── main.py                # FastAPI application
│   ├── services/
│   │   ├── rag_workflow.py    # LangGraph RAG workflow
│   │   ├── pdf_service.py     # PDF processing
│   │   ├── vector_service.py  # Vector storage
│   │   └── ollama_service.py  # Ollama integration
│   └── requirements.txt
├── frontend/
│   ├── app.py                 # Streamlit application
│   ├── api_client.py          # API client
│   └── requirements.txt
└── README.md
```
- Ollama Connection: Ensure Ollama is running on `localhost:11434`
- Model Not Found: The app will automatically pull models if they are available
- Vector Storage: FAISS runs in-memory by default, so the index is lost on restart
- API Connection: Ensure the backend is running on `localhost:8000`
- CORS Issues: The backend includes CORS middleware for frontend access
- File Upload: Check file size limits and confirm the file is a valid PDF
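A quick connectivity check covers the first two classes of problems. This sketch uses only the standard library; the URLs are the defaults documented above.

```python
"""Quick connectivity check for the two local services (sketch)."""
from urllib.request import urlopen
from urllib.error import HTTPError, URLError

def is_up(url: str, timeout: float = 2.0) -> bool:
    # Returns True if anything is listening and answers HTTP at this URL.
    try:
        urlopen(url, timeout=timeout)
        return True
    except HTTPError:
        return True   # an HTTP error status still proves the server is up
    except (URLError, OSError):
        return False  # connection refused, DNS failure, or timeout

if __name__ == "__main__":
    print("Ollama:", is_up("http://localhost:11434"))
    print("Backend:", is_up("http://localhost:8000"))
```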
- Use `uvicorn main:app --reload` for backend development
- Use `streamlit run app.py --server.runOnSave true` for frontend development, so Streamlit reruns the app when source files change
- Check the browser console for API errors
