RAG on Neon Serverless Postgres

This project creates a web-based chat application with an API backend that leverages OpenAI chat models to answer questions using Neon Serverless Postgres. The frontend is built with React and FluentUI, while the backend is written in Python (FastAPI).

This project is designed for deployment on Azure, hosting:

The app in Azure Container Apps
The database in Neon Serverless Postgres
AI models in Azure OpenAI

Features

Hybrid search on a Neon database table, combining:
- pgvector extension for vector search
- Full-text search for text-based ranking
- Reciprocal Rank Fusion (RRF) for combining results
OpenAI function calling to dynamically filter queries.
OpenAI embeddings for efficient RAG (Retrieval-Augmented Generation).

Architecture diagram

Azure Resources Used

Service	Purpose
Neon Serverless Postgres	Stores and queries structured data
Azure OpenAI	Embedding and chat models for RAG
Azure Container Apps	Deploys the API backend and frontend
Azure Container Registry	Stores and manages containerized images for frontend and backend
Azure Developer CLI	Automates deployment and resource management
Azure Log Analytics	Monitors application logs and metrics

Getting started

You have a few options for getting started with this template. The quickest way to get started is GitHub Codespaces, since it will setup all the tools for you, but you can also set it up locally.

GitHub Codespaces

You can run this template virtually by using GitHub Codespaces. The button will open a web-based VS Code instance in your browser:

Open the template (this may take several minutes):
Open a terminal window
Continue with the deployment steps

VS Code Dev Containers

A related option is VS Code Dev Containers, which will open the project in your local VS Code using the Dev Containers extension:

Start Docker Desktop (install it if not already installed)
Open the project:
In the VS Code window that opens, once the project files show up (this may take several minutes), open a terminal window.
Continue with the deployment steps

Local Environment

Make sure the following tools are installed:

Clone this repository locally or fork it to your Github account.

git clone https://github.com/neondatabase-labs/rag-neon-postgres-openai-azure-python

Open the project folder

cd rag-neon-postgres-openai-azure-python

(Optional)Create a Virtual Environment and activate it
```
python3 -m venv .venv
```
```
source .venv/bin/activate
```

Install required Python packages and backend application:

pip install -r requirements-dev.txt
pip install -e src/backend

Continue with the deployment steps

Deployment

Once you've opened the project in Codespaces, Dev Containers, or locally, you can deploy it to Azure.

Sign in to your Azure account:
```
azd auth login
```
For GitHub Codespaces users, if the previous command fails, try:
```
 azd auth login --use-device-code
```
Create a new azd environment:
```
azd env new
```
This will create a folder under .azure/ in your project to store the configuration for this deployment. You may have multiple azd environments if desired.
(Optional) If you would like to customize the deployment to use existing Azure resources, you can set the values now.
Provision the resources and deploy the code:
```
azd up
```
You will be asked to select two locations, first a region for most of the resources, then a region specifically for the Azure OpenAI models. This project uses the gpt-4o-mini and text-embedding-ada-002 models which may not be available in all Azure regions. Check for up-to-date region availability and select a region accordingly.

Local Development

Setting up the environment file

Obtain Neon Database Credentials
- From Azure portal, find the Neon Serverless Postgres Organization service and click on Portal URL.
- This brings you to the Neon Console
- Click “New Project”
- Choose an Azure region
- Give your project a name (e.g., “Neon RAG Python”)
- Click “Create Project”
- Once the project is created successfully, copy the Neon connection string. You can find the connection details in the Connection Details widget on the Neon Dashboard.
```
postgresql://[user]:[password]@[neon_hostname]/[dbname]?sslmode=require
```

Store NeonDB Credentials in .env

Copy .env.sample to .env and update:

POSTGRES_HOST=[neon_hostname]
POSTGRES_USERNAME=[user]
POSTGRES_PASSWORD=[password]
POSTGRES_DATABASE=[dbname]
POSTGRES_SSL=require

Set Azure OpenAI deployed values

Since the local app uses OpenAI models, you should first deploy it for the optimal experience.
- To use Azure OpenAI, set OPENAI_CHAT_HOST and OPENAI_EMBED_HOST to "azure". Then fill in the values of AZURE_OPENAI_ENDPOINT and AZURE_OPENAI_CHAT_DEPLOYMENT based on the deployed values. You can display the values using this command:
```
azd env get-values
```
- To use OpenAI.com OpenAI, set OPENAI_CHAT_HOST and OPENAI_EMBED_HOST to "openai". Then fill in the value for OPENAICOM_KEY.
- To use Ollama, set OPENAI_CHAT_HOST to "ollama". Then update the values for OLLAMA_ENDPOINT and OLLAMA_CHAT_MODEL to match your local setup and model. We recommend using "llama3.1" for the chat model, since it has support for function calling, and "nomic-embed-text" for the embedding model, since the sample data has already been embedded with this model. If you cannot use function calling, then turn off "Advanced flow" in the Developer Settings. If you cannot use the embedding model, then turn off vector search in the Developer Settings.

Running the frontend and backend

Run these commands to install the web app as a local package (named fastapi_app), set up the local database, and seed it with test data:

python -m pip install -r src/backend/requirements.txt
python -m pip install -e src/backend
python ./src/backend/fastapi_app/setup_postgres_database.py
python ./src/backend/fastapi_app/setup_postgres_seeddata.py

Build the frontend:
```
cd src/frontend
npm install
npm run build
cd ../../
```
There must be an initial build of static assets before running the backend, since the backend serves static files from the src/static directory.
Run the FastAPI backend (with hot reloading). This should be run from the root of the project:
```
python -m uvicorn fastapi_app:create_app --factory --reload
```
Or you can run "Backend" in the VS Code Run & Debug menu.
Run the frontend (with hot reloading):
```
cd src/frontend
npm run dev
```
Or you can run "Frontend" or "Frontend & Backend" in the VS Code Run & Debug menu.
Open the browser at http://localhost:5173/ and you will see the frontend.

Costs

Pricing may vary per region and usage. Exact costs cannot be estimated.

Neon Serverless Postgres: Free US$0.00/month, free plan includes 10 projects, 0.5 GB storage, 190 compute hours, autoscaling up to 2 CU, read replicas, 90+ Postgres extensions including pgvector extension.

You may try the Azure pricing calculator for the resources below:

Azure Container Apps: Pay-as-you-go tier. Costs based on vCPU and memory used. Pricing
Azure OpenAI: Standard tier, GPT and Ada models. Pricing per 1K tokens used, and at least 1K tokens are used per question. Pricing
Azure Monitor: Pay-as-you-go tier. Costs based on data ingested. Pricing

Security guidelines

This template uses Managed Identity for authenticating to the Azure services used such as Azure OpenAI.

Additionally, we have added a GitHub Action that scans the infrastructure-as-code files and generates a report containing any detected issues. To ensure continued best practices in your own repository, we recommend that anyone creating solutions based on our templates ensure that the Github secret scanning setting is enabled.

Guidance

Further documentation is available in the docs/ folder:

Please post in the issue tracker with any questions or issues.

Resources

🚀 Start Building AI-Powered RAG with Neon Today! 🚀

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
docs		docs
evals		evals
infra		infra
scripts		scripts
src		src
tests		tests
.env.sample		.env.sample
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md
azure.yaml		azure.yaml
locustfile.py		locustfile.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG on Neon Serverless Postgres

Table of Content

Features

Architecture diagram

Azure Resources Used

Getting started

GitHub Codespaces

VS Code Dev Containers

Local Environment

Deployment

Local Development

Setting up the environment file

Running the frontend and backend

Costs

Security guidelines

Guidance

Resources

About

Languages

License

neondatabase-labs/rag-neon-postgres-openai-azure-python

Folders and files

Latest commit

History

Repository files navigation

RAG on Neon Serverless Postgres

Table of Content

Features

Architecture diagram

Azure Resources Used

Getting started

GitHub Codespaces

VS Code Dev Containers

Local Environment

Deployment

Local Development

Setting up the environment file

Running the frontend and backend

Costs

Security guidelines

Guidance

Resources

About

Topics

Resources

License

Stars

Watchers

Forks

Languages