Paper Snap

Paper Snap is a state-of-the-art research paper analysis platform designed to democratize access to academic knowledge. By leveraging the power of Retrieval-Augmented Generation (RAG) and high-performance Large Language Models (LLMs), it transforms dense, complex academic PDFs into clear, actionable insights.

Whether you are a researcher, student, or curious learner, Paper Snap helps you understand papers faster by providing high-fidelity summaries and an interactive Q&A assistant that answers questions based strictly on the source text—eliminating hallucinations.

Why Paper Snap?

Reading research papers is often time-consuming and cognitively demanding. Paper Snap solves this by:

Breaking Down Complexity: Automatically extracting and summarizing core sections like Abstract, Methodology, Results, and Conclusions.
Ensuring Accuracy: Using a specialized RAG pipeline with Cohere Reranking to ensure that answers are grounded in the most legally relevant parts of the document.
Speed: Utilizing Groq's LPU™ Inference Engine to deliver near-instant summaries and answers, even for long papers (using llama-3.3-70b-versatile).

Key Features

Advanced RAG Pipeline

Smart Ingestion: Uses PyPDFLoader and specialized parsing logic to segment papers into meaningful semantic chunks.
Hybrid Search: Combines Supabase's pgvector for semantic search with Cohere's Rerank v3.0 model to prioritize the most accurate context chunks before sending them to the LLM.
Contextual Awareness: Filters knowledge bases by file_id, ensuring the AI only calls upon the specific paper you are currently analyzing.

Intelligent Summarization

Targeted Extraction: Specifically isolates critical sections (Abstract, Methodology, Results, Conclusion) to generate a summary that matters.
Large Context Processing: Capable of processing vast amounts of text to generate a cohesive overview without "tunnel vision."

Performance-First Architecture

Groq Inference: Powered by the llama-3.3-70b-versatile model running on Groq hardware for blazing-fast response times.
Cohere Embeddings: Uses embed-english-v3.0 for industry-leading semantic understanding of academic text.
Supabase Vector Store: Scalable and secure storage for document embeddings.

Tech Stack

Frontend

React.js: Component-based UI library.
React Router: For seamless navigation.
CSS3: Custom responsive styling.
Supabase Client: For authentication and data interactions.

Backend

Python & Flask: robust API server.
LangChain: Framework for building LLM applications.
Cohere: High-quality text embeddings (embed-english-v3.0).
Groq: Ultra-fast LLM inference engine.
LangExtract: For specialized unstructured data extraction.

Database & Storage

Supabase: PostgreSQL database with pgvector extension for vector storage and file management.

Getting Started

Follow these instructions to set up the project locally.

Prerequisites

Node.js & npm
Python 3.8+
Supabase Account
API Keys for: Groq, Cohere, Supabase, and LangExtract.

1. Clone the Repository

git clone https://github.com/your-username/paper-snap.git
cd paper-snap

2. Backend Setup

Navigate to the backend directory and set up the environment.

cd rag-backend
# Optional: Create a virtual environment
python -m venv venv
# Windows: venv\Scripts\activate
# Mac/Linux: source venv/bin/activate

# Install dependencies (ensure you have a requirements.txt, otherwise install manually)
pip install flask flask-cors langchain langchain-groq langchain-cohere langchain-community supabase python-dotenv langextract

Configure Environment Variables: Create a .env file in the rag-backend directory:

SUPABASE_URL=your_supabase_url
SUPABASE_SERVICE_ROLE_KEY=your_supabase_service_key
COHERE_API_KEY=your_cohere_api_key
GROQ_API_KEY=your_groq_api_key
LANGEXTRACT_API_KEY=your_langextract_api_key

Run the Backend:

python main.py

The server typically runs on http://localhost:5000.

3. Frontend Setup

Open a new terminal and navigate to the frontend directory.

cd my-react-app

# Install dependencies
npm install

# Run the development server
npm start

The application should now be running on http://localhost:3000.

Contributors

_Dr-Venom29

_SriharshaG05

_{vislavathmahesh}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
my-react-app		my-react-app
node_modules		node_modules
rag-backend		rag-backend
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paper Snap

Why Paper Snap?

Key Features

Advanced RAG Pipeline

Intelligent Summarization

Performance-First Architecture

Tech Stack

Frontend

Backend

Database & Storage

Getting Started

Prerequisites

1. Clone the Repository

2. Backend Setup

3. Frontend Setup

Contributors

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Paper Snap

Why Paper Snap?

Key Features

Advanced RAG Pipeline

Intelligent Summarization

Performance-First Architecture

Tech Stack

Frontend

Backend

Database & Storage

Getting Started

Prerequisites

1. Clone the Repository

2. Backend Setup

3. Frontend Setup

Contributors

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages