Skip to content

VectifyAI/pageindex-mcp

Repository files navigation

PageIndex MCP

A Model Context Protocol (MCP) server for PageIndex - Next-Generation Reasoning-based RAG.

What is PageIndex?

PageIndex is a revolutionary document processing system that uses reasoning-based RAG instead of traditional vector-based similarity search. Unlike conventional RAG systems that rely on semantic similarity, PageIndex uses multi-step reasoning and tree search to retrieve information like a human expert would.

Key Advantages over Vector-based RAG

  • Higher Accuracy: Relevance beyond similarity - ideal for domain-specific documents where semantics are similar
  • Better Transparency: Clear reasoning trajectory with traceable search paths
  • Like A Human: Retrieve information like a human expert navigates documents
  • No Vector DB: No extra infrastructure overhead
  • No Chunking: Preserve full document context and structure
  • No Top-K: Retrieve all relevant passages automatically

Features

  • Local PDF Processing: Upload local PDF files directly without manual uploads
  • URL Support: Process documents from URLs
  • Full PageIndex Integration: Access all PageIndex capabilities (OCR, tree generation, reasoning-based retrieval)
  • Secure Authentication: API key authentication with PageIndex platform
  • TypeScript: Full type safety with MCP SDK
  • Desktop Extension (DXT): One-click installation for Claude Desktop with secure configuration

Usage

Getting Started

First, you'll need to create an API key:

  1. Visit https://dash.pageindex.ai/api-keys
  2. Create a new API key for your application
  3. Copy the API key for use in the configuration below

For Claude Desktop (Recommended)

One-Click Installation with Desktop Extension (DXT):

  1. Download the latest .dxt file from Releases
  2. Double-click the .dxt file to install automatically in Claude Desktop
  3. Enter your PageIndex API key in the simple configuration interface

Benefits of DXT Installation:

  • No technical setup - just download and double-click
  • Secure configuration - API keys stored securely by Claude Desktop
  • Automatic updates - extensions update seamlessly
  • Full local PDF support - upload and process PDFs directly from your computer

This is the easiest way to get started with PageIndex's reasoning-based RAG capabilities.

For Other MCP-Compatible Clients

Option 1: Local MCP Server (with local PDF upload)

Requirements: Node.js ≥18.0.0

Add to your MCP configuration:

{
  "mcpServers": {
    "pageindex": {
      "command": "npx",
      "args": ["-y", "pageindex-mcp"],
      "env": {
        "PAGEINDEX_API_KEY": "<YOUR_PAGEINDEX_API_KEY>"
      }
    }
  }
}

Option 2: Remote MCP Server

Alternatively, connect directly to PageIndex without this wrapper:

{
  "mcpServers": {
    "pageindex": {
      "type": "http",
      "url": "https://dash.pageindex.ai/api/mcp",
      "headers": {
        "Authorization": "Bearer <YOUR_PAGEINDEX_API_KEY>"
      }
    }
  }
}

Note: Option 1 provides local PDF upload capabilities, while Option 2 connects directly to PageIndex but requires manual PDF uploads via the dashboard.

Available Tools

Tool Description Key Parameters
process_document Upload and process PDF documents from local files or URLs with PageIndex OCR and tree generation url - Local file path or PDF URL
recent_documents Get recent documents with status overview limit (optional)
get_document Get basic document info and status doc_id
get_document_structure Extract hierarchical document structure with configurable detail levels doc_id, max_depth (1-10)
get_page_content Extract specific page content from processed documents doc_id, pages ("5", "3-7", "1,5,10")
remove_document Permanently delete documents and associated data doc_ids (array of document IDs)

Quick Example: Process a local PDF with process_document, then extract content with get_page_content using the returned document ID.

License

This project is licensed under the terms of the MIT open source license. Please refer to MIT for the full terms.

About

MCP server for PageIndex

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •